International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 186 - Number 71 |
Year of Publication: 2025 |
Authors: Edy Saputro, Mustafid, Jatmiko Endro Suseno |
![]() |
Edy Saputro, Mustafid, Jatmiko Endro Suseno . Integrating K-Means Clustering with PoA Blockchain and IPFS for Clustered Data Synchronization: The OriBloX CDSF Approach. International Journal of Computer Applications. 186, 71 ( Mar 2025), 11-18. DOI=10.5120/ijca2025924561
Efficient data synchronization in distributed systems presents significant challenges, as centralized solutions often face limitations in scalability, bandwidth efficiency, and resilience to single points of failure. Existing blockchain and decentralized storage technologies struggle to manage frequent data updates effectively. To address these issues, OriBloX CDSF integrates K-Means clustering (optimized with the Elbow method), TF-IDF analysis, Hyperledger Besu (using QBFT PoA consensus), and IPFS to deliver a secure, scalable, and decentralized synchronization framework. Its selective synchronization mechanism optimizes bandwidth usage by retrieving only updated cluster files, reducing unnecessary data transfers by up to 70%. Using the Amazon product catalog dataset, the framework demonstrated robust clustering performance, with the Elbow method consistently identifying optimal clusters and silhouette scores reaching up to 0.114, reflecting well-separated and meaningful groupings. OriBloX’s design ensures efficient metadata synchronization, scalability, and fault tolerance, making it a reliable solution for distributed ecosystems.