University of Bahrain
Scientific Journals

Data Mining for Non-Redundant Big Data Using dynamic KMEAN

Show simple item record

dc.contributor.author Ahmed, Saja T.
dc.date.accessioned 2023-05-05T11:57:04Z
dc.date.available 2023-05-05T11:57:04Z
dc.date.issued 2023-05-05
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/4904
dc.description.abstract There is an increasing demand for techniques that can process and collect valuable information from huge data in the Big Data era. Duplicates can seriously influence data processing and data mining, so the major challenge is finding as many duplicate records as possible. Data deduplication (or Redundancy Removal) removes redundant data and stores only one copy, promoting single instance storage. The main idea suggests using K-Means clustering for big data deduplication. K-Means Clustering, a localized optimization approach, is vulnerable to the starting point chosen from the cluster's center. The K-Means Clustering technique will produce more errors and bad cluster outcomes if the center of a defective cluster is used as the starting point. The suggested deduplication solution is based on the numeric conversion of the dataset and pre-processing them to extract useful information utilized by Dynamic K-Mean clustering (DKMEAN) to categorize replicated chunks. The proposed system greatly improves dataset quality and ultimately reduces resource consumption. It outperformed Traditional K-Means (TKMEAN) in terms of the number of detected redundant chunks, accuracy, the number of iterations, and efficiency. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.subject Data Mining; Big Data; Clustering; Data Deduplication; Kmean algorithm en_US
dc.title Data Mining for Non-Redundant Big Data Using dynamic KMEAN en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/140121
dc.volume 14 en_US
dc.issue 1 en_US
dc.pagestart 1 en_US
dc.pageend 1 en_US
dc.contributor.authorcountry Iraq en_US
dc.contributor.authoraffiliation Ministry of Education en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

This item appears in the following Issue(s)

Show simple item record

All Journals


Advanced Search

Browse

Administrator Account