Academic Journal

K-Means Performance Optimization Using Rank Order Centroid (ROC) And Braycurtis Distance

التفاصيل البيبلوغرافية
العنوان: K-Means Performance Optimization Using Rank Order Centroid (ROC) And Braycurtis Distance
المؤلفون: Irwandi, Hafiz, Sitompul, Opim Salim, Sutarman, Sutarman
المصدر: Sinkron : jurnal dan penelitian teknik informatika; Vol. 6 No. 2 (2022): Articles Research Volume 6 Issue 2, April 2022; 472-478 ; 2541-2019 ; 2541-044X ; 10.33395/sinkron.v7i2
بيانات النشر: Politeknik Ganesha Medan
سنة النشر: 2022
مصطلحات موضوعية: Clustering, K-Mans, Rank Order Centroid, Braycurtis Distance, Silhouette Coefficient
الوصف: K-Means is a clustering algorithm that groups data based on similarities between data. Some of the problems that arise from this algorithm are when determining the center point of the cluster randomly. This will certainly affect the final result of a clustering process. To anticipate the poor accuracy value, a process is needed to determine the initial centroid in the initialization process. The second problem is when calculating the Euclidean distance on the distance between data. However, this method only gives the same impact on each data attribute. From some of these problems, this study proposes the Rank Order Centroid (ROC) method for initializing the cluster center point and using the Braycurtis distance method to calculate the distance between data. With the experiment K=2 to K=10, the results obtained in this study are the proposed method obtains an iteration reduction of 6.6% on the Student Performance Exams dataset and 19.3% on the Body Fat Prediction dataset. However, there was an increase in iterations on the Heart Failure dataset by 24.2%. In testing the cluster results using the Silhouette Coefficient, this method shows an increase in the evaluation value of 5.9% in the Student Performance Exams dataset. However, the evaluation value decreased by 8.3% in the Body Fat Prediction dataset and 3.3% in the Heart Failure dataset. ; K-Means adalah algoritma clustering yang mengelompokkan data berdasarkan kemiripan antar data. Beberapa permasalah yang timbul dari algoritma ini adalah ketika menentukan titik pusat cluster secara acak dan menghitung kedekatan antar data dengan menggunakan metode Euclidean distance. Penelitian ini mengusulkan metode Rank Order Centroid (ROC) untuk penentuan titik pusat cluster dan menggunakan metode Braycurtis distance untuk menghitung jarak antar data. Dengan percobaan K=2 sampai K=10 , hasil yang didapat dalam penelitian ini adalah metode yang diusulkan memperoleh penurunan iterasi sebesar 6,6% pada dataset Student Performance Exams dan 19,3% pada dataset Body Fat ...
نوع الوثيقة: article in journal/newspaper
وصف الملف: application/pdf
اللغة: English
Relation: https://jurnal.polgan.ac.id/index.php/sinkron/article/view/11371/883; https://jurnal.polgan.ac.id/index.php/sinkron/article/view/11371
الاتاحة: https://jurnal.polgan.ac.id/index.php/sinkron/article/view/11371
Rights: Copyright (c) 2022 Hafiz Irwandi, Opim Salim Sitompul, Sutarman Sutarman ; http://creativecommons.org/licenses/by-nc/4.0
رقم الانضمام: edsbas.EB70E62E
قاعدة البيانات: BASE