Academic Journal

A Comparative Performance Study of Hybrid Firefly Algorithms for Automatic Data Clustering

التفاصيل البيبلوغرافية
العنوان: A Comparative Performance Study of Hybrid Firefly Algorithms for Automatic Data Clustering
المؤلفون: Absalom El-Shamir Ezugwu, Moyinoluwa B. Agbaje, Nahla Aljojo, Rosanne Els, Haruna Chiroma, Mohamed Abd Elaziz
المصدر: IEEE Access, Vol 8, Pp 121089-121118 (2020)
بيانات النشر: IEEE, 2020.
سنة النشر: 2020
المجموعة: LCC:Electrical engineering. Electronics. Nuclear engineering
مصطلحات موضوعية: Automatic clustering, firefly algorithm, firefly-based hybrid algorithms, clustering validity index, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
الوصف: In cluster analysis, the goal has always been to extemporize the best possible means of automatically determining the number of clusters. However, because of lack of prior domain knowledge and uncertainty associated with data objects characteristics, it is challenging to choose an appropriate number of clusters, especially when dealing with data objects of high dimensions, varying data sizes, and density. In the last few decades, different researchers have proposed and developed several nature-inspired metaheuristic algorithms to solve data clustering problems. Many studies have shown that the firefly algorithm is a very robust, efficient and effective nature-inspired swarm intelligence global search technique, which has been successfully applied to solve diverse NP-hard optimization problems. However, the diversification search process employed by the firefly algorithm can lead to reduced speed and convergence rate for large-scale optimization problems. Thus this study investigates the application of four hybrid firefly algorithms to the task of automatic clustering of high density and large-scaled unlabelled datasets. In contrast to most of the existing classical heuristic-based data clustering analyses techniques, the proposed hybrid algorithms do not require any prior knowledge of the data objects to be classified. Instead, the hybrid methods automatically determine the optimal number of clusters empirically and during the program execution. Two well-known clustering validity indices, namely the Compact-Separated and Davis-Bouldin indices, are employed to evaluate the superiority of the implemented firefly hybrid algorithms. Furthermore, twelve standard ground truth clustering datasets from the UCI Machine Learning Repository are used to evaluate the robustness and effectiveness of the algorithms against those of the classical swarm optimization algorithms and other related clustering results from the literature. The experimental results show that the new clustering methods depict high superiority in comparison with existing standalone and other hybrid metaheuristic techniques in terms of clustering validity measures.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2169-3536
Relation: https://ieeexplore.ieee.org/document/9130694/; https://doaj.org/toc/2169-3536
DOI: 10.1109/ACCESS.2020.3006173
URL الوصول: https://doaj.org/article/434079cbc04946068ae636441b046fd0
رقم الانضمام: edsdoj.434079cbc04946068ae636441b046fd0
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21693536
DOI:10.1109/ACCESS.2020.3006173