Academic Journal

Sentiment Analysis on Big Data: A Hybrid SED-TABU Feature Selection Method

التفاصيل البيبلوغرافية
العنوان: Sentiment Analysis on Big Data: A Hybrid SED-TABU Feature Selection Method
المؤلفون: Sabitha Rajagopal, Sreemathy Jayaprakash, Karthik Subburathinam
المصدر: Tehnički Vjesnik, Vol 31, Iss 6, Pp 2079-2086 (2024)
بيانات النشر: Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek, 2024.
سنة النشر: 2024
المجموعة: LCC:Engineering (General). Civil engineering (General)
مصطلحات موضوعية: adaboost classifier, Big data, feature selection, sentiment analysis (SA), stream evolution dynamics (SED), tabu search (TS), Engineering (General). Civil engineering (General), TA1-2040
الوصف: Big data mining is a crucial component of contemporary decision support systems linked to social networks and other data sources. Sentiment Analysis (SA) is the process by which text analytics is used to mine many data sources for opinions. This research seeks to create a feature selection method for sentiment analysis that is efficient and robust against noise and high dimensionality in Big data environments. The objective is to choose a condensed collection of useful features that increases sentiment categorization precision. It is suggested to use a novel hybrid feature selection method that combines Tabu Search (TS) and Stream Evolution Dynamics (SED). SED offers exploratory power, and TS offers exploitation. The classifier assesses the performance for each feature subset that SED-TS chose. Instances are classified using the AdaBoost classifier. The suggested method was assessed using data from Amazon product reviews. As a result, our technique outperforms wrapper and filter-based feature selection methods. By extracting a small feature subset, the SED-TS hybrid technique attained the best accuracy of 93% and an F1 score of 0.95. The work effectively combined SED and TS for feature selection specifically suited to sentiment analysis on Big data. The hybrid strategy offers higher accuracy and better generalization by utilizing the complementing characteristics of the two strategies. This shows how metaheuristic approaches can be used to classify sentiment in high-dimensional noisy data.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 1330-3651
1848-6339
Relation: https://hrcak.srce.hr/file/465290; https://doaj.org/toc/1330-3651; https://doaj.org/toc/1848-6339
DOI: 10.17559/TV-20231004000989
URL الوصول: https://doaj.org/article/4e925da641c04c7f8c3ca15e7ca758c2
رقم الانضمام: edsdoj.4e925da641c04c7f8c3ca15e7ca758c2
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:13303651
18486339
DOI:10.17559/TV-20231004000989