Academic Journal

Identifying Top-Performing Students via VKontakte Social Media Communities Using Advanced NLP Techniques

التفاصيل البيبلوغرافية
العنوان: Identifying Top-Performing Students via VKontakte Social Media Communities Using Advanced NLP Techniques
المؤلفون: Sergei S. Gorshkov, Dmitry I. Ignatov, Anastasia Yu. Chernysheva, Vyacheslav L. Goiko, Vitaliy V. Kashpur
المصدر: IEEE Access, Vol 13, Pp 962-979 (2025)
بيانات النشر: IEEE, 2025.
سنة النشر: 2025
المجموعة: LCC:Electrical engineering. Electronics. Nuclear engineering
مصطلحات موضوعية: Digital footprint, domain adaptation, educational data mining, information technologies in education, natural language processing, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
الوصف: Identifying potentially high-performing students is crucial for universities aiming to enhance educational outcomes, for companies seeking to recruit top talents early, and for advertising platforms looking to optimize targeted marketing. This paper introduces an algorithm designed to identify students with exceptional academic performance by analyzing their subscriptions to communities on the social network VKontakte. The study examines a sample of 4445 students from Tomsk State University with publicly accessible VK profiles. The research methodology involves generating vector representations for each community based on embeddings, topic modeling, sentiment and emotion analysis, as well as text complexity metrics. To generate the embeddings, a separate model was trained and made publicly available on HuggingFace. The integration of diverse features was achieved using attention mechanisms, allowing the model to dynamically weigh their importance and capture intricate interrelations. These representations are then used to construct a digital user profile, capturing the students’ interests as reflected in their community subscriptions. Additionally, the machine learning pipeline incorporated stacking to combine predictions from multiple models, enhancing robustness and classification performance. Through a series of experiments, we developed a machine learning algorithm that effectively distinguishes between high- and low-performing students based on these profiles. This approach also enabled the identification and interpretation of key factors differentiating high-performing students from their lower-performing peers. Additionally, we investigated the factors positively and negatively associated with academic performance.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2169-3536
Relation: https://ieeexplore.ieee.org/document/10812733/; https://doaj.org/toc/2169-3536
DOI: 10.1109/ACCESS.2024.3521857
URL الوصول: https://doaj.org/article/c5fcb02b1b1f4689bc43f20ede0f5828
رقم الانضمام: edsdoj.5fcb02b1b1f4689bc43f20ede0f5828
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21693536
DOI:10.1109/ACCESS.2024.3521857