Dissertation / Thesis

Računalna statistička analiza jezika religijskih rasprava na internetskim forumima ; Computational Statistical Analysis of the Language of Religious Discussions on Internet Forums

Bibliographic Details
Title: Računalna statistička analiza jezika religijskih rasprava na internetskim forumima ; Computational Statistical Analysis of the Language of Religious Discussions on Internet Forums
Authors: Torić, Josip
Contributors: Šnajder, Jan
Publisher Information: Sveučilište u Zagrebu. Fakultet elektrotehnike i računarstva.
University of Zagreb. Faculty of Electrical Engineering and Computing.
Publication Year: 2018
Collection: Croatian Digital Theses Repository (National and University Library in Zagreb)
Subject Terms: obrada prirodnog jezika, strojno učenje, Reddit, logistička regresija, stroj potpornih vektora, LIWC, statistička analiza podatka, natural language processing, machine learning, logistic regression, support vector machine, statistical data analysis, TEHNIČKE ZNANOSTI. Računarstvo, TECHNICAL SCIENCES. Computing
Description: Analiza religijskih pitanja je uvijek vrlo kontroverzna. Religija kod svakog čovjeka predstavlja nešto osobno, a objektivno sagledati religiju je vrlo zahtjevna stvar. Cilj rada je bio analizirati tekstove religijskih rasprava uz pomoć statističke analize. Skup podataka korišten u radu smo preuzeli s društvene mreže Reddit. Model smo izgradili koristeći frekvencije riječi i značajke LIWC. Isprobali smo dva modela strojnog učenja, logističku regresiju i stroj potpornih vektora za predviđanje religije. Ostvarili smo rezultate koji su zadovoljavajući te s točnošću od otprilike 65% predviđaju religiju korisnika na temelju njegovih ili njezinih komentara. ; The analysis of religious questions is always highly controversial. Religion is something personal for every individual, and considering it objectively is very demanding. The aim of the thesis was to analyze the texts of religious discussions by means of statistical analysis. The data set used in this work was taken from the social network Reddit. We built the model using word frequencies and LIWC features. We tested two machine learning models for predicting religion: logistic regression and a support vector machine. The results are satisfactory: the models predict a user's religion from his or her comments with an accuracy of approximately 65%.
Document Type: bachelor thesis
File Description: application/pdf
Language: Croatian
Relation: https://zir.nsk.hr/islandora/object/fer:4205; https://urn.nsk.hr/urn:nbn:hr:168:648883; https://repozitorij.unizg.hr/islandora/object/fer:4205; https://repozitorij.unizg.hr/islandora/object/fer:4205/datastream/PDF
Availability: https://zir.nsk.hr/islandora/object/fer:4205
https://urn.nsk.hr/urn:nbn:hr:168:648883
https://repozitorij.unizg.hr/islandora/object/fer:4205
https://repozitorij.unizg.hr/islandora/object/fer:4205/datastream/PDF
Rights: http://rightsstatements.org/vocab/InC/1.0/ ; info:eu-repo/semantics/closedAccess
Accession Number: edsbas.4324FD18
Database: BASE
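
Note: The approach summarized in the Description (word-frequency features fed to a logistic regression classifier) can be illustrated with a minimal sketch. The code below is not taken from the thesis. The toy comments, the two group labels, the function names, and the learning rate and epoch count are all assumptions made for illustration. LIWC features are omitted because LIWC is a proprietary lexicon, and the support vector machine is not shown.

```python
import math
from collections import Counter


def word_frequencies(text, vocabulary):
    """Relative frequency of each vocabulary word in a whitespace-tokenized text."""
    tokens = text.lower().split()
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty text
    return [counts[word] / total for word in vocabulary]


def train_logistic_regression(X, y, lr=1.0, epochs=500):
    """Batch gradient descent on the mean logistic loss; returns (weights, bias)."""
    n_features = len(X[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for xi, yi in zip(X, y):
            z = sum(wj * xj for wj, xj in zip(w, xi)) + b
            p = 1.0 / (1.0 + math.exp(-z))  # predicted probability of class 1
            err = p - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * gj / len(X) for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / len(X)
    return w, b


def predict(text, vocabulary, w, b):
    """Return 1 if the linear score is positive, else 0."""
    x = word_frequencies(text, vocabulary)
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if z > 0 else 0


# Made-up toy comments (NOT the thesis data); labels 0 and 1 stand for
# two hypothetical user groups.
comments = [
    ("the church service and the bible reading", 0),
    ("reading the bible at church", 0),
    ("science and evidence and reason", 1),
    ("evidence matters and reason matters", 1),
]
vocabulary = sorted({tok for text, _ in comments for tok in text.lower().split()})
X = [word_frequencies(text, vocabulary) for text, _ in comments]
y = [label for _, label in comments]
w, b = train_logistic_regression(X, y)
```

Calling `predict("bible church", vocabulary, w, b)` on this toy data assigns the comment to group 0. A realistic experiment would need a held-out test set and far more data before any accuracy figure could be compared with the roughly 65% reported in the abstract.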