Dissertation/ Thesis
Računalna statistička analiza jezika religijskih rasprava na internetskim forumima ; Computational Statistical Analysis of the Language of Religious Discussions on Internet Forums
Title: | Računalna statistička analiza jezika religijskih rasprava na internetskim forumima ; Computational Statistical Analysis of the Language of Religious Discussions on Internet Forums |
---|---|
Authors: | Torić, Josip |
Contributors: | Šnajder, Jan |
Publication Information: | Sveučilište u Zagrebu. Fakultet elektrotehnike i računarstva. University of Zagreb. Faculty of Electrical Engineering and Computing. |
Publication Year: | 2018 |
Collection: | Croatian Digital Theses Repository (National and University Library in Zagreb) |
Subject Terms: | obrada prirodnog jezika, strojno učenje, Reddit, logistička regresija, stroj potpornih vektora, LIWC, statistička analiza podatka, natural language processing, machine learning, logistic regression, support vector machine, statistical data analysis, TEHNIČKE ZNANOSTI. Računarstvo, TECHNICAL SCIENCES. Computing |
Description: | The analysis of religious questions is always controversial. Religion is something personal for every person, and considering it objectively is very demanding. The aim of the thesis was to analyze the texts of religious discussions using statistical analysis. The data set used in this work was downloaded from the social network Reddit. We built the model using word frequencies and LIWC features. We tested two machine learning models for predicting religion: logistic regression and a support vector machine. The results are satisfactory: the models predict a user's religion from his or her comments with an accuracy of approximately 65%. |
Document Type: | bachelor thesis |
File Description: | application/pdf |
Language: | Croatian |
Relation: | https://zir.nsk.hr/islandora/object/fer:4205; https://urn.nsk.hr/urn:nbn:hr:168:648883; https://repozitorij.unizg.hr/islandora/object/fer:4205; https://repozitorij.unizg.hr/islandora/object/fer:4205/datastream/PDF |
Availability: | https://zir.nsk.hr/islandora/object/fer:4205; https://urn.nsk.hr/urn:nbn:hr:168:648883; https://repozitorij.unizg.hr/islandora/object/fer:4205; https://repozitorij.unizg.hr/islandora/object/fer:4205/datastream/PDF |
Rights: | http://rightsstatements.org/vocab/InC/1.0/ ; info:eu-repo/semantics/closedAccess |
Accession Number: | edsbas.4324FD18 |
Database: | BASE |