Principal Components of the Meaning

Bibliographic Details
Title: Principal Components of the Meaning
Authors: Suzen, Neslihan; Gorban, Alexander; Levesley, Jeremy; Mirkes, Evgeny
Publication Year: 2020
Collection: Computer Science
Subject Terms: Computer Science - Computation and Language, Computer Science - Machine Learning
Description: In this paper we argue that (lexical) meaning in science can be represented in a 13-dimensional Meaning Space. This space is constructed by applying principal component analysis (singular value decomposition) to the matrix of word-category relative information gains, where the categories are those used by the Web of Science and the words come from a reduced word set drawn from texts in the Web of Science. We show that this reduced word set plausibly represents all texts in the corpus, so that the principal component analysis has some objective meaning with respect to the corpus. We argue that 13 dimensions are adequate to describe the meaning of scientific texts, and we hypothesise about the qualitative meaning of the principal components.
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2009.08859
Accession Number: edsarx.2009.08859
Database: arXiv
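
Illustrative sketch: The description mentions principal component analysis, computed by singular value decomposition, on a matrix of word-category relative information gains. The Python fragment below is a minimal sketch of that general idea on synthetic data, not the authors' code. It makes several assumptions that are not stated in this record: relative information gain is taken to be (H(C) - H(C|W)) / H(C) for a binary "word present" indicator W and a binary "document in category" indicator C; the columns of the matrix are centred before the decomposition; and every name and size used (`relative_information_gain`, `pca_via_svd`, 200 documents, 30 words, 5 categories) is hypothetical. With only 5 toy categories it cannot reproduce the 13 dimensions reported in the paper.

```python
import numpy as np


def entropy(p):
    """Shannon entropy in bits of a binary variable with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return float(-(p * np.log2(p) + (1 - p) * np.log2(1 - p)))


def relative_information_gain(word_present, in_category):
    """Assumed definition: (H(C) - H(C | W)) / H(C) over a set of documents."""
    word_present = np.asarray(word_present, dtype=bool)
    in_category = np.asarray(in_category, dtype=bool)
    h_c = entropy(in_category.mean())
    if h_c == 0.0:
        return 0.0  # the category carries no uncertainty to explain
    h_cond = 0.0
    for value in (True, False):
        mask = word_present == value
        if mask.any():
            # weight each branch by the share of documents that fall in it
            h_cond += mask.mean() * entropy(in_category[mask].mean())
    return (h_c - h_cond) / h_c


def pca_via_svd(matrix):
    """Centre the columns, then return (components, explained variance ratio)."""
    centred = matrix - matrix.mean(axis=0)
    _, singular_values, vt = np.linalg.svd(centred, full_matrices=False)
    variance = singular_values ** 2
    total = variance.sum()
    ratio = variance / total if total > 0 else np.zeros_like(variance)
    return vt, ratio


# Synthetic stand-in for a corpus: which words occur in, and which
# categories are assigned to, each document (random, for illustration only).
rng = np.random.default_rng(0)
n_docs, n_words, n_categories = 200, 30, 5
doc_word = rng.random((n_docs, n_words)) < 0.3
doc_category = rng.random((n_docs, n_categories)) < 0.4

# Rows are words, columns are categories.
rig = np.array([
    [relative_information_gain(doc_word[:, w], doc_category[:, c])
     for c in range(n_categories)]
    for w in range(n_words)
])

components, ratio = pca_via_svd(rig)
print("explained variance ratio per component:", np.round(ratio, 3))
```

Each row of `components` is one principal direction in category space, and `ratio` gives the share of variance it captures. Choosing how many components to keep is a separate step that this sketch does not attempt.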