Comparing Styles across Languages

Bibliographic Details
Title: Comparing Styles across Languages
Authors: Havaldar, Shreya; Pressimone, Matthew; Wong, Eric; Ungar, Lyle
Publication Year: 2023
Collection: Computer Science
Subject Terms: Computer Science - Computation and Language
Description: Understanding how styles differ across languages is advantageous for training both humans and computers to generate culturally appropriate text. We introduce an explanation framework to extract stylistic differences from multilingual LMs and compare styles across languages. Our framework (1) generates comprehensive style lexica in any language and (2) consolidates feature importances from LMs into comparable lexical categories. We apply this framework to compare politeness, creating the first holistic multilingual politeness dataset and exploring how politeness varies across four languages. Our approach enables an effective evaluation of how distinct linguistic categories contribute to stylistic variations and provides interpretable insights into how people communicate differently around the world.
Comment: Accepted to EMNLP 2023
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2310.07135
Accession Number: edsarx.2310.07135
Database: arXiv