Syntax-based data augmentation for Hungarian-English machine translation

التفاصيل البيبلوغرافية
العنوان: Syntax-based data augmentation for Hungarian-English machine translation
المؤلفون: Nagy, Attila, Nanys, Patrick, Konrád, Balázs Frey, Bial, Bence, Ács, Judit
سنة النشر: 2022
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language, Computer Science - Machine Learning
الوصف: We train Transformer-based neural machine translation models for Hungarian-English and English-Hungarian using the Hunglish2 corpus. Our best models achieve a BLEU score of 40.0 on HungarianEnglish and 33.4 on English-Hungarian. Furthermore, we present results on an ongoing work about syntax-based augmentation for neural machine translation. Both our code and models are publicly available.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2201.06876
رقم الانضمام: edsarx.2201.06876
قاعدة البيانات: arXiv