Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation

التفاصيل البيبلوغرافية
العنوان: Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation
المؤلفون: Lin, Jackie, Götz, Georg, Llopis, Hermes Sampedro, Hafsteinsson, Haukur, Guðjónsson, Steinar, Nielsen, Daniel Gert, Pind, Finnur, Smaragdis, Paris, Manocha, Dinesh, Hershey, John, Kristjansson, Trausti, Kim, Minje
سنة النشر: 2025
المجموعة: Computer Science
مصطلحات موضوعية: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
الوصف: This paper describes the synthesis of the room acoustics challenge as a part of the generative data augmentation workshop at ICASSP 2025. The challenge defines a unique generative task that is designed to improve the quantity and diversity of the room impulse responses dataset so that it can be used for spatially sensitive downstream tasks: speaker distance estimation. The challenge identifies the technical difficulty in measuring or simulating many rooms' acoustic characteristics precisely. As a solution, it proposes generative data augmentation as an alternative that can potentially be used to improve various downstream tasks. The challenge website, dataset, and evaluation code are available at https://sites.google.com/view/genda2025.
Comment: Accepted to the Workshop on Generative Data Augmentation at ICASSP 2025. Challenge website: https://sites.google.com/view/genda2025
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2501.13250
رقم الانضمام: edsarx.2501.13250
قاعدة البيانات: arXiv