Report
Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation
العنوان: | Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation |
---|---|
المؤلفون: | Lin, Jackie, Götz, Georg, Llopis, Hermes Sampedro, Hafsteinsson, Haukur, Guðjónsson, Steinar, Nielsen, Daniel Gert, Pind, Finnur, Smaragdis, Paris, Manocha, Dinesh, Hershey, John, Kristjansson, Trausti, Kim, Minje |
سنة النشر: | 2025 |
المجموعة: | Computer Science |
مصطلحات موضوعية: | Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound |
الوصف: | This paper describes the synthesis of the room acoustics challenge as a part of the generative data augmentation workshop at ICASSP 2025. The challenge defines a unique generative task that is designed to improve the quantity and diversity of the room impulse responses dataset so that it can be used for spatially sensitive downstream tasks: speaker distance estimation. The challenge identifies the technical difficulty in measuring or simulating many rooms' acoustic characteristics precisely. As a solution, it proposes generative data augmentation as an alternative that can potentially be used to improve various downstream tasks. The challenge website, dataset, and evaluation code are available at https://sites.google.com/view/genda2025. Comment: Accepted to the Workshop on Generative Data Augmentation at ICASSP 2025. Challenge website: https://sites.google.com/view/genda2025 |
نوع الوثيقة: | Working Paper |
URL الوصول: | http://arxiv.org/abs/2501.13250 |
رقم الانضمام: | edsarx.2501.13250 |
قاعدة البيانات: | arXiv |
الوصف غير متاح. |