Graph neural networks for sound source localization on distributed microphone networks

Bibliographic details
Title: Graph neural networks for sound source localization on distributed microphone networks
Authors: Grinstein, Eric; Brookes, Mike; Naylor, Patrick A.
Publication year: 2023
Collection: Computer Science
Subject terms: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Description: Distributed Microphone Arrays (DMAs) present many challenges compared with centralized microphone arrays. An important requirement of applications on these arrays is handling a variable number of input channels. We consider the use of Graph Neural Networks (GNNs) as a solution to this challenge. We present a localization method using the Relation Network GNN, which we show shares many similarities with classical signal processing algorithms for Sound Source Localization (SSL). We apply our method to the task of SSL and validate it experimentally using an unseen number of microphones. We test different feature extractors and show that our approach significantly outperforms classical baselines.
Comment: Presented as a poster at ICASSP 2023
Document type: Working Paper
Access URL: http://arxiv.org/abs/2306.16081
Accession number: edsarx.2306.16081
Database: arXiv