-
1. Report: NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts
Authors: Lin, Yen-Ting, Yang, Chao-Han Huck, Chen, Zhehuai, Zelasko, Piotr, Yang, Xuesong, Chen, Zih-Ching, Puvvada, Krishna C, Fu, Szu-Wei, Hu, Ke, Chiu, Jun Wei, Balam, Jagadeesh, Ginsburg, Boris, Wang, Yu-Chiang Frank
Subject terms: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multiagent Systems, Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: http://arxiv.org/abs/2411.05945
-
2. Report
Authors: Ouyang, Siqi, Hrinchuk, Oleksii, Chen, Zhehuai, Lavrukhin, Vitaly, Balam, Jagadeesh, Li, Lei, Ginsburg, Boris
Subject terms: Computer Science - Computation and Language
Access URL: http://arxiv.org/abs/2410.22499
-
3. Report
Authors: Peng, Yifan, Puvvada, Krishna C., Chen, Zhehuai, Zelasko, Piotr, Huang, He, Dhawan, Kunal, Hu, Ke, Watanabe, Shinji, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: http://arxiv.org/abs/2410.17485
-
4. Report
Authors: Lu, Ke-Han, Chen, Zhehuai, Fu, Szu-Wei, Yang, Chao-Han Huck, Balam, Jagadeesh, Ginsburg, Boris, Wang, Yu-Chiang Frank, Lee, Hung-yi
Subject terms: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Sound
Access URL: http://arxiv.org/abs/2409.20007
-
5. Report
Authors: Żelasko, Piotr, Chen, Zhehuai, Wang, Mengru, Galvez, Daniel, Hrinchuk, Oleksii, Ding, Shuoyang, Hu, Ke, Balam, Jagadeesh, Lavrukhin, Vitaly, Ginsburg, Boris
Subject terms: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: http://arxiv.org/abs/2409.13523
-
6. Report: META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR
Authors: Wang, Jinhan, Wang, Weiqing, Dhawan, Kunal, Park, Taejin, Kim, Myungjong, Medennikov, Ivan, Huang, He, Koluguri, Nithin, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Access URL: http://arxiv.org/abs/2409.12352
-
7. Report
Authors: Hu, Ke, Chen, Zhehuai, Yang, Chao-Han Huck, Żelasko, Piotr, Hrinchuk, Oleksii, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Computer Science - Computation and Language
Access URL: http://arxiv.org/abs/2409.11538
-
8. Report
Authors: Yang, Chao-Han Huck, Park, Taejin, Gong, Yuan, Li, Yuanchao, Chen, Zhehuai, Lin, Yen-Ting, Chen, Chen, Hu, Yuchen, Dhawan, Kunal, Żelasko, Piotr, Zhang, Chao, Chen, Yun-Nung, Tsao, Yu, Balam, Jagadeesh, Ginsburg, Boris, Siniscalchi, Sabato Marco, Chng, Eng Siong, Bell, Peter, Lai, Catherine, Watanabe, Shinji, Stolcke, Andreas
Subject terms: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: http://arxiv.org/abs/2409.09785
-
9. Report
Authors: Park, Taejin, Medennikov, Ivan, Dhawan, Kunal, Wang, Weiqing, Huang, He, Koluguri, Nithin Rao, Puvvada, Krishna C., Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound
Access URL: http://arxiv.org/abs/2409.06656
-
10. Report
Authors: Koluguri, Nithin Rao, Bartley, Travis, Xu, Hainan, Hrinchuk, Oleksii, Balam, Jagadeesh, Ginsburg, Boris, Kucsko, Georg
Subject terms: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language
Access URL: http://arxiv.org/abs/2409.05601
-
11. Report
Authors: Wang, Weiqing, Dhawan, Kunal, Park, Taejin, Puvvada, Krishna C., Medennikov, Ivan, Majumdar, Somshubra, Huang, He, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Access URL: http://arxiv.org/abs/2409.01438
-
12. Report
Authors: Huang, He, Park, Taejin, Dhawan, Kunal, Medennikov, Ivan, Puvvada, Krishna C., Koluguri, Nithin Rao, Wang, Weiqing, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: http://arxiv.org/abs/2408.13106
-
13. Report
Authors: Majumdar, Somshubra, Noroozi, Vahid, Narenthiran, Sean, Ficek, Aleksander, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing
Access URL: http://arxiv.org/abs/2407.21077
-
14.
-
15. Report
Authors: Dhawan, Kunal, Koluguri, Nithin Rao, Jukić, Ante, Langman, Ryan, Balam, Jagadeesh, Ginsburg, Boris
Source: Proceedings of Interspeech 2024
Subject terms: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Machine Learning
Access URL: http://arxiv.org/abs/2407.03495
-
16. Report
Authors: Chen, Zhehuai, Huang, He, Hrinchuk, Oleksii, Puvvada, Krishna C., Koluguri, Nithin Rao, Żelasko, Piotr, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Computer Science - Computation and Language, Computer Science - Human-Computer Interaction, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, 68T10, I.2.7
Access URL: http://arxiv.org/abs/2406.19954
-
17. Report
Authors: Puvvada, Krishna C., Żelasko, Piotr, Huang, He, Hrinchuk, Oleksii, Koluguri, Nithin Rao, Dhawan, Kunal, Majumdar, Somshubra, Rastorgueva, Elena, Chen, Zhehuai, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: http://arxiv.org/abs/2406.19674
-
18. Report
Authors: Noroozi, Vahid, Chen, Zhehuai, Majumdar, Somshubra, Huang, Steve, Balam, Jagadeesh, Ginsburg, Boris
Subject terms: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Access URL: http://arxiv.org/abs/2406.12946
-
19. Report
Authors: Jukić, Ante, Balam, Jagadeesh, Ginsburg, Boris
Source: WASPAA 2023
Access URL: http://arxiv.org/abs/2406.04552
-
20. Report
Authors: Agrawal, Aviral, Lezcano, Carlos Mateo Samudio, Heredia-Marin, Iqui Balam, Sethi, Prabhdeep Singh
Subject terms: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language, Computer Science - Machine Learning
Access URL: http://arxiv.org/abs/2404.13530