-
1
المؤلفون: Soheil Khorram, Anshuman Tripathi, Jaeyoung Kim, Han Lu, Qian Zhang, Rohit Prabhavalkar, Hasim Sak
المصدر: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
-
2
المؤلفون: Cal Peyser, Ronny Huang, Tara Sainath, Rohit Prabhavalkar, Michael Picheny, Kyunghyun Cho
مصطلحات موضوعية: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computation and Language (cs.CL)
-
3
المؤلفون: Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computer Science - Neural and Evolutionary Computing, Neural and Evolutionary Computing (cs.NE), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Machine Learning (cs.LG)
-
4
المؤلفون: Rami Botros, Rohit Prabhavalkar, Johan Schalkwyk, Ciprian Chelba, Tara N. Sainath, Françoise Beaufays
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
-
5
المؤلفون: Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), Computer Science - Artificial Intelligence, FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Machine Learning (cs.LG), Electrical Engineering and Systems Science - Audio and Speech Processing
-
6
المؤلفون: Ke Hu, Tara Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang
المصدر: Interspeech 2022.
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
-
7
المؤلفون: Shaojin Ding, Wang Weiran, Ding Zhao, Tara Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Machine Learning, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computer Science - Sound, Machine Learning (cs.LG), Electrical Engineering and Systems Science - Audio and Speech Processing
-
8
المؤلفون: W. Ronny Huang, Shuo-Yiin Chang, David Rybach, Tara Sainath, Rohit Prabhavalkar, Cal Peyser, Zhiyun Lu, Cyril Allauzen
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Machine Learning (cs.LG)
-
9
المؤلفون: Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
-
10
المؤلفون: Trevor Strohman, Yanzhang He, Sean Campbell, Tara N. Sainath, Rohit Prabhavalkar, David Rybach, Arun Narayanan
المصدر: ICASSP
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Computer Science - Computation and Language, Computer science, Word error rate, Context (language use), Computer Science - Sound, Oracle, Machine Learning (cs.LG), Tree (data structure), Recurrent neural network, Audio and Speech Processing (eess.AS), Path (graph theory), FOS: Electrical engineering, electronic engineering, information engineering, Limit (mathematics), Computation and Language (cs.CL), Algorithm, Decoding methods, Electrical Engineering and Systems Science - Audio and Speech Processing
-
11
المؤلفون: Howard Nathan David, Rohit Prabhavalkar, Alexander H. Gruenstein, Alex Park, Turaj Zakizadeh Shabestary
المصدر: ICASSP
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Signal processing, Artificial neural network, Computer science, Speech recognition, Echo (computing), Signal, Computer Science - Sound, Synthetic data, Speech enhancement, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Word (computer architecture), Electrical Engineering and Systems Science - Audio and Speech Processing, Communication channel
-
12
المؤلفون: Yanzhang He, Ke Hu, Rohit Prabhavalkar, Deepti Bhatia, Yu Zhang, Wei Li, David Qiu, Qiujia Li, Tara N. Sainath, Bo Li, Ian McGraw, Liangliang Cao
المصدر: ICASSP
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Machine Learning, Vocabulary, Computer Science - Computation and Language, Correctness, Voice search, Computer science, media_common.quotation_subject, Speech recognition, Model selection, Context (language use), Machine Learning (cs.LG), Tokenization (data security), Audio and Speech Processing (eess.AS), Word recognition, FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Word (computer architecture), Electrical Engineering and Systems Science - Audio and Speech Processing, media_common
-
13
المؤلفون: Ching-Feng Yeh, Ozlem Kalinli, Yangyang Shi, Chunyang Wu, Rohit Prabhavalkar, Alex Xiao, Christian Fuegen, Duc Le, Michael L. Seltzer, Varun K. Nagaraja, Julian Chan, Jay Mahadeokar
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Computation and Language, Computer science, Computation, Latency (audio), Word error rate, Data set, Transducer, ComputingMethodologies_SYMBOLICANDALGEBRAICMANIPULATION, Algorithm, Encoder, Computation and Language (cs.CL), Decoding methods, Dropout (neural networks)
-
14
المؤلفون: Chunyang Wu, Jiatong Zhou, Christian Fuegen, Ozlem Kalinli, Hang Su, Duc Le, Yuan Shangguan, Jay Mahadeokar, Rohit Prabhavalkar, Michael L. Seltzer, Yangyang Shi
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Computer science, Speech recognition, Process (computing), Word error rate, Security token, FLOPS, Computer Science - Sound, Task (computing), Recurrent neural network, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Latency (engineering), Computation and Language (cs.CL), Decoding methods, Electrical Engineering and Systems Science - Audio and Speech Processing
-
15
المؤلفون: Jiahui Yu, Chung-Cheng Chiu, Ehsan Variani, Arun Narayanan, Rohit Prabhavalkar, Trevor Strohman, Ruoming Pang, Tara N. Sainath
المصدر: ICASSP
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Signal processing, Computer Science - Computation and Language, Computer science, Computer Science - Sound, Mode (computer interface), Computer engineering, Audio and Speech Processing (eess.AS), Error analysis, FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Encoder, Word (computer architecture), Electrical Engineering and Systems Science - Audio and Speech Processing
-
16
المؤلفون: Rohit Prabhavalkar, Ananya Misra, Antoine Bruguier, Arun Narayanan
المصدر: INTERSPEECH
مصطلحات موضوعية: Computer science, Stacking, Anti-aliasing, Algorithm, Regularization (mathematics)
-
17
المؤلفون: Julia Proskurnia, Daria Soboleva, Justin Lu, Bogdan Prisacari, Balint Miklos, Rohit Prabhavalkar, Daniel Valcarce, Marius Sajgalik, Felix Weissenberger, Victor Carbune, Ondrej Skopek
المصدر: ICASSP
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Machine Learning, Computer Science - Computation and Language, Artificial neural network, Computer science, media_common.quotation_subject, Speech recognition, Hash function, Latency (audio), Punctuation, Synthetic data, Computer Science - Sound, Machine Learning (cs.LG), Conjunction (grammar), Data modeling, Sound recording and reproduction, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), media_common, Electrical Engineering and Systems Science - Audio and Speech Processing
-
18
المؤلفون: Ruoming Pang, Chung-Cheng Chiu, Rohit Prabhavalkar, Yu Zhang, Tara N. Sainath, Liangliang Cao, Yonghui Wu, Wei Han, Arun Narayanan, Navdeep Jaitly, Patrick Nguyen
المصدر: SLT
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Computation and Language, Computer science, Generalization, Speech recognition, Inference, Word error rate, 020206 networking & telecommunications, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Data modeling, Set (abstract data type), Recurrent neural network, Audio and Speech Processing (eess.AS), Test set, 0202 electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), 0105 earth and related environmental sciences, Test data, Electrical Engineering and Systems Science - Audio and Speech Processing
-
19
المؤلفون: Tara N. Sainath, Ruoming Pang, Ke Hu, Rohit Prabhavalkar
المصدر: ICASSP
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Voice search, Computer science, media_common.quotation_subject, Speech recognition, Context (language use), 02 engineering and technology, 010501 environmental sciences, Deliberation, 01 natural sciences, Computer Science - Sound, Reduction (complexity), Audio and Speech Processing (eess.AS), Test set, FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Computation and Language (cs.CL), Encoder, Decoding methods, Electrical Engineering and Systems Science - Audio and Speech Processing, 0105 earth and related environmental sciences, media_common
-
20A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency
المؤلفون: Zhifeng Chen, Ian McGraw, David Garcia, Mirko Visontai, Yuan Shangguan, Bo Li, Yanzhang He, Qiao Liang, Antoine Bruguier, Tara N. Sainath, Yash Sheth, Yu Zhang, Golan Pundak, Chung-Cheng Chiu, Raziel Alvarez, Ke Hu, Cal Peyser, David Rybach, Alex Gruenstein, Yonghui Wu, Trevor Strohman, Ruoming Pang, Ding Zhao, Rohit Prabhavalkar, Arun Narayanan, Shuo-Yiin Chang, Wei Li, Anjuli Kannan
المصدر: ICASSP
مصطلحات موضوعية: Vocabulary, Microphone, Computer science, Speech recognition, media_common.quotation_subject, Word error rate, 020206 networking & telecommunications, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Recurrent neural network, End-to-end principle, 0202 electrical engineering, electronic engineering, information engineering, Model quality, Latency (engineering), Server-side, 0105 earth and related environmental sciences, media_common