-
1
Authors: Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley
Subject terms: FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)
-
2
Authors: Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno
Subject terms: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), Computer Science - Artificial Intelligence, FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Machine Learning (cs.LG), Electrical Engineering and Systems Science - Audio and Speech Processing
-
3
Authors: Ehsan Variani, Michael Riley, David Rybach, Cyril Allauzen, Tongzhou Chen, Bhuvana Ramabhadran
Source: Interspeech 2022.
-
4
Authors: Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey
Subject terms: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Electrical Engineering and Systems Science - Audio and Speech Processing, Machine Learning (cs.LG)
-
5
Authors: David Rybach, Hao Zhang, Cyril Allauzen, Michael A. Riley, Ehsan Variani
Source: Interspeech 2021.
Subject terms: Computer architecture, Computer science
-
6
Academic Journal
Authors: Ehsan Variani, Feipeng Li, Hynek Hermansky
Contributors: The Pennsylvania State University CiteSeerX Archives
File description: application/pdf
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.644.7711; http://hltcoe.jhu.edu/uploads/publications/papers/16661_slides.pdf
-
7
Authors: Jiahui Yu, Chung-Cheng Chiu, Ehsan Variani, Arun Narayanan, Rohit Prabhavalkar, Trevor Strohman, Ruoming Pang, Tara N. Sainath
Source: ICASSP
Subject terms: FOS: Computer and information sciences, Sound (cs.SD), Signal processing, Computer Science - Computation and Language, Computer science, Computer Science - Sound, Mode (computer interface), Computer engineering, Audio and Speech Processing (eess.AS), Error analysis, FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Encoder, Word (computer architecture), Electrical Engineering and Systems Science - Audio and Speech Processing
-
8
Authors: Pedro J. Moreno, Ehsan Variani, James Apfel, Bhuvana Ramabhadran, Tongzhou Chen, Seungji Lee
Source: ICASSP
Subject terms: Computer science, business.industry, 05 social sciences, Word error rate, Pattern recognition, Function (mathematics), 010501 environmental sciences, 01 natural sciences, Oracle, Binary classification, Search algorithm, 0502 economics and business, Search problem, Beam search, Language model, Artificial intelligence, 050207 economics, business, 0105 earth and related environmental sciences
-
9
Academic Journal
Authors: Ehsan Variani, Feipeng Li, Hynek Hermansky
Contributors: The Pennsylvania State University CiteSeerX Archives
File description: application/pdf
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.684.3153; http://www.clsp.jhu.edu/%7Evariani/MSPM_Interspeech_2013.pdf
-
10
Authors: Kevin W. Wilson, Kean Chin, Chanwoo Kim, Bo Li, Ananya Misra, Ron Weiss, Ehsan Variani, Andrew W. Senior, Tara N. Sainath, Izhak Shafran, Michiel Bacchiani, Arun Narayanan
Source: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25:965-979
Subject terms: Beamforming, Acoustics and Ultrasonics, Time delay neural network, Computer science, Speech recognition, Direction of arrival, Word error rate, 020206 networking & telecommunications, 02 engineering and technology, Filter (signal processing), Speech processing, Filter bank, Speech enhancement, 030507 speech-language pathology & audiology, 03 medical and health sciences, Computational Mathematics, 0202 electrical engineering, electronic engineering, information engineering, Computer Science (miscellaneous), Electrical and Electronic Engineering, 0305 other medical science
-
11
Authors: Hasim Sak, Ehsan Variani, Erik McDermott
Source: ASRU
Subject terms: FOS: Computer and information sciences, Fusion, Sound (cs.SD), Voice search, Computer Science - Computation and Language, Computer science, Speech recognition, Computer Science - Sound, Domain (software engineering), Bayes' theorem, Recurrent neural network, End-to-end principle, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Language model, Hidden Markov model, Computation and Language (cs.CL), Electrical Engineering and Systems Science - Audio and Speech Processing
-
12
Authors: David Rybach, Ehsan Variani, Michael Riley, Cyril Allauzen
Source: ICASSP
Subject terms: FOS: Computer and information sciences, Computer Science - Machine Learning, Measure (data warehouse), Sound (cs.SD), Computer Science - Computation and Language, Voice search, Computer science, Modularity (biology), Speech recognition, Inference, Computer Science - Sound, Machine Learning (cs.LG), Transducer, Autoregressive model, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Beam search, Language model, Computation and Language (cs.CL), Electrical Engineering and Systems Science - Audio and Speech Processing
-
13
Authors: Ananda Theertha Suresh, Mitchel Weintraub, Ehsan Variani
Source: ICASSP
Subject terms: FOS: Computer and information sciences, Transduction (machine learning), Vocabulary, Computer Science - Machine Learning, Computer science, media_common.quotation_subject, Inference, Machine Learning (stat.ML), 010501 environmental sciences, 01 natural sciences, Machine Learning (cs.LG), 030507 speech-language pathology & audiology, 03 medical and health sciences, Statistics - Machine Learning, Categorical variable, 0105 earth and related environmental sciences, media_common, Computer Science - Computation and Language, business.industry, Pattern recognition, Transducer, Softmax function, Embedding, Artificial intelligence, 0305 other medical science, business, Computation and Language (cs.CL)
-
14
Authors: Michiel Bacchiani, Tom Bagby, Kamel Lahouel, Erik McDermott, Ehsan Variani
Source: ICASSP
Subject terms: Logarithm, business.industry, Computer science, Entropy (statistical thermodynamics), Sampling (statistics), Pattern recognition, 02 engineering and technology, 01 natural sciences, Upper and lower bounds, Cross entropy, Connectionism, Bias of an estimator, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Entropy (information theory), 020201 artificial intelligence & image processing, Artificial intelligence, Entropy (energy dispersal), 010306 general physics, business, Entropy (arrow of time)
-
15
Authors: Kevin W. Wilson, Erik McDermott, Arun Narayanan, Olivier Siohan, Ananya Misra, Khe Chai Sim, Ehsan Variani, J. Caroselli, Izhak Shafran, Chanwoo Kim, Matt Shannon, Mitchel Weintraub, Ron Weiss, Michiel Bacchiani, Bo Li, Richard Rose, Golan Pundak, Hasim Sak, Tara N. Sainath, K. K. Chin
Source: INTERSPEECH
Subject terms: World Wide Web, 030507 speech-language pathology & audiology, 03 medical and health sciences, Computer science, 0202 electrical engineering, electronic engineering, information engineering, 020206 networking & telecommunications, 02 engineering and technology, 0305 other medical science
-
16
Authors: Erik McDermott, Michiel Bacchiani, Tom Bagby, Ehsan Variani
Source: INTERSPEECH
Subject terms: Vocabulary, Computer science, Speech recognition, media_common.quotation_subject, Training (meteorology), Acoustic model, 02 engineering and technology, 030507 speech-language pathology & audiology, 03 medical and health sciences, End-to-end principle, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, 0305 other medical science, media_common
-
17
Authors: Arun Narayanan, Michiel Bacchiani, Ehsan Variani, Chanwoo Kim
Source: INTERSPEECH
Subject terms: Signal Processing (eess.SP), FOS: Computer and information sciences, Sound (cs.SD), Speedup, Artificial neural network, Computer science, Fast Fourier transform, Graphics processing unit, CPU time, 020206 networking & telecommunications, 02 engineering and technology, Computer Science - Sound, Audio and Speech Processing (eess.AS), 0202 electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, Central processing unit, Electrical Engineering and Systems Science - Signal Processing, Word (computer architecture), Simulation, Impulse response, Electrical Engineering and Systems Science - Audio and Speech Processing
-
18
Authors: Ron Weiss, Kevin W. Wilson, Bo Li, Michiel Bacchiani, Ehsan Variani, Chanwoo Kim, Ananya Misra, Andrew W. Senior, Izhak Shafran, Tara N. Sainath, Arun Narayanan, K. K. Chin
Source: New Era for Robust Speech Recognition ISBN: 9783319646794
New Era for Robust Speech Recognition, Exploiting Deep Learning
Subject terms: Beamforming, Speech enhancement, Artificial neural network, Computer science, Frequency domain, Direction of arrival, Word error rate, Filter (signal processing), Filter bank, Algorithm
-
19
Authors: Tara N. Sainath, Michiel Bacchiani, Ehsan Variani, Izhak Shafran
Source: INTERSPEECH
Subject terms: business.industry, Computer science, Speech recognition, Feature extraction, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, 030507 speech-language pathology & audiology, 03 medical and health sciences, Discriminative model, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, 0305 other medical science, Joint (audio engineering), business
-
20
Authors: Tara N. Sainath, Izhak Shafran, Michiel Bacchiani, Ehsan Variani, Ron Weiss, Arun Narayanan, Kevin W. Wilson
Source: INTERSPEECH
Subject terms: Computational complexity theory, Computer science, business.industry, Feature extraction, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Machine learning, computer.software_genre, 030507 speech-language pathology & audiology, 03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, 0305 other medical science, business, computer