-
1
المؤلفون: Samuele Cornell, Zhong-Qiu Wang, Yoshiki Masuyama, Shinji Watanabe, Manuel Pariente, Nobutaka Ono, Stefano Squartini
المصدر: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
-
2
المؤلفون: Samuele Cornell, Manuel Pariente, Francois Grondin, Stefano Squartini
مصطلحات موضوعية: Signal Processing (eess.SP), FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Electrical Engineering and Systems Science - Signal Processing, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Machine Learning (cs.LG)
-
3
المؤلفون: Ariel Frank, Emmanuel Vincent, Fabian-Robert Stöter, Mathieu Hu, Joris Cosentino, Manuel Pariente, Samuele Cornell, Sunit Sivasankaran, David Ditter, Efthymios Tzinis, Juan M. Martín-Doñas, Antoine Deleforge, Michel Olvera, Jens Heitkaemper
المساهمون: Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Università Politecnica delle Marche [Ancona] (UNIVPM), Department of Electrical and Computer Engineering [Urbana] (University of Illinois), University of Illinois at Urbana-Champaign [Urbana], University of Illinois System-University of Illinois System, University of Paderborn, Scientific Data Management (ZENITH), Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM), Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Inria Sophia Antipolis - Méditerranée (CRISAM), Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Universidad de Granada = University of Granada (UGR), University of Hamburg, Technion - Israel Institute of Technology [Haifa], Grid'5000, Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Inria Sophia Antipolis - Méditerranée (CRISAM), University of Granada [Granada]
المصدر: Interspeech 2020
Interspeech 2020, Oct 2020, Shanghai, China
INTERSPEECHمصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer science, open-source software, 020207 software engineering, 02 engineering and technology, [INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE], end-to-end, Computer Science - Sound, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Speech enhancement, Computer engineering, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Audio and Speech Processing (eess.AS), Asteroid, source separation, [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD], FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Source separation, 020201 artificial intelligence & image processing, speech enhancement, Software architecture, Electrical Engineering and Systems Science - Audio and Speech Processing
-
4
المؤلفون: Emmanuel Vincent, Manuel Pariente, Antoine Deleforge, Samuele Cornell
المساهمون: Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Università Politecnica delle Marche [Ancona] (UNIVPM), Grid5000, Pariente, Manuel
المصدر: ICASSP 2020-45th International Conference on Acoustics, Speech, and Signal Processing
ICASSP 2020-45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
ICASSPمصطلحات موضوعية: Signal Processing (eess.SP), FOS: Computer and information sciences, Masking (art), Sound (cs.SD), Computer Science - Machine Learning, Computer science, Speech recognition, 02 engineering and technology, Computer Science - Sound, Machine Learning (cs.LG), Set (abstract data type), 030507 speech-language pathology & audiology, 03 medical and health sciences, End-to-end principle, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Electrical Engineering and Systems Science - Signal Processing, Short-time Fourier transform, 020206 networking & telecommunications, Filter bank, Speaker recognition, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD], [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD], 0305 other medical science, Electrical Engineering and Systems Science - Audio and Speech Processing
وصف الملف: application/pdf
-
5
المؤلفون: Emmanuel Vincent, Manuel Pariente, Antoine Deleforge
المساهمون: Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Grid5000, Pariente, Manuel, Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)
المصدر: INTERSPEECH 2019-20th Annual Conference of the International Speech Communication Association
INTERSPEECH 2019-20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
INTERSPEECHمصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Iterative method, Computer science, Posterior probability, Machine Learning (stat.ML), 02 engineering and technology, Computer Science - Sound, Machine Learning (cs.LG), Non-negative matrix factorization, 030507 speech-language pathology & audiology, 03 medical and health sciences, symbols.namesake, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Statistics - Machine Learning, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, 020206 networking & telecommunications, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD], Speech enhancement, [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD], symbols, Spectrogram, 0305 other medical science, Gradient descent, Algorithm, Encoder, Electrical Engineering and Systems Science - Audio and Speech Processing, Gibbs sampling
وصف الملف: application/pdf
-
6
مصطلحات موضوعية: Asteroid, audio source separation, Surgical_mask_speech_enhancement_v1, enhancement, DeMask, pretrained model
Relation: https://zenodo.org/communities/asteroid-models; https://doi.org/10.5281/zenodo.3997046; https://doi.org/10.5281/zenodo.3997047; oai:zenodo.org:3997047
-
7
المؤلفون: Manuel Pariente
مصطلحات موضوعية: Asteroid, audio source separation, WHAM!, sep_clean, DPRNNTasNet, pretrained model
Relation: https://zenodo.org/communities/asteroid-models; https://doi.org/10.5281/zenodo.3903794; https://doi.org/10.5281/zenodo.3903795; oai:zenodo.org:3903795
-
8
المؤلفون: Daniel Pressnitzer, Manuel Pariente
المصدر: The Journal of the Acoustical Society of America. 142:2611-2611
مصطلحات موضوعية: Signal-to-noise ratio, Acoustics and Ultrasonics, Arts and Humanities (miscellaneous), Computer science, Speech recognition, Acoustics, Noise reduction, Deep neural networks, Percentage point, Intelligibility (communication), Speech in noise