-
1Report
المؤلفون: Almudévar, Antonio, Serizel, Romain, Ortega, Alfonso
مصطلحات موضوعية: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/2411.00153
-
2Report
مصطلحات موضوعية: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
URL الوصول: http://arxiv.org/abs/2410.04951
-
3Report
المصدر: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2025, Hyderabad, India
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Signal Processing
URL الوصول: http://arxiv.org/abs/2410.05301
-
4Report
-
5Report
مصطلحات موضوعية: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/2409.08589
-
6Report
-
7Report
المؤلفون: Douwes, Constance, Serizel, Romain
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Sound
URL الوصول: http://arxiv.org/abs/2409.05080
-
8Report
مصطلحات موضوعية: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/2409.02915
-
9Report
-
10Report
-
11Report
المؤلفون: Monir, Nasser-Eddine, Magron, Paul, Serizel, Romain
مصطلحات موضوعية: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/2401.13548
-
12Report
-
13Report
المؤلفون: Ronchini, Francesca, Serizel, Romain
مصطلحات موضوعية: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
URL الوصول: http://arxiv.org/abs/2310.03455
-
14Report
المؤلفون: Sadeghi, Mostafa, Serizel, Romain
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Signal Processing, Statistics - Machine Learning
URL الوصول: http://arxiv.org/abs/2309.10439
-
15Report
المؤلفون: Ayilo, Jean-Eudes, Sadeghi, Mostafa, Serizel, Romain
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Signal Processing, Statistics - Machine Learning
URL الوصول: http://arxiv.org/abs/2309.10457
-
16Report
المؤلفون: Nortier, Berné, Sadeghi, Mostafa, Serizel, Romain
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Signal Processing, Statistics - Machine Learning
URL الوصول: http://arxiv.org/abs/2309.10450
-
17Report
-
18ReportPretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning
المؤلفون: Moummad, Ilyass, Serizel, Romain, Farrugia, Nicolas
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/2309.00878
-
19Conference
المساهمون: Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Vers des robots à l’intelligence sociale au travers de l’apprentissage, de la perception et de la commande (ROBOTLEARN), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Grenoble Alpes (UGA), IEEE, ANR-22-CE23-0026,REAVISE,Amélioration de la parole audiovisuelle basée sur l'apprentissage profond, robuste et efficace(2022)
المصدر: International Conference on Acoustics Speech and Signal Processing (ICASSP) ; https://hal.science/hal-04718254 ; International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2025, Hyderabad, India
مصطلحات موضوعية: unsupervised learning, audio-visual speech enhancement, diffusion models, posterior sampling, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Time: Hyderabad, India
Relation: info:eu-repo/semantics/altIdentifier/arxiv/2410.05301; ARXIV: 2410.05301
-
20Report
المؤلفون: Roman, Robin San, Adi, Yossi, Deleforge, Antoine, Serizel, Romain, Synnaeve, Gabriel, Défossez, Alexandre
المصدر: Thirty-seventh Conference on Neural Information Processing Systems (2023)
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/2308.02560