-
1Academic Journal
المصدر: Applications of Modelling and Simulation, Vol 9, Pp 22-36 (2025)
مصطلحات موضوعية: deep neural network, low-resourced, machine learning, speaker diarization, x-vectors, Engineering (General). Civil engineering (General), TA1-2040, Technology (General), T1-995
وصف الملف: electronic resource
-
2Academic JournalA lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams
المصدر: EURASIP Journal on Audio, Speech, and Music Processing, Vol 2024, Iss 1, Pp 1-16 (2024)
مصطلحات موضوعية: Speaker diarization, Streamed data processing, Multi-modal, Audio-visual, Deep learning, Acoustics. Sound, QC221-246, Electronic computers. Computer science, QA75.5-76.95
وصف الملف: electronic resource
Relation: https://doaj.org/toc/1687-4722
-
3Academic Journal
المؤلفون: Robin Brutschi, Rui Wang, Michaela Kolbe, Kerrin Weiss, Quentin Lohmeyer, Mirko Meboldt
المصدر: Advances in Simulation, Vol 9, Iss 1, Pp 1-13 (2024)
مصطلحات موضوعية: Simulation, Healthcare, Debriefing, Education, Speaker diarization, Sociograms, Computer applications to medicine. Medical informatics, R858-859.7
وصف الملف: electronic resource
Relation: https://doaj.org/toc/2059-0628
-
4Dissertation/ Thesis
المؤلفون: Zelenák, Martin
المساهمون: University/Department: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Thesis Advisors: Hernando Pericás, Francisco Javier
المصدر: TDX (Tesis Doctorals en Xarxa)
مصطلحات موضوعية: Overlapping speech detection, Speaker overlap, Speaker diarization, Spatial features, Cross-correlation, Prosody
Time: 621.3
وصف الملف: application/pdf
URL الوصول: http://hdl.handle.net/10803/72431
-
5Academic Journal
المؤلفون: Michael Nigro, Sridhar Krishnan
المصدر: Machine Learning with Applications, Vol 18, Iss , Pp 100593- (2024)
مصطلحات موضوعية: Audio signal processing, Speaker diarization, Sound event detection, Audio scene analysis, Machine learning, Environmental sounds, Cybernetics, Q300-390, Electronic computers. Computer science, QA75.5-76.95
وصف الملف: electronic resource
-
6Conference
المؤلفون: Pagés, Clément, Bredin, Hervé
المساهمون: Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Centre National de la Recherche Scientifique (CNRS)
المصدر: Interspeech 2024 ; https://hal.science/hal-04734839 ; Interspeech 2024, Sep 2024, Kos, Greece
مصطلحات موضوعية: speaker diarization annotation process pyannote gradio, speaker diarization, annotation process, pyannote, gradio, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
-
7Conference
المؤلفون: Rahou, Bilal, Bredin, Hervé
المساهمون: Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Centre National de la Recherche Scientifique (CNRS)
المصدر: Interspeech 2024 ; https://hal.science/hal-04734819 ; Interspeech 2024, Sep 2024, Kos, Greece. pp.1610-1614, ⟨10.21437/Interspeech.2024-923⟩
مصطلحات موضوعية: speaker diarization speaker segmentation low latency lookahead, speaker diarization, speaker segmentation, low latency, lookahead, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
-
8
المؤلفون: Nascimento, Pedro Miguel Delgado do
مصطلحات موضوعية: Transcrição automática, Debates parlamentares, Reconhecimento de fala, Processamento de linguagem natural - -- NLP Natural language processing, Machine learning, Diarização de orador, Automatic transcription, Parliamentary debates, Speech recognition, Speaker diarization
وصف الملف: application/pdf
الاتاحة: http://hdl.handle.net/10071/31035
-
9Academic Journal
المصدر: IEEE Access, Vol 12, Pp 134702-134713 (2024)
مصطلحات موضوعية: Audio analysis, deep learning, machine learning, multi-modal learning analytics, speaker diarization, teaching practices, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
10Conference
المؤلفون: Plaquet, Alexis, Bredin, Hervé
المساهمون: Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Université Toulouse III - Paul Sabatier (UT3), Centre National de la Recherche Scientifique (CNRS), ISCA: International Speech Communication Association
المصدر: INTERSPEECH 2024 ; 25th Interspeech Conference (INTERSPEECH 2024) ; https://hal.science/hal-04696316 ; 25th Interspeech Conference (INTERSPEECH 2024), ISCA: International Speech Communication Association, Sep 2024, Kos, Greece. pp.3764--3768, ⟨10.21437/Interspeech.2024-1060⟩
مصطلحات موضوعية: Speaker diarization, Calibration, Powerset classification, Confidence Estimation, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
-
11Conference
المؤلفون: Kalda, Joonas, Alumae, Tanel, Lebourdais, Martin, Bredin, Hervé, Baroudi, Séverin, Marxer, Ricard
المساهمون: Tallinn University, Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Centre National de la Recherche Scientifique (CNRS), DYNamiques de l’Information (DYNI), Laboratoire d'Informatique et des Systèmes (LIS) (Marseille, Toulon) (LIS), Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)
المصدر: Interspeech 2024 ; 25th Interspeech Conference (Interspeech 2024) ; https://hal.science/hal-04683362 ; 25th Interspeech Conference (Interspeech 2024), Sep 2024, Kos, Greece. pp.1635--1639, ⟨10.21437/interspeech.2024-2462⟩ ; https://www.isca-archive.org/interspeech_2024/kalda24_interspeech.html
مصطلحات موضوعية: speech perception, intelligibility prediction, Whisper, deep learning, DISPLACE 2024, speaker diarization, language diarization, [INFO]Computer Science [cs], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
Relation: hal-04683362; https://hal.science/hal-04683362; https://hal.science/hal-04683362/document; https://hal.science/hal-04683362/file/kalda24_interspeech.pdf
-
12Conference
المؤلفون: Cui, Can, Sheikh, Imran, Ahamad, Sadeghi, Mostafa, Vincent, Emmanuel
المساهمون: Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Vivoka
المصدر: The Speaker and Language Recognition Workshop Odyssey 2024 ; https://hal.science/hal-04495886 ; The Speaker and Language Recognition Workshop Odyssey 2024, Jun 2024, Quebec, Canada
مصطلحات موضوعية: Multi-speaker ASR, speaker diarization, speaker embedding, AMI, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
جغرافية الموضوع: Quebec
Time: Quebec, Canada
Relation: info:eu-repo/semantics/altIdentifier/arxiv/2403.06570; ARXIV: 2403.06570
-
13Conference
المؤلفون: Mariotte, Théo, Larcher, Anthony, Montrésor, Silvio, Thomas, Jean-Hugh
المساهمون: Laboratoire d'Acoustique de l'Université du Mans (LAUM), Le Mans Université (UM)-Centre National de la Recherche Scientifique (CNRS), Laboratoire d'Informatique de l'Université du Mans (LIUM), Le Mans Université (UM), International Speech Communication Association (ISCA), European Project: 101007666.,ESPERANTO
المصدر: Interspeech Proceedings ; Interspeech ; https://univ-lemans.hal.science/hal-04602289 ; Interspeech, International Speech Communication Association (ISCA), Sep 2024, Kos / Greece, Greece
مصطلحات موضوعية: speaker diarization, distant speech, multimicrophone, explainable AI, [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
جغرافية الموضوع: Kos / Greece, Greece
Time: Kos / Greece, Greece
Relation: info:eu-repo/grantAgreement//101007666./EU/Exchanges for SPEech ReseArch aNd TechnOlogies/ESPERANTO; hal-04602289; https://univ-lemans.hal.science/hal-04602289; https://univ-lemans.hal.science/hal-04602289/document; https://univ-lemans.hal.science/hal-04602289/file/2024_asobo_interspeech-1.pdf
-
14Conference
المؤلفون: Tahon, Marie, Larcher, Anthony, Lebourdais, Martin, Fethi, Bougares, Silnova, Ana, Gimeno, Pablo
المساهمون: Laboratoire d'Informatique de l'Université du Mans (LIUM), Le Mans Université (UM), ELYADATA, Brno University of Technology Brno (BUT), Universitad de Zaragoza (UNIZAR), GENCI–IDRIS (Grant 2022-AD011012565), ANR-19-CE38-0012,GEM,Mesure de l'égalité entre les sexes dans les médias(2019), European Project: 101007666.,ESPERANTO
المصدر: Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) ; https://hal.science/hal-04578441 ; Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy
مصطلحات موضوعية: segmentation, music, noise, overlap, speaker diarization, transcription, media, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Relation: info:eu-repo/grantAgreement//101007666./EU/Exchanges for SPEech ReseArch aNd TechnOlogies/ESPERANTO
-
15Conference
المؤلفون: Izquierdo del Álamo, Sergio, Labrador Serrano, Beltrán, Lozano Díez, Alicia, Torre Toledano, Doroteo
المساهمون: UAM. Departamento de Tecnología Electrónica y de las Comunicaciones, Audias - Audio, Data Intelligence and Speech
مصطلحات موضوعية: speaker diarization, end-to-end neural diarization, efficient transformers, Telecomunicaciones
وصف الملف: application/pdf
Relation: Proceedings IberSPEECH 2022; https://doi.org/10.21437/IberSPEECH.2022-32; November 14-16, 2022; Granada (Spain); IberSPEECH 2022; Gobierno de España. RTI2018-098091-B-I00; IberSPEECH 2022. ISCA, Granada, Spain, 14-16 November 2022; http://hdl.handle.net/10486/712182; 156; 160
-
16Conference
المؤلفون: Landini, Federico, Diez, Mireia, Lozano-Diez, Alicia, Burget, Lukas
المساهمون: UAM. Departamento de Tecnología Electrónica y de las Comunicaciones, AUDIAS (Audio Data Intelligence and Speech)
مصطلحات موضوعية: end-To-end neural diarization, simulated conversations, Speaker diarization, Telecomunicaciones
وصف الملف: application/pdf
Relation: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; https://doi.org/10.1109/ICASSP49357.2023.10097049; June 4-10, 2023; Rhodes Island (Greece); International Conference on Acoustics, Speech, and Signal Processing (ICASSP); Gobierno de España. PID2021-125943OB-I00; International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes Island, Greece, 2023; http://hdl.handle.net/10486/711978
-
17Conference
المؤلفون: Gruttadauria, Elio, Fontaine, Mathieu, Essid, Slim
المساهمون: Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Département Images, Données, Signal (IDS), Télécom ParisTech, ANR-22-CE23-0011,SAROUMANE,Segmentation et regroupement de locuteurs via un modèle robuste unifié audio spatial et multimodal(2022)
المصدر: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)
IEEE International Conference on Acoustics, Speech, and Signal Processing
https://hal.science/hal-04419041
IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2024, Seoul (Korea), South Koreaمصطلحات موضوعية: Speaker Diarization, Source separation, Online inference, Overlapped speech, AMI dataset, Speaker embedding, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
جغرافية الموضوع: Seoul (Korea), South Korea
Relation: hal-04419041; https://hal.science/hal-04419041; https://hal.science/hal-04419041/document; https://hal.science/hal-04419041/file/ICASSP_2024_ELIO_GRUTTADAURIA-final.pdf
-
18Academic Journal
المؤلفون: Ke-Ming Lyu, Ren-yuan Lyu, Hsien-Tsung Chang
المصدر: PeerJ Computer Science, Vol 10, p e1973 (2024)
مصطلحات موضوعية: Automatic speech recognition, Speaker diarization, Real-time system, Incremental clustering, Electronic computers. Computer science, QA75.5-76.95
وصف الملف: electronic resource
-
19Dissertation/ Thesis
المؤلفون: Anguera Miró, Xavier
المساهمون: University/Department: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Thesis Advisors: Wooters, Charles, Hernando Pericás, Francisco Javier
المصدر: TDX (Tesis Doctorals en Xarxa)
مصطلحات موضوعية: acoustic beamforming, speaker diarization, speaker clustering, speaker change detection, speaker segmentation, signal enhancement
وصف الملف: application/pdf
-
20Academic Journal
المؤلفون: Weijun Pan, Yidi Wang, Yumei Zhang, Boyuan Han
المصدر: Aerospace, Vol 11, Iss 7, p 599 (2024)
مصطلحات موضوعية: radiotelephone communications, speaker diarization, text-related clustering, ATCSPEECH dataset, Motor vehicles. Aeronautics. Astronautics, TL1-4050
وصف الملف: electronic resource