-
1
-
2Report
المؤلفون: Phan, Huy, Nguyen, Huy Le, Chén, Oliver Y., Koch, Philipp, Duong, Ngoc Q. K., McLoughlin, Ian, Mertins, Alfred
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/2010.09132
-
3Report
-
4Report
-
5Report
المصدر: ICCV 2019
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
URL الوصول: http://arxiv.org/abs/1812.01973
-
6Report
المؤلفون: Parekh, Sanjeel, Essid, Slim, Ozerov, Alexey, Duong, Ngoc Q. K., Pérez, Patrick, Richard, Gaël
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
URL الوصول: http://arxiv.org/abs/1804.07345
-
7Report
المؤلفون: Vo, Huy V., Duong, Ngoc Q. K., Perez, Patrick
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
URL الوصول: http://arxiv.org/abs/1803.10348
-
8
-
9Conference
المساهمون: Institut de Recherche en Communications et en Cybernétique de Nantes (IRCCyN), Mines Nantes (Mines Nantes)-École Centrale de Nantes (ECN)-Ecole Polytechnique de l'Université de Nantes (EPUN), Université de Nantes (UN)-Université de Nantes (UN)-PRES Université Nantes Angers Le Mans (UNAM)-Centre National de la Recherche Scientifique (CNRS), Technicolor R & I Cesson Sévigné, Technicolor
المصدر: ICMR '18: 2018 International Conference on Multimedia Retrieval ; https://hal.science/hal-01785130 ; ICMR '18: 2018 International Conference on Multimedia Retrieval , Jun 2018, Yokohama, Japan. ⟨10.1145/3206025.3206056⟩
مصطلحات موضوعية: Scene understanding, Long-term memory, Video memorability, Measurement protocol, Global video features, Attributes, [SCCO.COMP]Cognitive science/Computer science, [SCCO.PSYC]Cognitive science/Psychology
Relation: hal-01785130; https://hal.science/hal-01785130; https://hal.science/hal-01785130/document; https://hal.science/hal-01785130/file/Annotating_Understanding_and_Predicting_Long-term_Video_Memorability.pdf
-
10Conference
المساهمون: International Research Institute MICA (MICA), Institut National Polytechnique de Grenoble (INPG)-Hanoi University of Science and Technology (HUST)-Centre National de la Recherche Scientifique (CNRS), Technicolor R & I Cesson Sévigné, Technicolor, Hanoi University of Science and Technology (HUST)
المصدر: 14th Int. Conf. on Latent Variable Analysis and Signal Separation (LVA ICA)
https://hal.science/hal-01740052
14th Int. Conf. on Latent Variable Analysis and Signal Separation (LVA ICA), Jul 2018, London, United Kingdomمصطلحات موضوعية: Multichannel audio source separation, generic spectral model, non- negative matrix factorization, spatial covariance model, Gaussian modeling, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
جغرافية الموضوع: London, United Kingdom
-
11Academic Journal
المساهمون: International Research Institute MICA (MICA), Institut National Polytechnique de Grenoble (INPG)-Hanoi University of Science and Technology (HUST)-Centre National de la Recherche Scientifique (CNRS), Technicolor, Hanoi University of Science and Technology (HUST)
المصدر: ISSN: 2329-9290.
مصطلحات موضوعية: [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
-
12Academic Journal
المؤلفون: Parekh, Sanjeel, Essid, Slim, Ozerov, Alexey, Duong, Ngoc, Q. K., Pérez, Patrick, Richard, Gael
المساهمون: Technicolor R & I Cesson Sévigné, Technicolor, Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Département Images, Données, Signal (IDS), Télécom ParisTech, Valeo.ai, VALEO
المصدر: ISSN: 2329-9290.
مصطلحات موضوعية: Index Terms-Multimodal classification, sound event detection, object localization, multiple instance learning, deep learning, audio-visual fusion, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Relation: hal-02399993; https://telecom-paris.hal.science/hal-02399993; https://telecom-paris.hal.science/hal-02399993/document; https://telecom-paris.hal.science/hal-02399993/file/2019-IEEE_TASLP_Parekh.pdf
-
13Book
المؤلفون: Essid, Slim, Parekh, Sanjeel, Duong, Ngoc, Q. K., Serizel, Romain, Ozerov, Alexey, Antonacci, Fabio, Sarti, Augusto
المساهمون: Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris, Institut Mines-Télécom Paris (IMT)-Institut Polytechnique de Paris (IP Paris)-Institut Polytechnique de Paris (IP Paris)-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Institut Mines-Télécom Paris (IMT)-Institut Polytechnique de Paris (IP Paris)-Institut Polytechnique de Paris (IP Paris), Département Images, Données, Signal (IDS), Télécom ParisTech, Technicolor R & I Cesson Sévigné, Technicolor, Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Dipartimento di Elettronica e Informazione, Politecnico di Milano Milan (POLIMI), Tuomas Virtanen, Mark D. Plumbley, Dan Ellis
المصدر: Computational Analysis of Sound Scenes and Events ; https://hal.science/hal-01620341 ; Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer, pp.243-276, 2017, 978-3319634494. ⟨10.1007/978-3-319-63450-0_9⟩ ; http://www.springer.com/gp/book/9783319634494
مصطلحات موضوعية: Multimodal scene analysis, Tensor factorization, Joint audiovisual scene analysis, Representation learning, Multichannel audio, Wiener filtering, Data fusion, Multichannel, Audio source separation, Beamforming, Matrix factorization, Multiview scene analysis, Audio source localization and tracking, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing, [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]
-
14Conference
المؤلفون: Demarty, Claire-Hélène, Sjöberg, Mats Viktor, Ionescu, Bogdan, Do, Thanh-Toan, Wang, Hanli, Duong, Ngoc Q. K., Lefèbvre, Frédéric
المساهمون: Helsinki Institute for Information Technology, Intelligent Interactive Information Access research group / Patrik Floréen
مصطلحات موضوعية: Computer and information sciences
وصف الملف: application/pdf
Relation: Demarty , C-H , Sjöberg , M V , Ionescu , B , Do , T-T , Wang , H , Duong , N Q K & Lefèbvre , F 2016 , ' MediaEval 2016 Predicting Media Interestingness Task ' , CEUR Workshop Proceedings , vol. 1739 . < http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_1.pdf >; http://hdl.handle.net/10138/178118; 8549e885-8121-4366-a97a-30ed12d5184b
الاتاحة: http://hdl.handle.net/10138/178118
-
15Conference
المؤلفون: Duong, Ngoc Q. K., Berthet, Pierre, Zabre, Sidkieta, Kerdranvat, Michel, Ozerov, Alexey, Chevallier, Louis
المساهمون: Technicolor R & I Cesson Sévigné, Technicolor, Alten, ALTRAN (FRANCE)
المصدر: 13th International Conference on Latent Variable Analysis and Signal Separation
https://hal.inria.fr/hal-01288219
13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, Franceمصطلحات موضوعية: sound capture, Audio zoom on smartphone, robust adaptive beamformer, post-processing, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Relation: hal-01288219; https://hal.inria.fr/hal-01288219; https://hal.inria.fr/hal-01288219/document; https://hal.inria.fr/hal-01288219/file/eusipco2016.pdf
-
16
-
17Conference
المساهمون: Hanoi University of Mining and Geology (HUMG), International Research Institute MICA (MICA), Institut National Polytechnique de Grenoble (INPG)-Hanoi University of Science and Technology (HUST)-Centre National de la Recherche Scientifique (CNRS), Hanoi University of Science and Technology (HUST), Technicolor R & I Cesson Sévigné, Technicolor
المصدر: IEEE International Conference on Electronics, Information and Communication ; https://inria.hal.science/hal-01288277 ; IEEE International Conference on Electronics, Information and Communication, Jan 2016, Da Nang, Vietnam
مصطلحات موضوعية: Speaker-dependent speech enhancement, non-negative matrix factorization, group sparsity, generic spectral model, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]
-
18Conference
المؤلفون: Prablanc, Pierre, Ozerov, Alexey, Duong, Ngoc Q. K., Pérez, Patrick
المساهمون: Technicolor R & I Cesson Sévigné, Technicolor, ANR-14-CE27-0002,MAD,Inpainting de données audio manquantes(2014)
المصدر: 24th European Signal Processing Conference (EUSIPCO 2016)
https://hal.inria.fr/hal-01271257
24th European Signal Processing Conference (EUSIPCO 2016), Aug 2016, Budapest, Hungaryمصطلحات موضوعية: Gaussian mixture model, voice con-version, audio inpainting, speech inpainting, speech synthesis, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Relation: hal-01271257; https://hal.inria.fr/hal-01271257; https://hal.inria.fr/hal-01271257v2/document; https://hal.inria.fr/hal-01271257v2/file/eusipco16a.pdf
-
19Conference
المؤلفون: Duong, Hien-Thanh, Nguyen, Quoc-Cuong, Nguyen, Cong-Phuong, Tran, Thanh-Huan, Duong, Ngoc Q. K.
المساهمون: Hanoi University of Mining and Geology (HUMG), International Research Institute MICA (MICA), Institut National Polytechnique de Grenoble (INPG)-Hanoi University of Science and Technology (HUST)-Centre National de la Recherche Scientifique (CNRS), Hanoi University of Science and Technology (HUST), Hanoi University of Industry (HAUI), Technicolor R & I Cesson Sévigné, Technicolor
المصدر: 6th ACM International Symposium on Information and Communication Technology
https://inria.hal.science/hal-01288291
6th ACM International Symposium on Information and Communication Technology, Dec 2015, Hanoi, Vietnam. ⟨10.1145/2833258.2833276⟩مصطلحات موضوعية: Speech enhancement, audio source separation, nonnegative matrix factorization, multiplicative update, spectral model, group sparsity, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
-
20Conference
المؤلفون: El Badawy, Dalia, Ozerov, Alexey, Duong, Ngoc Q. K.
المساهمون: Technicolor R & I Cesson Sévigné, Technicolor
المصدر: Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'15) ; https://hal.inria.fr/hal-01120009 ; Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'15), Apr 2015, Brisbane, Australia
مصطلحات موضوعية: universal model, audio source separation, group sparsity, non-negative matrix factoriza-tion, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Relation: hal-01120009; https://hal.inria.fr/hal-01120009; https://hal.inria.fr/hal-01120009v2/document; https://hal.inria.fr/hal-01120009v2/file/paper.pdf