-
1Dissertation/ Thesis
المؤلفون: Slizovskaia, Olga
المساهمون: University/Department: Universitat Pompeu Fabra. Departament de Tecnologies de la Informació i les Comunicacions
Thesis Advisors: Gómez Gutiérrez, Emilia, Haro Ortega, Gloria
المصدر: TDX (Tesis Doctorals en Xarxa)
مصطلحات موضوعية: Audio-visual deep learning, Multimodal deep learning, Music information retrieval, Musical performance video, Musical performance analysis, Musical instrument classification, Sound source separation, Fusion techniques, Conditioning techniques, Aprendizaje profundo audiovisual, Aprendizaje profundo multimodal, Recuperación de información musical, Video musical, Análisis de interpretación musical, Clasificación de instrumentos musicales, Separación de fuentes de sonido, Técnicas de fusión, Técnicas de acondicionamiento
وصف الملف: application/pdf
URL الوصول: http://hdl.handle.net/10803/669963
-
2Academic Journal
المؤلفون: Donghyeok Jo, Jun-Hwa Kim, Jihoon Jeon, Chee Sun Won
المصدر: IEEE Access, Vol 13, Pp 6387-6396 (2025)
مصطلحات موضوعية: Audio-visual deep learning, on-screen sound separation, edge computing, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
3
المؤلفون: Daniel Michelsanti, Giovanni Morrone, Jesper Jensen, Zheng-Hua Tan
المصدر: Morrone, G, Michelsanti, D, Tan, Z-H & Jensen, J 2021, Audio-Visual Speech Inpainting with Deep Learning . in ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . vol. 2021-June, IEEE, I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings, pp. 6653-6657, ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Ontario, Canada, 06/06/2021 . https://doi.org/10.1109/ICASSP39728.2021.9413488
ICASSPمصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Machine Learning, speech inpainting, audio-visual, deep learning, face-landmarks, multi-task learning, Computer science, Speech recognition, Inpainting, Multi-task learning, Context (language use), Task (project management), Machine Learning (cs.LG), Phone, Speech inpainting, Audio and Speech Processing (eess.AS), Face-landmarks, FOS: Electrical engineering, electronic engineering, information engineering, Signal processing, business.industry, Deep learning, Image and Video Processing (eess.IV), Audio-visual, Electrical Engineering and Systems Science - Image and Video Processing, Visualization, Artificial intelligence, business, Electrical Engineering and Systems Science - Audio and Speech Processing