-
1Academic Journal
المؤلفون: Peng Chen, Binh Thien Nguyen, Yuting Geng, Kenta Iwai, Takanobu Nishiura
المصدر: IEEE Access, Vol 12, Pp 152036-152044 (2024)
مصطلحات موضوعية: Single-channel speech separation, time-frequency mask, deep neural network, joint network, ideal binary mask, ideal ratio mask, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
2Academic Journal
المؤلفون: Peng Chen, Binh Thien Nguyen, Kenta Iwai, Takanobu Nishiura
المصدر: Information, Vol 15, Iss 10, p 608 (2024)
مصطلحات موضوعية: single-channel speech separation, deep neural network, ideal binary mask, ideal ratio mask, combination of time–frequency mask, Information technology, T58.5-58.64
وصف الملف: electronic resource
-
3Academic JournalSpeaker-Independent Audio-Visual Speech Separation Based on Transformer in Multi-Talker Environments
المؤلفون: Jing WANG, Weiming YI, Xiang XIE, Yiyu LUO
المصدر: IEICE Transactions on Information and Systems. 2022, E105.D(4):766
-
4Academic Journal
المؤلفون: Chiho Haruta, Nobutaka Ono, 小野 順貴, 春田 智穂
المصدر: 日本音響学会誌 / THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN. 2022, 78(5):227
-
5Academic Journal
المؤلفون: Katsutoshi Itoyama, Kazuhiro Nakadai, Kenji Nishida, Masahiko Fujita, 中臺 一博, 糸山 克寿, 藤田 雅彦, 西田 健次
المصدر: 日本ロボット学会誌 / Journal of the Robotics Society of Japan. 2022, 40(7):631
-
6ConferenceFace Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
المؤلفون: Giovanni Morrone, Luca Pasa, Vadim Tikhanoff, Sonia Bergamaschi, Luciano Fadiga, Leonardo Badino
المساهمون: Morrone, Giovanni, Pasa, Luca, Tikhanoff, Vadim, Bergamaschi, Sonia, Fadiga, Luciano, Badino, Leonardo
مصطلحات موضوعية: Audio-Visual Speech Enhancement, Cocktail Party Problem, Time-Frequency Mask, LSTM, Face Landmarks
Relation: info:eu-repo/semantics/altIdentifier/wos/WOS:000482554007027; ispartofbook:Proceedings of the 44th IEEE International Conference on Acoustics, Speech and Signal Processing; 44th IEEE International Conference on Acoustics, Speech and Signal Processing; serie:PROCEEDINGS OF THE . IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING; http://hdl.handle.net/11380/1170465; info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85068987044; https://ieeexplore.ieee.org/document/8682061; https://dr-pato.github.io/audio_visual_speech_enhancement/
-
7Academic Journal
المؤلفون: Xuliang Li, Zhaogui Ding, Weifeng Li, Qingmin Liao
المصدر: Sensors; Volume 17; Issue 6; Pages: 1447
مصطلحات موضوعية: delay-and-sum beamforming, binary time-frequency mask, cosine function
وصف الملف: application/pdf
Relation: Physical Sensors; https://dx.doi.org/10.3390/s17061447
الاتاحة: https://doi.org/10.3390/s17061447
-
8Academic Journal
المؤلفون: Ming XIAO, Feng GAO, Gong-xian SUN, Sheng-li XIE
المصدر: Tongxin xuebao, Vol 33, Pp 77-84 (2012)
مصطلحات موضوعية: underdetermined blind source separation, sparse component analysis, time-frequency mask, blind source extraction, Telecommunication, TK5101-6720
وصف الملف: electronic resource
Relation: https://doaj.org/toc/1000-436X
-
9Academic Journal
المؤلفون: Shireesha, K., RajaShekar, T.
المصدر: International Journal of Innovative Technology and Research; Vol 4, No 5 (2016): August - September 2016; 3692-3695
مصطلحات موضوعية: ECE, Ensemble Learning, Speech Enhancement, Time-Frequency Mask, Intelligibility, Speech
وصف الملف: application/pdf
-
10
المؤلفون: Yoichi Haneda, Kenta Niwa, Kobayashi Kazunori, Yusuke Hioka, Yuma Koizumi
المصدر: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26:1780-1792
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Machine Learning, Acoustics and Ultrasonics, Mean squared error, Computer science, Machine Learning (stat.ML), Probability density function, 02 engineering and technology, Computer Science - Sound, Machine Learning (cs.LG), 030507 speech-language pathology & audiology, 03 medical and health sciences, Audio and Speech Processing (eess.AS), Statistics - Machine Learning, FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Computer Science (miscellaneous), Electrical and Electronic Engineering, Sound quality, time-frequency mask, Sound-source enhancement, Artificial neural network, business.industry, deep learning, 020206 networking & telecommunications, Pattern recognition, Speech processing, Backpropagation, Time–frequency analysis, objective sound quality assessment (OSQA) score, Computational Mathematics, Artificial intelligence, 0305 other medical science, business, Gradient method, Electrical Engineering and Systems Science - Audio and Speech Processing
-
11Academic Journal
المؤلفون: Yang Shao, Soundararajan Srinivasan, Zhaozhang Jin, DeLiang Wang
المساهمون: The Pennsylvania State University CiteSeerX Archives
مصطلحات موضوعية: Speech segregation, Computational Auditory Scene Analysis, Binary time–frequency mask, Robust speech recognition, Uncertainty
وصف الملف: application/pdf
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.151.4132; http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl10.pdf
-
12Academic Journal
المؤلفون: Yang Shao, DeLiang Wang
المساهمون: The Pennsylvania State University CiteSeerX Archives
مصطلحات موضوعية: Sequential organization, Computational auditory scene analysis, Speaker quantization, Binary time–frequency mask
وصف الملف: application/pdf
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.151.5146; http://www.cse.ohio-state.edu/~dwang/papers/Shao-Wang.spcomm09.pdf
-
13Conference
المؤلفون: Moore, Alastair H., Lightburn, Leo, Xue, Wei, Naylor, Patrick A., Brookes, Mike
مصطلحات موضوعية: Assisted listening, Beamforming, Head rotation, Speech enhancement, Time frequency mask
Relation: https://repository.hkust.edu.hk/ir/Record/1783.1-125451; 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings / IEEE. Piscataway, NJ : IEEE, 2018, p. 461-465, article number 8521361; https://doi.org/10.1109/IWAENC.2018.8521361; http://lbdiscover.ust.hk/uresolver?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rfr_id=info:sid/HKUST:SPI&rft.genre=article&rft.issn=&rft.volume=&rft.issue=&rft.date=2018&rft.spage=461&rft.aulast=Moore&rft.aufirst=&rft.atitle=Binaural+mask-informed+speech+enhancement+for+hearing+AIDS+with+head+tracking&rft.title=16th+International+Workshop+on+Acoustic+Signal+Enhancement,+IWAENC+2018+-+Proceedings; http://www.scopus.com/record/display.url?eid=2-s2.0-85057404247&origin=inward; http://gateway.isiknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=LinksAMR&SrcApp=PARTNER_APP&DestLinkType=FullRecord&DestApp=WOS&KeyUT=000458323900093
الاتاحة: https://repository.hkust.edu.hk/ir/Record/1783.1-125451
https://doi.org/10.1109/IWAENC.2018.8521361
http://lbdiscover.ust.hk/uresolver?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rfr_id=info:sid/HKUST:SPI&rft.genre=article&rft.issn=&rft.volume=&rft.issue=&rft.date=2018&rft.spage=461&rft.aulast=Moore&rft.aufirst=&rft.atitle=Binaural+mask-informed+speech+enhancement+for+hearing+AIDS+with+head+tracking&rft.title=16th+International+Workshop+on+Acoustic+Signal+Enhancement,+IWAENC+2018+-+Proceedings
http://www.scopus.com/record/display.url?eid=2-s2.0-85057404247&origin=inward
http://gateway.isiknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=LinksAMR&SrcApp=PARTNER_APP&DestLinkType=FullRecord&DestApp=WOS&KeyUT=000458323900093 -
14Academic Journal
المساهمون: The Pennsylvania State University CiteSeerX Archives
مصطلحات موضوعية: Speech segregation, Computational Auditory Scene Analysis, Binary time–frequency mask, Robust speech recognition, Un- 23 certainty decoding 24
وصف الملف: application/pdf
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.422.2957; http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl08.pdf
-
15Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
المؤلفون: Giovanni Morrone, Sonia Bergamaschi, Vadim Tikhanoff, Luciano Fadiga, Leonardo Badino, Luca Pasa
المصدر: ICASSP
مصطلحات موضوعية: Masking (art), FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Computer science, Speech recognition, Socio-culturale, 02 engineering and technology, Cocktail party effect, Computer Science - Sound, face land-marks, Machine Learning (cs.LG), 030507 speech-language pathology & audiology, 03 medical and health sciences, LS5_1, Audio and Speech Processing (eess.AS), 0202 electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, Face Landmarks, time-frequency mask, Computer Science - Computation and Language, Landmark, Noise measurement, Audio-Visual Speech Enhancement, 020206 networking & telecommunications, audio-visual speech enhancement, Speech enhancement, cocktail party problem, LSTM, Time-Frequency Mask, Face (geometry), Audio-Visual Speech Enhancement, Cocktail Party Problem, Time-Frequency Mask, LSTM, Face Landmarks, Spectrogram, 0305 other medical science, Cocktail Party Problem, Computation and Language (cs.CL), Electrical Engineering and Systems Science - Audio and Speech Processing
-
16
المؤلفون: Leo Lightburn, Mike Brookes, Alastair H. Moore, Wei Xue, Patrick A. Naylor
المساهمون: Engineering & Physical Science Research Council (EPSRC)
المصدر: International Workshop on Acoustic Signal Enhancement (IWAENC 2018)
IWAENCمصطلحات موضوعية: Beamforming, Technology, Assisted listening, Computer science, Speech recognition, Speech enhancement, 02 engineering and technology, Intelligibility (communication), 030507 speech-language pathology & audiology, 03 medical and health sciences, Automation & Control Systems, Engineering, 0202 electrical engineering, electronic engineering, information engineering, Head rotation, Minimum mean square error, Science & Technology, Artificial neural network, 020206 networking & telecommunications, Engineering, Electrical & Electronic, Time–frequency analysis, A priori and a posteriori, INTELLIGIBILITY, 0305 other medical science, Binaural recording, Time-frequency mask
-
17Academic JournalSparse source separation based on simultaneous clustering of source locational and spectral features
المؤلفون: Hiroshi Sawada, Shoko Araki, Tomohiro Nakatani
المصدر: Acoustical Science and Technology. 2011, 32(4):161
-
18Conference
المؤلفون: Zou, Y. X., Wang, Y. Q., Wang, Peng, Ritz, C. H., Xi, Jiangtao
المساهمون: Zou, YX (reprint author), Peking Univ, Sch Elect Comp Engn, ADSPLAB ELIP, Shenzhen, Peoples R China., Peking Univ, Sch Elect Comp Engn, ADSPLAB ELIP, Shenzhen, Peoples R China., Univ Wollongong, Sch Elect Comp & Telecom Engn, Wollongong, NSW, Australia.
المصدر: SCI
مصطلحات موضوعية: Speech enhancement, time-frequency mask, acoustic vector sensor, Wiener post-filter, power spectral density, PERFORMANCE, NOISE
Relation: 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP).Hong Kong, PEOPLES R CHINA,2014/1/1.; 1331873; http://hdl.handle.net/20.500.11897/424331; WOS:000361019500102
-
19Academic Journal
المؤلفون: Shao,Yang, Srinivasan,Soundararajan, Jin,Zhaozhang, Wang,DeLiang
المساهمون: Ohio State University Columbus United States
-
20Academic Journal
المؤلفون: Srinivasan,Soundararajan, Wang,DeLiang
المساهمون: Ohio State University Columbus United States
مصطلحات موضوعية: INTELLIGIBILITY, SPEECH RECOGNITION, REGRESSION ANALYSIS, NOISY SPEECH SIGNALS, BINARY MASKS, SPEECH INTELLIGIBILITY, UNCERTAINTY DECODING, CASA(COMPUTATIONAL AUDITORY SCENE ANALYSIS), BINARY T-F MASK(BINARY TIME FREQUENCY MASK)
وصف الملف: text/html