نتائج البحث - "Time-Frequency Mask" :: Library Catalog

تحديد النتيجة رقم 1
1

Academic Journal

Joint Deep Neural Network for Single-Channel Speech Separation on Masking-Based Training Targets

المؤلفون: Peng Chen, Binh Thien Nguyen, Yuting Geng, Kenta Iwai, Takanobu Nishiura

المصدر: IEEE Access, Vol 12, Pp 152036-152044 (2024)

مصطلحات موضوعية: Single-channel speech separation, time-frequency mask, deep neural network, joint network, ideal binary mask, ideal ratio mask, Electrical engineering. Electronics. Nuclear engineering, TK1-9971

وصف الملف: electronic resource

Relation: https://ieeexplore.ieee.org/document/10716362/; https://doaj.org/toc/2169-3536

URL الوصول: https://doaj.org/article/396c8c1864854165a624388e4c742608

View record in DOAJ Full Text Finder

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 2
2

Academic Journal

Threshold-Based Combination of Ideal Binary Mask and Ideal Ratio Mask for Single-Channel Speech Separation

المؤلفون: Peng Chen, Binh Thien Nguyen, Kenta Iwai, Takanobu Nishiura

المصدر: Information, Vol 15, Iss 10, p 608 (2024)

مصطلحات موضوعية: single-channel speech separation, deep neural network, ideal binary mask, ideal ratio mask, combination of time–frequency mask, Information technology, T58.5-58.64

وصف الملف: electronic resource

Relation: https://www.mdpi.com/2078-2489/15/10/608; https://doaj.org/toc/2078-2489

URL الوصول: https://doaj.org/article/d669799ce8d34b1badb77dffb22ac884

View record in DOAJ Full Text Finder

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 3
3

Academic Journal

Speaker-Independent Audio-Visual Speech Separation Based on Transformer in Multi-Talker Environments

المؤلفون: Jing WANG, Weiming YI, Xiang XIE, Yiyu LUO

المصدر: IEICE Transactions on Information and Systems. 2022, E105.D(4):766

View record in JSTAGE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 4
4

Academic Journal

Low-computational DNN-based speech enhancement for hearing aids / 補聴器応用のためのDNN音声強調の低演算量化の検討

المؤلفون: Chiho Haruta, Nobutaka Ono, 小野順貴, 春田智穂

المصدر: 日本音響学会誌 / THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN. 2022, 78(5):227

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 5
5

Academic Journal

Evaluation of a Speech Enhancement Method Combining Ensemble Time-Frequency Masking and Beamforming / アンサンブル時間周波数マスクとビームフォーミングを組み合わせた音声強調手法の評価

المؤلفون: Katsutoshi Itoyama, Kazuhiro Nakadai, Kenji Nishida, Masahiko Fujita, 中臺一博, 糸山克寿, 藤田雅彦, 西田健次

المصدر: 日本ロボット学会誌 / Journal of the Robotics Society of Japan. 2022, 40(7):631

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 6
6

Conference

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

المؤلفون: Giovanni Morrone, Luca Pasa, Vadim Tikhanoff, Sonia Bergamaschi, Luciano Fadiga, Leonardo Badino

المساهمون: Morrone, Giovanni, Pasa, Luca, Tikhanoff, Vadim, Bergamaschi, Sonia, Fadiga, Luciano, Badino, Leonardo

مصطلحات موضوعية: Audio-Visual Speech Enhancement, Cocktail Party Problem, Time-Frequency Mask, LSTM, Face Landmarks

Relation: info:eu-repo/semantics/altIdentifier/wos/WOS:000482554007027; ispartofbook:Proceedings of the 44th IEEE International Conference on Acoustics, Speech and Signal Processing; 44th IEEE International Conference on Acoustics, Speech and Signal Processing; serie:PROCEEDINGS OF THE . IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING; http://hdl.handle.net/11380/1170465; info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85068987044; https://ieeexplore.ieee.org/document/8682061; https://dr-pato.github.io/audio_visual_speech_enhancement/

الاتاحة: http://hdl.handle.net/11380/1170465
https://doi.org/10.1109/ICASSP.2019.8682061
https://ieeexplore.ieee.org/document/8682061
https://dr-pato.github.io/audio_visual_speech_enhancement/

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 7
7

Academic Journal

Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation

المؤلفون: Xuliang Li, Zhaogui Ding, Weifeng Li, Qingmin Liao

المصدر: Sensors; Volume 17; Issue 6; Pages: 1447

مصطلحات موضوعية: delay-and-sum beamforming, binary time-frequency mask, cosine function

وصف الملف: application/pdf

Relation: Physical Sensors; https://dx.doi.org/10.3390/s17061447

الاتاحة: https://doi.org/10.3390/s17061447

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 8
8

Academic Journal

Blind extraction of underdetermined mixtures via time-frequency mask

المؤلفون: Ming XIAO, Feng GAO, Gong-xian SUN, Sheng-li XIE

المصدر: Tongxin xuebao, Vol 33, Pp 77-84 (2012)

مصطلحات موضوعية: underdetermined blind source separation, sparse component analysis, time-frequency mask, blind source extraction, Telecommunication, TK5101-6720

وصف الملف: electronic resource

Relation: https://doaj.org/toc/1000-436X

URL الوصول: https://doaj.org/article/060e072057fa4cefb63eebe9ade8741f

View record in DOAJ

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 9
9

Academic Journal

SPEECH COMMUNICATION AND INTELLIGIBILITY ENHANCEMENT BY MACHINE LEARNING ALGORITHM

المؤلفون: Shireesha, K., RajaShekar, T.

المصدر: International Journal of Innovative Technology and Research; Vol 4, No 5 (2016): August - September 2016; 3692-3695

مصطلحات موضوعية: ECE, Ensemble Learning, Speech Enhancement, Time-Frequency Mask, Intelligibility, Speech

وصف الملف: application/pdf

Relation: http://www.ijitr.com/index.php/ojs/article/view/1044/pdf; http://www.ijitr.com/index.php/ojs/article/view/1044

الاتاحة: http://www.ijitr.com/index.php/ojs/article/view/1044

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 10
10

DNN-Based Source Enhancement to Increase Objective Sound Quality Assessment Score

المؤلفون: Yoichi Haneda, Kenta Niwa, Kobayashi Kazunori, Yusuke Hioka, Yuma Koizumi

المصدر: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26:1780-1792

مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Machine Learning, Acoustics and Ultrasonics, Mean squared error, Computer science, Machine Learning (stat.ML), Probability density function, 02 engineering and technology, Computer Science - Sound, Machine Learning (cs.LG), 030507 speech-language pathology & audiology, 03 medical and health sciences, Audio and Speech Processing (eess.AS), Statistics - Machine Learning, FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Computer Science (miscellaneous), Electrical and Electronic Engineering, Sound quality, time-frequency mask, Sound-source enhancement, Artificial neural network, business.industry, deep learning, 020206 networking & telecommunications, Pattern recognition, Speech processing, Backpropagation, Time–frequency analysis, objective sound quality assessment (OSQA) score, Computational Mathematics, Artificial intelligence, 0305 other medical science, business, Gradient method, Electrical Engineering and Systems Science - Audio and Speech Processing

URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7506ad77b4cd4f763e87d949a04536ea
https://doi.org/10.1109/taslp.2018.2842156

View record in OpenAIRE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 11
11

Academic Journal

A computational auditory scene analysis system fro speech . . .

المؤلفون: Yang Shao, Soundararajan Srinivasan, Zhaozhang Jin, DeLiang Wang

المساهمون: The Pennsylvania State University CiteSeerX Archives

المصدر: http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl10.pdf.

مصطلحات موضوعية: Speech segregation, Computational Auditory Scene Analysis, Binary time–frequency mask, Robust speech recognition, Uncertainty

وصف الملف: application/pdf

Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.151.4132; http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl10.pdf

الاتاحة: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.151.4132
http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl10.pdf

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 12
12

Academic Journal

Sequential organization of speech in computational . . .

المؤلفون: Yang Shao, DeLiang Wang

المساهمون: The Pennsylvania State University CiteSeerX Archives

المصدر: http://www.cse.ohio-state.edu/~dwang/papers/Shao-Wang.spcomm09.pdf.

مصطلحات موضوعية: Sequential organization, Computational auditory scene analysis, Speaker quantization, Binary time–frequency mask

وصف الملف: application/pdf

Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.151.5146; http://www.cse.ohio-state.edu/~dwang/papers/Shao-Wang.spcomm09.pdf

الاتاحة: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.151.5146
http://www.cse.ohio-state.edu/~dwang/papers/Shao-Wang.spcomm09.pdf

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 13
13

Conference

Binaural mask-informed speech enhancement for hearing AIDS with head tracking

المؤلفون: Moore, Alastair H., Lightburn, Leo, Xue, Wei, Naylor, Patrick A., Brookes, Mike

مصطلحات موضوعية: Assisted listening, Beamforming, Head rotation, Speech enhancement, Time frequency mask

Relation: https://repository.hkust.edu.hk/ir/Record/1783.1-125451; 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings / IEEE. Piscataway, NJ : IEEE, 2018, p. 461-465, article number 8521361; https://doi.org/10.1109/IWAENC.2018.8521361; http://lbdiscover.ust.hk/uresolver?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rfr_id=info:sid/HKUST:SPI&rft.genre=article&rft.issn=&rft.volume=&rft.issue=&rft.date=2018&rft.spage=461&rft.aulast=Moore&rft.aufirst=&rft.atitle=Binaural+mask-informed+speech+enhancement+for+hearing+AIDS+with+head+tracking&rft.title=16th+International+Workshop+on+Acoustic+Signal+Enhancement,+IWAENC+2018+-+Proceedings; http://www.scopus.com/record/display.url?eid=2-s2.0-85057404247&origin=inward; http://gateway.isiknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=LinksAMR&SrcApp=PARTNER_APP&DestLinkType=FullRecord&DestApp=WOS&KeyUT=000458323900093

الاتاحة: https://repository.hkust.edu.hk/ir/Record/1783.1-125451
https://doi.org/10.1109/IWAENC.2018.8521361
http://lbdiscover.ust.hk/uresolver?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rfr_id=info:sid/HKUST:SPI&rft.genre=article&rft.issn=&rft.volume=&rft.issue=&rft.date=2018&rft.spage=461&rft.aulast=Moore&rft.aufirst=&rft.atitle=Binaural+mask-informed+speech+enhancement+for+hearing+AIDS+with+head+tracking&rft.title=16th+International+Workshop+on+Acoustic+Signal+Enhancement,+IWAENC+2018+-+Proceedings
http://www.scopus.com/record/display.url?eid=2-s2.0-85057404247&origin=inward
http://gateway.isiknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=LinksAMR&SrcApp=PARTNER_APP&DestLinkType=FullRecord&DestApp=WOS&KeyUT=000458323900093

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 14
14

Academic Journal

ApJ, in press

المؤلفون: Yang Shao A, Soundararajan Srinivasan B, Zhaozhang Jin A, Deliang Wang A

المساهمون: The Pennsylvania State University CiteSeerX Archives

المصدر: http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl08.pdf.

مصطلحات موضوعية: Speech segregation, Computational Auditory Scene Analysis, Binary time–frequency mask, Robust speech recognition, Un- 23 certainty decoding 24

وصف الملف: application/pdf

Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.422.2957; http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl08.pdf

الاتاحة: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.422.2957
http://www.cse.ohio-state.edu/~dwang/papers/SSJW.csl08.pdf

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 15
15

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

المؤلفون: Giovanni Morrone, Sonia Bergamaschi, Vadim Tikhanoff, Luciano Fadiga, Leonardo Badino, Luca Pasa

المصدر: ICASSP

مصطلحات موضوعية: Masking (art), FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Computer science, Speech recognition, Socio-culturale, 02 engineering and technology, Cocktail party effect, Computer Science - Sound, face land-marks, Machine Learning (cs.LG), 030507 speech-language pathology & audiology, 03 medical and health sciences, LS5_1, Audio and Speech Processing (eess.AS), 0202 electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, Face Landmarks, time-frequency mask, Computer Science - Computation and Language, Landmark, Noise measurement, Audio-Visual Speech Enhancement, 020206 networking & telecommunications, audio-visual speech enhancement, Speech enhancement, cocktail party problem, LSTM, Time-Frequency Mask, Face (geometry), Audio-Visual Speech Enhancement, Cocktail Party Problem, Time-Frequency Mask, LSTM, Face Landmarks, Spectrogram, 0305 other medical science, Cocktail Party Problem, Computation and Language (cs.CL), Electrical Engineering and Systems Science - Audio and Speech Processing

URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::71cae94c87fffe2083c283d9c3920f81

View record in OpenAIRE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 16
16

Binaural mask-informed speech enhancement for hearing aids with head tracking

المؤلفون: Leo Lightburn, Mike Brookes, Alastair H. Moore, Wei Xue, Patrick A. Naylor

المساهمون: Engineering & Physical Science Research Council (EPSRC)

المصدر: International Workshop on Acoustic Signal Enhancement (IWAENC 2018)
IWAENC

مصطلحات موضوعية: Beamforming, Technology, Assisted listening, Computer science, Speech recognition, Speech enhancement, 02 engineering and technology, Intelligibility (communication), 030507 speech-language pathology & audiology, 03 medical and health sciences, Automation & Control Systems, Engineering, 0202 electrical engineering, electronic engineering, information engineering, Head rotation, Minimum mean square error, Science & Technology, Artificial neural network, 020206 networking & telecommunications, Engineering, Electrical & Electronic, Time–frequency analysis, A priori and a posteriori, INTELLIGIBILITY, 0305 other medical science, Binaural recording, Time-frequency mask

URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1ae7bf41183e7b970046d5a3d00b07f2
http://hdl.handle.net/10044/1/62651

View record in OpenAIRE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 17
17

Academic Journal

Sparse source separation based on simultaneous clustering of source locational and spectral features

المؤلفون: Hiroshi Sawada, Shoko Araki, Tomohiro Nakatani

المصدر: Acoustical Science and Technology. 2011, 32(4):161

View record in JSTAGE Full Text Finder

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 18
18

Conference

An Effective Target Speech Enhancement with Single Acoustic Vector Sensor Based on the Speech Time-Frequency Sparsity

المؤلفون: Zou, Y. X., Wang, Y. Q., Wang, Peng, Ritz, C. H., Xi, Jiangtao

المساهمون: Zou, YX (reprint author), Peking Univ, Sch Elect Comp Engn, ADSPLAB ELIP, Shenzhen, Peoples R China., Peking Univ, Sch Elect Comp Engn, ADSPLAB ELIP, Shenzhen, Peoples R China., Univ Wollongong, Sch Elect Comp & Telecom Engn, Wollongong, NSW, Australia.

المصدر: SCI

مصطلحات موضوعية: Speech enhancement, time-frequency mask, acoustic vector sensor, Wiener post-filter, power spectral density, PERFORMANCE, NOISE

Relation: 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP).Hong Kong, PEOPLES R CHINA,2014/1/1.; 1331873; http://hdl.handle.net/20.500.11897/424331; WOS:000361019500102

الاتاحة: https://hdl.handle.net/20.500.11897/424331

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 19
19

Academic Journal

A Computational Auditory Scene Analysis System for Speech Segregation and Robust Speech Recognition

المؤلفون: Shao,Yang, Srinivasan,Soundararajan, Jin,Zhaozhang, Wang,DeLiang

المساهمون: Ohio State University Columbus United States

مصطلحات موضوعية: VOICE COMMUNICATIONS, speech recognition, auditory signals, speech analysis, speech, noise, SPEECH SEGREGATION, CASA(COMPUTATIONAL AUDITORY SCENE ANALYSIS), BINARY TIME-FREQUENCY MASK, ROBUST SPEECH RECOGNITION, UNCERTAINTY DECODING FRAMEWORKS, lang, psy

Relation: http://www.dtic.mil/docs/citations/AD1001212

الاتاحة: http://www.dtic.mil/docs/citations/AD1001212

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في:
تحديد النتيجة رقم 20
20

Academic Journal

Transforming Binary Uncertainties for Robust Speech Recognition

المؤلفون: Srinivasan,Soundararajan, Wang,DeLiang

المساهمون: Ohio State University Columbus United States

مصطلحات موضوعية: INTELLIGIBILITY, SPEECH RECOGNITION, REGRESSION ANALYSIS, NOISY SPEECH SIGNALS, BINARY MASKS, SPEECH INTELLIGIBILITY, UNCERTAINTY DECODING, CASA(COMPUTATIONAL AUDITORY SCENE ANALYSIS), BINARY T-F MASK(BINARY TIME FREQUENCY MASK)

وصف الملف: text/html

Relation: http://www.dtic.mil/docs/citations/AD1001223

الاتاحة: http://www.dtic.mil/docs/citations/AD1001223
http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=AD1001223

View record in BASE

qrcode_show

أضف إلى سلة الكتب حذف من سلة الكتب
أضف إلى المفضلة

محفوظ في: