-
1Academic Journal
المؤلفون: Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda
المصدر: IEEE Access, Vol 9, Pp 137599-137612 (2021)
مصطلحات موضوعية: Generative adversarial network, neural vocoder, signal processing, singing voice synthesis, waveform generative model, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
2Academic Journal
المؤلفون: Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Shinji Takaki, Junichi Yamagishi
المصدر: IEEE Access, Vol 8, Pp 138149-138161 (2020)
مصطلحات موضوعية: Context, entertainment, global style tokens, rakugo, self-attention, speech synthesis, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
3Academic Journal
المؤلفون: Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu
المصدر: IEEE Access, Vol 6, Pp 60478-60488 (2018)
مصطلحات موضوعية: Generative adversarial network, multi-speaker modeling, speech synthesis, WaveNet, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
4Academic Journal
المؤلفون: Shinji Takaki, 高木 信二
المصدر: 日本音響学会誌 / THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN. 2019, 75(7):393
-
5Academic Journal
المؤلفون: Kei Hashimoto, Shinji Takaki, 橋本 佳, 高木 信二
المصدر: 日本音響学会誌 / THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN. 2017, 73(1):55
-
6Academic Journal
المؤلفون: Junichi YAMAGISHI, Shinji TAKAKI, Xin WANG
المصدر: IEICE Transactions on Information and Systems. 2016, E99.D(10):2471
-
7
المؤلفون: Kei Hashimoto, Shinji Takaki, Yukiya Hono, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda
المصدر: IEEE Access, Vol 9, Pp 137599-137612 (2021)
مصطلحات موضوعية: General Computer Science, Computer science, Autocorrelation, waveform generative model, General Engineering, Signal, singing voice synthesis, TK1-9971, Generative model, symbols.namesake, Sine wave, Autoregressive model, Aperiodic graph, Gaussian noise, Computer Science::Sound, neural vocoder, symbols, Waveform, General Materials Science, Electrical engineering. Electronics. Nuclear engineering, Generative adversarial network, signal processing, Algorithm
-
8Dissertation/ Thesis
-
9
المؤلفون: Keiichiro Oura, Kei Hashimoto, Yukiya Hono, Yoshihiko Nankaku, Keiichi Tokuda, Shinji Takaki
المصدر: ICASSP
مصطلحات موضوعية: Signal Processing (eess.SP), FOS: Computer and information sciences, Signal processing, Computer Science - Machine Learning, Sound (cs.SD), Artificial neural network, Series (mathematics), Computer science, Computer Science - Sound, Machine Learning (cs.LG), Naturalness, Autoregressive model, Aperiodic graph, Audio and Speech Processing (eess.AS), Feature (machine learning), FOS: Electrical engineering, electronic engineering, information engineering, Waveform, Electrical Engineering and Systems Science - Signal Processing, Algorithm, Electrical Engineering and Systems Science - Audio and Speech Processing
-
10
المؤلفون: Shinji Takaki, Junichi Yamagishi, Toru Nakashika
المصدر: Nakashika, T, Takaki, S & Yamagishi, J 2019, ' Complex-Valued Restricted Boltzmann Machine for Speaker-Dependent Speech Parameterization from Complex Spectra ', IEEE/ACM Transactions on Audio, Speech and Language Processing, pp. 244-254 . https://doi.org/10.1109/TASLP.2018.2877465
مصطلحات موضوعية: Restricted Boltzmann machine, Signal processing, Acoustics and Ultrasonics, Computer science, business.industry, Feature extraction, Pattern recognition, 030507 speech-language pathology & audiology, 03 medical and health sciences, Computational Mathematics, symbols.namesake, ComputingMethodologies_PATTERNRECOGNITION, Computer Science (miscellaneous), symbols, Feature (machine learning), Artificial intelligence, Mel-frequency cepstrum, Electrical and Electronic Engineering, 0305 other medical science, business, Representation (mathematics), Energy (signal processing), Gibbs sampling
وصف الملف: application/pdf
-
11
المؤلفون: Keiichiro Oura, Yoshihiko Nankaku, Kei Hashimoto, Keiichi Tokuda, Shinji Takaki, Takato Fujimoto
المصدر: ICASSP
مصطلحات موضوعية: Training set, Computer science, Character (computing), Speech recognition, 05 social sciences, Speech synthesis, Semi-supervised learning, 010501 environmental sciences, Pronunciation, computer.software_genre, 01 natural sciences, ComputingMethodologies_PATTERNRECOGNITION, 0502 economics and business, Active listening, 050207 economics, Sound quality, computer, Generative grammar, 0105 earth and related environmental sciences
-
12
المؤلفون: Yuta Ochiai, Shinji Takaki, Junichi Yamagishi, Gustav Eje Henter, Jaime Lorenzo-Trueba, Yosuke Morino
المصدر: Lorenzo-Trueba, J, Eje Henter, G, Takaki, S, Yamagishi, J, Morino, Y & Ochiai, Y 2018, ' Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis ', Speech Communication, vol. 99, pp. 135-143 . https://doi.org/10.1016/j.specom.2018.03.002
مصطلحات موضوعية: Linguistics and Language, Speech recognition, media_common.quotation_subject, Speech synthesis, 02 engineering and technology, computer.software_genre, Language and Linguistics, Task (project management), 030507 speech-language pathology & audiology, 03 medical and health sciences, ComputerApplications_MISCELLANEOUS, Perception, 0202 electrical engineering, electronic engineering, information engineering, Emotion recognition, GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries), media_common, Class (computer programming), Communication, Speech quality, Computer Science Applications, Modeling and Simulation, 020201 artificial intelligence & image processing, Computer Vision and Pattern Recognition, 0305 other medical science, Psychology, computer, Software
وصف الملف: application/pdf
-
13
المؤلفون: Xin Wang, Junichi Yamagishi, Shinji Takaki
المصدر: Wang, X, Takaki, S & Yamagishi, J 2016, Investigating Very Deep Highway Networks for Parametric Speech Synthesis . in 9th ISCA Speech Synthesis Workshop . pp. 166-171, 9th ISCA Speech Synthesis Workshop, Sunnyvale, United States, 13/09/16 . https://doi.org/10.21437/SSW.2016-27
Wang, X, Takaki, S & Yamagishi, J 2017, ' Investigating very deep highway networks for parametric speech synthesis ', Speech Communication, vol. 96, pp. 1-9 . https://doi.org/10.1016/j.specom.2017.11.002
SSWمصطلحات موضوعية: Engineering, Linguistics and Language, Computer science, Speech recognition, Speech synthesis, 02 engineering and technology, computer.software_genre, Machine learning, Language and Linguistics, 030507 speech-language pathology & audiology, 03 medical and health sciences, Deep belief network, ComputerApplications_MISCELLANEOUS, 0202 electrical engineering, electronic engineering, information engineering, Sensitivity (control systems), Parametric statistics, Basis (linear algebra), Artificial neural network, Contextual image classification, business.industry, Time delay neural network, Communication, Feed forward, 020206 networking & telecommunications, Pattern recognition, Computer Science Applications, Modeling and Simulation, Trajectory, Computer Vision and Pattern Recognition, Artificial intelligence, 0305 other medical science, business, computer, Software
وصف الملف: application/pdf
-
14
المؤلفون: Shinji Takaki, Yi Zhao, Junichi Yamagishi, Hieu-Thi Luong, Daisuke Saito, Nobuaki Minematsu
المصدر: IEEE Access, Vol 6, Pp 60478-60488 (2018)
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Similarity (geometry), General Computer Science, Computer science, Speech recognition, Speech synthesis, Machine Learning (stat.ML), 02 engineering and technology, computer.software_genre, Computer Science - Sound, 030507 speech-language pathology & audiology, 03 medical and health sciences, Quality (physics), speech synthesis, Audio and Speech Processing (eess.AS), Statistics - Machine Learning, 0202 electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, Waveform, General Materials Science, WaveNet, Computer Science - Computation and Language, General Engineering, Acoustic model, multi-speaker modeling, 020201 artificial intelligence & image processing, lcsh:Electrical engineering. Electronics. Nuclear engineering, 0305 other medical science, Generative adversarial network, computer, Computation and Language (cs.CL), lcsh:TK1-9971, Generator (mathematics), Electrical Engineering and Systems Science - Audio and Speech Processing
-
15
المؤلفون: Junichi Yamagishi, Yusuke Yasuda, Xin Wang, Shinji Takaki, Shuhei Kato, Erica Cooper
المصدر: 10th ISCA Workshop on Speech Synthesis (SSW 10).
مصطلحات موضوعية: Communication, business.industry, Computer science, Speech synthesis, Transduction (psychology), computer.software_genre, business, computer, Style (sociolinguistics)
-
16
المؤلفون: Junichi Yamagishi, Satoshi Kobashikawa, Shinji Takaki, Yi Zhao, Ando Atsushi
المصدر: INTERSPEECH
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Adverse conditions, Computer science, Speech recognition, Emotional communication, Intelligibility (communication), Lombard effect, Computer Science - Sound, Noise, Audio and Speech Processing (eess.AS), QUIET, FOS: Electrical engineering, electronic engineering, information engineering, medicine, Affect (linguistics), medicine.symptom, Electrical Engineering and Systems Science - Audio and Speech Processing, Confusion
-
17
المؤلفون: Junichi Yamagishi, Toru Nakashika, Xin Wang, Shinji Takaki
المصدر: ICASSP
Takaki, S, Nakashika, T, Wang, X & Yamagishi, J 2019, STFT Spectral Loss for Training a Neural Speech Waveform Model . in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . Institute of Electrical and Electronics Engineers (IEEE), Brighton, United Kingdom, pp. 7065-7069, 44th International Conference on Acoustics, Speech, and Signal Processing, Brighton, United Kingdom, 12/05/19 . https://doi.org/10.1109/ICASSP.2019.8683791مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer science, Gaussian, Speech recognition, Phase (waves), Machine Learning (stat.ML), Speech synthesis, computer.software_genre, Computer Science - Sound, Spectral line, neural waveform modeling, symbols.namesake, speech synthesis, Statistics - Machine Learning, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Waveform, WaveNet, Computer Science - Computation and Language, Basis (linear algebra), Short-time Fourier transform, ComputingMethodologies_PATTERNRECOGNITION, Amplitude, Fourier transform, Computer Science::Sound, symbols, Computation and Language (cs.CL), computer, Electrical Engineering and Systems Science - Audio and Speech Processing
وصف الملف: application/pdf
-
18
المؤلفون: Junichi Yamagishi, Yusuke Yasuda, Shinji Takaki, Xin Wang
المصدر: ICASSP
Yasuda, Y, Wang, X, Takaki, S & Yamagishi, J 2019, Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language . in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . Institute of Electrical and Electronics Engineers (IEEE), Brighton, United Kingdom, pp. 6905-6909, 44th International Conference on Acoustics, Speech, and Signal Processing, Brighton, United Kingdom, 12/05/19 . https://doi.org/10.1109/ICASSP.2019.8682353مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Computer science, Speech recognition, media_common.quotation_subject, Machine Learning (stat.ML), Speech synthesis, 02 engineering and technology, computer.software_genre, Tacotron, Computer Science - Sound, 030507 speech-language pathology & audiology, 03 medical and health sciences, speech synthesis, Statistics - Machine Learning, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Quality (business), Sound quality, media_common, Computer Science - Computation and Language, Pitch accent, business.industry, Character (computing), Deep learning, deep learning, 020206 networking & telecommunications, Pipeline (software), Artificial intelligence, 0305 other medical science, business, Computation and Language (cs.CL), computer, Electrical Engineering and Systems Science - Audio and Speech Processing
وصف الملف: application/pdf
-
19
المؤلفون: Junichi Yamagishi, Shinji Takaki, Xin Wang
المصدر: Wang, X, Takaki, S & Yamagishi, J 2020, ' Neural Source-Filter Waveform Models for Statistical Parametric Speech Synthesis ', IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, pp. 402-415 . https://doi.org/10.1109/TASLP.2019.2956145
مصطلحات موضوعية: FOS: Computer and information sciences, Sound (cs.SD), Acoustics and Ultrasonics, Computer science, neural network, Speech synthesis, Machine Learning (stat.ML), computer.software_genre, Computer Science - Sound, Audio and Speech Processing (eess.AS), Statistics - Machine Learning, Computer Science (miscellaneous), FOS: Electrical engineering, electronic engineering, information engineering, Waveform, Electrical and Electronic Engineering, Parametric statistics, Artificial neural network, business.industry, Deep learning, Short-time Fourier transform, Speech corpus, Filter (signal processing), Computational Mathematics, Computer Science::Sound, short-time Fourier transform, waveform model, Artificial intelligence, business, Algorithm, computer, Electrical Engineering and Systems Science - Audio and Speech Processing
وصف الملف: application/pdf
-
20
المؤلفون: Junichi Yamagishi, Shinji Takaki, Yoshikazu Nishimura
المصدر: Takaki, S, Nishimura, Y & Yamagishi, J 2019, Unsupervised Speaker Adaptation for DNN-based Speech Synthesis using Input Codes . in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 . Institute of Electrical and Electronics Engineers (IEEE), Honolulu, Hawaii, USA, pp. 649-658, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018, Honolulu, United States, 12/11/18 . https://doi.org/10.23919/APSIPA.2018.8659621
APSIPAمصطلحات موضوعية: Basis (linear algebra), Artificial neural network, Computer science, Speech recognition, Orthographic projection, Posterior probability, 020206 networking & telecommunications, Speech synthesis, 02 engineering and technology, Construct (python library), computer.software_genre, Data modeling, 030507 speech-language pathology & audiology, 03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, Code (cryptography), 0305 other medical science, computer
وصف الملف: application/pdf