PUBLICATIONS - 2019
Journal
- Xin Wang, Shinji Takaki, and Junichi Yamagishi, ``Neural source-filter waveform models for statistical parametric speech synthesis,'' IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 402-415, November 2019. (Full paper peer reviewed)
- Xin Wang, Shinji Takaki, Junichi Yamagishi, Simon King, and Keiichi Tokuda, ``A vector quantized variational autoencoder (VQ-VAE) autoregressive neural F0 model for statistical parametric speech synthesis,'' IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 157-170, October 2019. (Full paper peer reviewed)
- Shinji Takaki, ``Applied technology for speech synthesis: DNN-based text-to-speech synthesis,'' The Journal of the Acoustical Society of Japan, vol. 75, no. 7, pp. 393-399, July 2019. (Review paper)
International Conference
- Motoki Shimada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Low computational cost speech synthesis based on deep neural networks using hidden semi-Markov model structures,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 177-182, Vienna, Austria, September 2019. (Full paper peer reviewed)
- Takato Fujimoto, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 166-171, Vienna, Austria, September 2019. (Full paper peer reviewed)
- Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Shinji Takaki, and Junichi Yamagishi, ``Rakugo speech synthesis using segment-to-segment neural transduction and style tokens — toward speech synthesis for entertaining audiences,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 111-116, Vienna, Austria, September 2019. (Full paper peer reviewed)
- Keiichiro Oura, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, ``Deep neural network based real-time speech vocoder with periodic and aperiodic inputs,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 13-18, Vienna, Austria, September 2019. (Full paper peer reviewed)
- Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Speaker-dependent WaveNet-based delay-free ADPCM speech coding,'' 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7145-7149, Brighton, UK, May 2019. (Full paper peer reviewed)
- Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Singing voice synthesis based on generative adversarial networks,'' 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6955-6959, Brighton, UK, May 2019. (Full paper peer reviewed)