PUBLICATIONS - 2019

Journal

  1. Xin Wang, Shinji Takaki, and Junichi Yamagishi, ``Neural source-filter waveform models for statistical parametric speech synthesis,'' IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 402-415, November 2019. (Full paper peer reviewed) link
  2. Xin Wang, Shinji Takaki, Junichi Yamagishi, Simon King, and Keiichi Tokuda, ``A vector quantized variational autoencoder (VQ-VAE) autoregressive neural F0 model for statistical parametric speech synthesis,'' IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 157-170, October 2019. (Full paper peer reviewed) link
  3. Shinji Takaki, ``Applied technology for speech synthesis : DNN-based text-to-speech synthesis,'' The journal of the acoustical society of Japan, vol. 75, no. 7, pp. 393-399, July 2019. (Review paper) link

International Conference

  1. Motoki Shimada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Low computational cost speech synthesis based on deep neural networks using hidden semi-Markov model structures,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 177-182, Vienne, Austria, September, 2019. (Full paper peer reviewed)
  2. Takato Fujimoto, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 166-171, Vienne, Austria, September, 2019. (Full paper peer reviewed)
  3. Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Shinji Takaki, and Junichi Yamagishi, ``Rakugo speech synthesis using segment-to-segment neural transduction and style tokens — toward speech synthesis for entertaining audiences,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 111-116, Vienne, Austria, September, 2019. (Full paper peer reviewed)
  4. Keiichiro Oura, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, ``Deep neural network based real-time speech vocoder with periodic and aperiodic inputs,'' 10th ISCA Speech Synthesis Workshop (SSW10), pp. 13-18, Vienne, Austria, September, 2019. (Full paper peer reviewed)
  5. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Speaker-dependent WaveNet-based delay-free ADPCM speech coding,'' 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7145-7149, Brighton, UK, May, 2019. (Full paper peer reviewed)
  6. Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Singing voice synthesis based on generative adversarial networks,'' 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6955-6959, Brighton, UK, May, 2019. (Full paper peer reviewed)

Past Publications





トップ   編集 凍結 差分 履歴 添付 複製 名前変更 リロード   新規 一覧 検索 最終更新   ヘルプ   最終更新のRSS