PUBLICATIONS - 2018
Journal
- Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda,
``A Bayesian framework for image recognition based on hidden Markov eigen-image models,''
IEEJ Transactions on Electrical and Electronic Engineering, Vol. 13, Issue 9, pp. 1335-1347, September, 2018. (Full paper peer reviewed)
- Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Mel-cepstrum-based quantization noise shaping applied to neural-network-based speech waveform synthesis,'' IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 7, pp. 1173-1180, July, 2018. (Full paper peer reviewed)
- Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``Constructing text-to-speech systems for languages with unknown pronunciations,''
Acoustical Science and Technology, Vol. 39, Issue 2, pp. 119-129, March, 2018. (Full paper peer reviewed)
International Conference
- Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``WaveNet-based zero-delay lossless speech coding,''
2018 IEEE Workshop on Spoken Language Technology (SLT 2018), pp. 153-158, Athens, Greece, December, 2018. (Full paper peer reviewed)
- Koki Senda, Yukiya Hono, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``Singing voice conversion using posted waveform data on music social media,''
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 1913-1917, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed)
- Yukiya Hono, Shumma Murata, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``Recent development of the DNN-based singing voice synthesis system -- Sinsy,''
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 1003-1009, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed)
- Takato Fujimoto, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``Speech synthesis using WaveNet vocoder based on periodic/aperiodic decomposition,''
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 644-648, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed)
- Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``Speaker adaptation for speech synthesis based on deep neural networks using hidden semi-Markov model structures,''
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 638-643, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed)
- Takenori Yoshimura, Natsumi Koike, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``Discriminative feature extraction based on sequential variational autoencoder for speaker recognition,''
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 1742-1746, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed)
- Takayuki Kasugai, Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``Image recognition based on convolutional neural networks using features generated from separable lattice hidden Markov models,''
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 324-328, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed)
- Kei Sawada, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda,
``The NITech text-to-speech system for the Blizzard Challenge 2018,''
Blizzard Challenge 2018 Workshop, Hyderabad, India, September, 2018. (Full paper peer reviewed)
- Eiji Ichikawa, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, ``Image recognition based on separable lattice HMMs using a deep neural network for output probability distribution,'' 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 3021-3025, Calgary, Canada, April, 2018. (Full paper peer reviewed)
- Jumpei Niwa, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Statistical voice conversion based on WaveNet,'' 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5289-5293, Calgary, Canada, April, 2018. (Full paper peer reviewed)