* PUBLICATIONS - 2018 [#n982a49e] #contents ** Journal [#q0e89094] + Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, ``A Bayesian framework for image recognition based on hidden Markov eigen-image models,'' IEEJ Transactions on Electrical and Electronic Engineering, Vol. 13, Issue 9, pp. 1335-1347, September, 2018. (Full paper peer reviewed) &publication(2018/20180901_Journal_IEEJ_Kei_Sawada_paper.pdf, paper); [[link>https://onlinelibrary.wiley.com/doi/abs/10.1002/tee.22700]] + Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Mel-cepstrum-based quantization noise shaping applied to neural-network-based speech waveform synthesis,'' IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 7, pp. 1173-1180, July, 2018. (Full paper peer reviewed) &publication(2018/20180701_Journal_IEEE_Takenori_Yoshimura_paper.pdf, paper); [[link>https://ieeexplore.ieee.org/document/8322169/?arnumber=8322169&source=authoralert]] + Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Constructing text-to-speech systems for languages with unknown pronunciations,'' Acoustical Science and Technology, Vol. 39, Issue 2, pp. 119-129, March, 2018. (Full paper peer reviewed) &publication(2018/20180301_Journal_AST_Kei_Sawada_paper.pdf, paper); [[link>https://www.jstage.jst.go.jp/article/ast/39/2/39_E1734/_article/-char/en]] ** International Conference [#r32cf446] + %%%Takenori Yoshimura%%%, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``WaveNet-based zero-delay lossless speech coding,'' 2018 IEEE Workshop on Spoken Language Technology (SLT 2018), pp. 153-158, Athens, Greece, December 2018. (Full paper peer reviewed) &publication(2018/20181219_IConference_SLT_Takenori_Yoshimura_paper.pdf, paper); &publication(2018/20181219_IConference_SLT_Takenori_Yoshimura_poster.pdf, poster); + %%%Koki Senda%%%, Yukiya Hono, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Singing voice conversion using posted waveform data on music social media,'' Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 1913-1917, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed) &publication(2018/20181115_IConference_APSIPA_Koki_Senda_paper.pdf, paper); &publication(2018/20181115_IConference_APSIPA_Koki_Senda_poster.pdf, poster); + %%%Yukiya Hono%%%, Shumma Murata, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Recent development of the DNN-based singing voice synthesis system -- Sinsy,'' Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 1003-1009, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed) &publication(2018/20181114_IConference_APSIPA_Yukiya_Hono_paper.pdf, paper); &publication(2018/20181114_IConference_APSIPA_Yukiya_Hono_slide.pptx, slide); + %%%Takato Fujimoto%%%, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Speech synthesis using waveNet vocoder based on periodic/aperiodic decomposition,'' ``Speech synthesis using WaveNet vocoder based on periodic/aperiodic decomposition,'' Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 644-648, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed) &publication(2018/20181114_IConference_APSIPA_Takato_Fujimoto_paper.pdf, paper); &publication(2018/20181114_IConference_APSIPA_Takato_Fujimoto_slide.pptx, slide); + %%%Kento Nakao%%%, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Speaker adaptation for speech synthesis based on deep neural networks using hidden semi-Markov model structures,'' Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 638-643, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed) &publication(2018/20181114_IConference_APSIPA_Kento_Nakao_paper.pdf, paper); &publication(2018/20181114_IConference_APSIPA_Kento_Nakao_slide.pptx, slide); + %%%Takenori Yoshimura%%%, Natsumi Koike, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Discriminative feature extraction based on sequential variational autoencoder for speaker recognition,'' Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 1742-1746, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed) &publication(2018/20181114_IConference_APSIPA_Takenori_Yoshimura_paper.pdf, paper); &publication(2018/20181114_IConference_APSIPA_Takenori_Yoshimura_poster.pdf, poster); + %%%Takayuki Kasugai%%%, Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Image recognition based on convolutional neural networks using features generated from separable lattice hidden Markov models'' Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018), pp. 324-328, Honolulu, Hawaii, November, 2018. (Full paper peer reviewed) &publication(2018/20181113_IConference_APSIPA_Takayuki_Kasugai_paper.pdf, paper); &publication(2018/20181113_IConference_APSIPA_Takayuki_Kasugai_slide.pptx, slide); + %%%Kei Sawada%%%, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``The NITech text-to-speech system for the Blizzard Challenge 2018,'' Blizzard Challenge 2018 Workshop, Hyderabad, India, September, 2018. (Full paper peer reviewed) &publication(2018/20180908_IConference_Blizzard_Kei_Sawada_paper.pdf, paper); &publication(2018/20180908_IConference_Blizzard_Kei_Sawada_slide.pptx, slide); [[link>http://www.festvox.org/blizzard/bc2018/NITech_BlizzardChallenge2018.pdf]] + %%%Eiji Ichikawa%%%, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, ``Image recognition based on separable lattice HMMs using a deep neural network for output probability distribution,'' 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 3021-3025, Calgary, Canada, April, 2018. (Full paper peer reviewed) &publication(2018/20180420_IConference_ICASSP_Eiji_Ichikawa_paper.pdf, paper); &publication(2018/20180420_IConference_ICASSP_Eiji_Ichikawa_poster.pdf, poster); + %%%Jumpei Niwa%%%, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Statistical voice conversion based on WaveNet,'' 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5289-5293, Calgary, Canada, April, 2018. (Full paper peer reviewed) &publication(2018/20180418_IConference_ICASSP_Jumpei_Niwa_paper.pdf, paper); &publication(2018/20180418_IConference_ICASSP_Jumpei_Niwa_poster.pdf, poster); **Past Publications [#d97906db] #ls2(HOME/PUBLICATIONS/,reverse);