Lists of Publications

Journal Paper
  1. Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components," IEEE Access, vol. 9, pp. 137599-137612, October, 2021. (DOI: 10.1109/ACCESS.2021.3118033) [paper(link)]
  2. Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System," IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 2803-2815, August, 2021. (DOI: 10.1109/TASLP.2021.3104165) [paper(link)]
  3. Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A Bayesian framework for image recognition based on hidden Markov eigen-image models," IEEJ Transactions on Electrical and Electronic Engineering, vol. 13, Issue 9, pp. 1335-1347, September, 2018. (DOI: 10.1002/tee.22700) [paper(link)]
  4. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Mel-cepstrum-based quantization noise shaping applied to neural-network-based speech waveform synthesis," IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 26, Issue 7, pp. 1173-1180, July, 2018. (DOI: 10.1109/TASLP.2018.2818408) [paper(link)] (IEEE Signal Processing Society Japan Student Journal Paper Award)
  5. Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Constructing text-to-speech systems for languages with unknown pronunciations," Acoustical Science and Technology, vol. 39, Issue 2, pp. 119-129, March, 2018. (DOI: 10.1250/ast.39.119) [paper(link)]
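  6. Keiichiro Oura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Introduction to Japanese speech synthesis software based on hidden Markov models," Journal of the Institute of Systems, Control and Information Engineers, vol. 62, no. 2, pp. 57-62, February, 2018. (in Japanese) (DOI: 10.11509/isciesci.62.2_57) [paper(link)]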
  7. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Simultaneous optimization of multiple tree-based factor analyzed HMM for speech synthesis," IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 25, Issue 9, pp. 1532-1541, September, 2017. (DOI: 10.1109/TASLP.2017.2721219) [paper(link)]
  8. Kei Hashimoto and Shinji Takaki, "Statistical parametric speech synthesis based on deep learning," The Journal of the Acoustical Society of Japan, vol. 73, no. 1, pp. 55-62, January, 2017. (in Japanese) (Review paper) (DOI: 10.20697/jasj.73.1_55) [paper(link)]
  9. Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A Bayesian approach to image recognition based on separable lattice hidden Markov models," IEICE TRANSACTIONS on Information & Systems, vol. E99-D, no. 12, pp. 3119-3131, December, 2016. (DOI: 10.1587/transinf.2016EDP7112) [paper(link)]
  10. Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Integration of spectral feature extraction and modeling for HMM-based speech synthesis," IEICE TRANSACTIONS on Information & Systems, vol. E97-D, no. 6, pp. 1438-1448, June, 2014. (DOI: 10.1587/transinf.E97.D.1438) [paper(link)]
  11. Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A Bayesian framework using multiple model structures for speech recognition," IEICE TRANSACTIONS on Information & Systems, vol. E96-D, no. 4, pp. 939-948, April, 2013. (DOI: 10.1587/transinf.E96.D.939) [paper(link)]
  12. Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, and Keiichi Tokuda, "Impacts of machine translation and speech synthesis on speech-to-speech translation," Speech Communication, vol. 54, Issue 7, pp. 854-866, September, 2012. (DOI: 10.1016/j.specom.2012.02.004) [paper(link)]
  13. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Speech recognition based on statistical models including multiple phonetic decision trees," Acoustical Science and Technology, vol. 32, no. 6, pp. 236-243, November, 2011. (DOI: 10.1250/ast.32.236) [paper(link)]
  14. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Bayesian context clustering using cross validation for speech recognition," IEICE TRANSACTIONS on Information & Systems, vol. E94-D, no. 3, pp. 668-678, March, 2011. (DOI: 10.1587/transinf.E94.D.668) [paper(link)]
  15. Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro Sumita, and Keiichi Tokuda, "A reordering model using a source-side parse-tree for statistical machine translation," IEICE TRANSACTIONS on Information & Systems, vol. E92-D, no. 12, pp. 2386-2393, December, 2009. (DOI: 10.1587/transinf.E92.D.2386) [paper(link)]


International Conference
  1. Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice synthesis based on a musical note position-aware attention mechanism," Proceedings of 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), Greece, June 4-10, 2023.
  2. Takenori Yoshimura, Shinji Takaki, Kazuhiro Nakamura, Keiichiro Oura, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Embedding a differentiable mel-cepstral synthesis filter to a neural speech synthesis system," Proceedings of 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), Greece, June 4-10, 2023.
  3. Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Autoregressive variational autoencoder with a hidden semi-Markov model-based structured attention for speech synthesis," Proceedings of 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2022), pp. 7462-7466, Singapore, May 7-13, 2022.
  4. Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components," Proceedings of 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2021), pp. 6049-6053, Toronto, Canada, June 6-11, 2021.
  5. Yukiya Hono, Kazuna Tsuboi, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis," Proceedings of Interspeech 2020, pp. 3441-3445, Shanghai, China, October 25-29, 2020.
  6. Takato Fujimoto, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Semi-supervised learning based on hierarchical generative models for end-to-end speech synthesis," Proceedings of 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 7644-7648, Barcelona, Spain, May 4-8, 2020.
  7. Kazuhiro Nakamura, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Fast and high-quality singing voice synthesis system based on convolutional neural networks," Proceedings of 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 7239-7243, Barcelona, Spain, May 4-8, 2020.
  8. Motoki Shimada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Low computational cost speech synthesis based on deep neural networks using hidden semi-Markov model structures," Proceedings of 10th ISCA Speech Synthesis Workshop (SSW10), pp. 177-182, Vienna, Austria, September 20-22, 2019.
  9. Takato Fujimoto, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis," Proceedings of 10th ISCA Speech Synthesis Workshop (SSW10), pp. 166-171, Vienna, Austria, September 20-22, 2019.
  10. Keiichiro Oura, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Deep neural network based real-time speech vocoder with periodic and aperiodic inputs," Proceedings of 10th ISCA Speech Synthesis Workshop (SSW10), pp. 13-18, Vienna, Austria, September 20-22, 2019.
  11. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Speaker-dependent WaveNet-based delay-free ADPCM speech coding," Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), pp. 7145-7149, Brighton, UK, May 12-17, 2019.
  12. Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice synthesis based on generative adversarial networks," Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), pp. 6955-6959, Brighton, UK, May 12-17, 2019.
  13. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "WaveNet-based zero-delay lossless speech coding," Proceedings of 2018 IEEE Workshop on Spoken Language Technology (SLT 2018), pp. 153-158, Athens, Greece, December 18-21, 2018.
  14. Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Speaker adaptation for speech synthesis based on deep neural networks using hidden semi-Markov model structures," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018), pp. 638-643, Honolulu, Hawaii, November 12-15, 2018.
  15. Takayuki Kasugai, Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Image recognition based on convolutional neural networks using features generated from separable lattice hidden Markov models," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018), pp. 324-328, Honolulu, Hawaii, November 12-15, 2018.
  16. Koki Senda, Yukiya Hono, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice conversion using posted waveform data on music social media," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018), pp. 1913-1917, Honolulu, Hawaii, November 12-15, 2018.
  17. Yukiya Hono, Shumma Murata, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Recent development of the DNN-based singing voice synthesis system -- Sinsy," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018), pp. 1003-1009, Honolulu, Hawaii, November 12-15, 2018.
  18. Takenori Yoshimura, Natsumi Koike, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Discriminative feature extraction based on sequential variational autoencoder for speaker recognition," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018), pp. 1742-1746, Honolulu, Hawaii, November 12-15, 2018.
  19. Takato Fujimoto, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Speech synthesis using WaveNet vocoder based on periodic/aperiodic decomposition," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018), pp. 644-648, Honolulu, Hawaii, November 12-15, 2018.
  20. Kei Sawada, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "The NITech text-to-speech system for the Blizzard Challenge 2018," Proceedings of Blizzard Challenge 2018 Workshop, Hyderabad, India, September 8, 2018. (web proceedings)
  21. Eiji Ichikawa, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Image recognition based on separable lattice HMMs using a deep neural network for output probability distributions," Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018), pp. 3021-3025, Calgary, Canada, April 15-20, 2018.
  22. Jumpei Niwa, Takenori Yoshimura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Statistical voice conversion based on WaveNet," Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018), pp. 5289-5293, Calgary, Canada, April 15-20, 2018.
  23. Kei Sawada, Kei Hashimoto, Keiichiro Oura, and Keiichi Tokuda, "The NITech text-to-speech system for the Blizzard Challenge 2017," Proceedings of Blizzard Challenge 2017 Workshop, Stockholm, Sweden, August 25, 2017. (web proceedings)
  24. Amelia Gully, Takenori Yoshimura, Damian Murphy, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Articulatory text-to-speech synthesis using the digital waveguide mesh driven by a deep neural network," Proceedings of Interspeech 2017, pp. 234-238, Stockholm, Sweden, August 20-24, 2017.
  25. Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Image recognition based on discriminative models using features generated from separable lattice HMMs," Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), pp. 2607-2611, New Orleans, USA, March 5-9, 2017.
  26. Kei Sawada, Chiaki Asai, Kei Hashimoto, Keiichiro Oura, and Keiichi Tokuda, "The NITech text-to-speech system for the Blizzard Challenge 2016," Proceedings of Blizzard Challenge 2016 Workshop, California, USA, September 16, 2016. (web proceedings)
  27. Keiichi Tokuda, Kei Hashimoto, Keiichiro Oura, and Yoshihiko Nankaku, "Temporal modeling in neural network based statistical parametric speech synthesis," Proceedings of 9th ISCA Speech Synthesis Workshop (SSW9), pp. 113-118, California, USA, September 13-15, 2016.
  28. Rasmus Dall, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Redefining the linguistic context feature set for HMM and DNN TTS through position and parsing," Proceedings of Interspeech 2016, pp. 2851-2855, California, USA, September 8-12, 2016.
  29. Masanari Nishimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice synthesis based on deep neural networks," Proceedings of Interspeech 2016, pp. 2478-2482, California, USA, September 8-12, 2016.
  30. Naoki Hosaka, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Voice conversion based on trajectory model training of neural networks considering global variance," Proceedings of Interspeech 2016, pp. 307-311, California, USA, September 8-12, 2016.
  31. Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Trajectory training considering global variance for speech synthesis based on neural networks," Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), pp. 5600-5604, Shanghai, China, March 20-25, 2016.
  32. Kei Hashimoto, Junichi Yamagishi, and Isao Echizen, "Privacy-preserving sound to degrade automatic speaker verification performance," Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), pp. 5500-5504, Shanghai, China, March 20-25, 2016.
  33. Kei Sawada, Kei Hashimoto, Keiichiro Oura, and Keiichi Tokuda, "The NITECH HMM-based text-to-speech system for the Blizzard Challenge 2015," Proceedings of Blizzard Challenge 2015 Workshop, Berlin, Germany, September 11, 2015. (web proceedings)
  34. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Simultaneous optimization of multiple tree structures for factor analyzed HMM-based speech synthesis," Proceedings of Interspeech 2015, pp. 1196-1200, Dresden, Germany, September 6-10, 2015.
  35. Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "The effect of neural networks in statistical parametric speech synthesis," Proceedings of 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2015), pp. 4455-4459, Brisbane, Australia, April 19-24, 2015.
  36. Kei Sawada, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, and Keiichi Tokuda, "Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014," Proceedings of Blizzard Challenge 2014 Workshop, Singapore, September 19, 2014. (web proceedings)
  37. Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech," Proceedings of Interspeech 2014, pp. 2494-2498, Singapore, September 14-18, 2014.
  38. Kanako Shirota, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis," Proceedings of 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), pp. 2578-2582, Florence, Italy, May 4-9, 2014.
  39. Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Image recognition based on hidden Markov eigen-image models using variational Bayesian method," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2013 (APSIPA ASC 2013), Kaohsiung, Taiwan, October 29-November 1, 2013.
  40. Shinji Takaki, Kei Sawada, Kei Hashimoto, Keiichiro Oura, and Keiichi Tokuda, "Overview of NITECH HMM-based speech synthesis system for Blizzard Challenge 2013," Proceedings of Blizzard Challenge 2013 Workshop, Barcelona, Spain, September 3, 2013. (web proceedings)
  41. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis," Proceedings of 8th ISCA Speech Synthesis Workshop (SSW8), pp. 317-322, Barcelona, Spain, August 31-September 2, 2013.
  42. Takaya Makino, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Separable lattice 2-D HMMs introducing state duration control for recognition of images with various variations," Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp. 3203-3207, Vancouver, Canada, May 26-31, 2013.
  43. Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Integration of acoustic modeling and mel-cepstral analysis for HMM-based speech synthesis," Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp. 7883-7887, Vancouver, Canada, May 26-31, 2013.
  44. Shinji Takaki, Kei Sawada, Kei Hashimoto, Keiichiro Oura, and Keiichi Tokuda, "Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2012," Proceedings of Blizzard Challenge 2012 Workshop, Portland, Oregon, U.S.A., September 14, 2012. (web proceedings)
  45. Takafumi Hattori, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A Bayesian approach to speaker recognition based on GMMs using multiple model structures," Proceedings of Interspeech 2012, Portland, Oregon, U.S.A., September 9-13, 2012.
  46. Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Face recognition based on separable lattice 2-D HMMs using variational Bayesian method," Proceedings of 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), pp. 2205-2208, Kyoto, Japan, March 25-30, 2012.
  47. Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A model structure integration based on Bayesian framework for speech recognition," Proceedings of 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), pp. 4813-4816, Kyoto, Japan, March 25-30, 2012.
  48. Kei Hashimoto, Shinji Takaki, Keiichiro Oura, and Keiichi Tokuda, "Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2011," Proceedings of Blizzard Challenge 2011 Workshop, Turin, Italy, September 2, 2011. (web proceedings)
  49. Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Multi-speaker modeling with shared prior distributions and model structures for Bayesian speech synthesis," Proceedings of Interspeech 2011, pp. 113-116, Florence, Italy, August 28-31, 2011.
  50. Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, and Keiichi Tokuda, "An analysis of machine translation and speech synthesis in speech-to-speech translation system," Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), pp. 5108-5111, Prague, Czech Republic, May 22-27, 2011.
  51. Keiichiro Oura, Kei Hashimoto, Sayaka Shiota, and Keiichi Tokuda, "Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2010," Proceedings of Blizzard Challenge 2010 Workshop, Kyoto, Japan, September 25, 2010.
  52. Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Bayesian speech synthesis framework integrating training and synthesis processes," Proceedings of 7th ISCA Speech Synthesis Workshop (SSW7), pp. 106-111, Kyoto, Japan, September 22-24, 2010.
  53. Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A Bayesian approach to hidden semi Markov model based speech synthesis," Proceedings of Interspeech 2009, pp. 1751-1754, Brighton, United Kingdom, September 6-10, 2009. (Student Paper Award Finalist)
  54. Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Deterministic annealing based training algorithm for Bayesian speech recognition," Proceedings of Interspeech 2009, pp. 680-683, Brighton, United Kingdom, September 6-10, 2009.
  55. Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro Sumita, and Keiichi Tokuda, "Reordering model using syntactic information of a source tree for statistical machine translation," Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation (SSST-3) at North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT) 2009, pp. 69-77, Boulder, Colorado, U.S.A., June 5, 2009.
  56. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Takashi Masuko, and Keiichi Tokuda, "A Bayesian approach to HMM-based speech synthesis," Proceedings of 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), pp. 4029-4032, Taipei, Taiwan, April 19-24, 2009.
  57. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition," Proceedings of Interspeech 2008, pp. 936-939, Brisbane, Australia, September 22-26, 2008.
  58. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Acoustic modeling based on model structure annealing for speech recognition," Proceedings of Interspeech 2008, pp. 932-935, Brisbane, Australia, September 22-26, 2008.
  59. Tatsuya Ito, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Speaker recognition based on variational Bayesian method," Proceedings of Interspeech 2008, pp. 1417-1420, Brisbane, Australia, September 22-26, 2008.
  60. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Hyperparameter estimation for speech recognition based on variational Bayesian approach," Proceedings of ASA & ASJ Joint Meeting, p. 3042, Honolulu, Hawaii, U.S.A., November 28-December 2, 2006.


Technical Report
  1. Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice synthesis based on a frame-driven attention mechanism considering vocal timing deviation," Technical Report of IEICE, vol. 122, no. 389, SP2022-42, pp. 19-24, Okinawa, February 28-March 1, 2022.
  2. Sota Wada, Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A comparison of neural vocoders in singing voice synthesis," Technical Report of IEICE, vol. 119, no. 321, SP2019-42, pp. 85-90, Tokyo, December 6, 2019.
  3. Takahiro Tsugui, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Synthetic speech-based sound masking for privacy protection when speaking to smartphones in public space," Technical Report of IEICE, vol. 119, no. 321, SP2019-42, pp. 55-60, Tokyo, December 6, 2019.
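  4. Keiichiro Oura, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "DNN-based real-time speech vocoder using periodic and aperiodic signals," IPSJ SIG Technical Report, vol. 2019-SLP-127, no. 34, pp. 1-6, Kyoto, June 22-23, 2019. (in Japanese)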
  5. Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling," Technical Report of IEICE, SP2018-11, pp. 53-58, Nagano, June 28-29, 2018.
  6. Jumpei Niwa, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A study on voice conversion based on WaveNet," Technical Report of IEICE, vol. 117, no. 393, SP2017-84, pp. 99-104, Tokyo, January 20-21, 2018.
  7. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet," Technical Report of IEICE, vol. 117, no. 393, SP2017-83, pp. 93-98, Tokyo, January 20-21, 2018.
  8. Ryohei Funato, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Trajectory training considering power for speech synthesis based on neural networks," Technical Report of IEICE, vol. 117, no. 393, SP2017-74, pp. 43-48, Tokyo, January 20-21, 2018. (Student Poster Award in Speech Field)
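  9. Taichi Asami, Yamato Ohtani, Takuma Okamoto, Tetsuji Ogawa, Tsubasa Ochiai, Hirokazu Kameoka, Kazunori Komatani, Shinji Takaki, Shinnosuke Takamichi, Naohiro Tawara, Hiroaki Nanjo, Kei Hashimoto, Takashi Fukuda, Ryo Masumura, Shigeki Matsuda, Akinobu Lee, and Shinji Watanabe, "Report on the international conference ICASSP 2017," IPSJ SIG Technical Report, vol. 2017-SLP-117, no. 3, pp. 1-8, Miyagi, July 27-28, 2017. (in Japanese)
  10. Taichi Asami, Atsunori Ogawa, Tetsuji Ogawa, Yamato Ohtani, Gakuto Kurata, Daisuke Saito, Sayaka Shiota, Yusuke Shinohara, Masayuki Suzuki, Shinnosuke Takamichi, Hiroaki Nanjo, Kei Hashimoto, Takuya Higuchi, Ryo Masumura, Koichiro Yoshino, and Shinji Watanabe, "Report on the international conference INTERSPEECH 2016," IPSJ SIG Technical Report, vol. 2017-SLP-115, no. 7, pp. 1-7, Kagawa, February 17-18, 2017. (in Japanese)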
  11. Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Simultaneous modeling of acoustic feature sequences and its temporal structures for DNN-based speech synthesis," Technical Report of IEICE, vol. 116, no. 414, SP2016-76, pp. 71-76, Tokyo, January 21, 2017. (IEICE ISS Young Researcher's Award in Speech Field)
  12. Chiaki Asai, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Designing linguistic features for expressive speech synthesis using audiobooks," Technical Report of IEICE, vol. 116, no. 414, SP2016-70, pp. 35-40, Tokyo, January 21, 2017.
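  13. Nobuaki Minematsu, Yuya Akita, Taichi Asami, Nobutaka Ito, Tsubasa Ochiai, Tomoki Koriyama, Daisuke Saito, Sayaka Shiota, Takahiro Shinozaki, Masayuki Suzuki, Shinji Takaki, Naohiro Tawara, Kei Hashimoto, Takuya Higuchi, and Takashi Fukuda, "Report on participation in the international conference ICASSP 2016," IPSJ SIG Technical Report, vol. 2016-SLP-112, no. 5, pp. 1-6, Yamagata, July 28-30, 2016. (in Japanese)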
  14. Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Image recognition based on discriminative models using features extracted from separable lattice HMMs," Technical Report of IEICE, vol. 116, no. 89, PRMU2016-36, pp. 7-12, Tokyo, June 13-14, 2016.
  15. Masato Sukegawa, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Parameter sharing structures of separable lattice HMMs using mixture output distributions for image recognition," Technical Report of IEICE, vol. 115, no. 456, PRMU2015-138, pp. 37-42, Fukuoka, February 21-22, 2016.
  16. Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Evaluation of text-to-speech system construction for unknown-pronunciation languages," Technical Report of IEICE, vol. 115, no. 346, SP2015-80, pp. 93-98, Aichi, December 2-3, 2015.
  17. Kei Hashimoto, Junichi Yamagishi, and Isao Echizen, "Investigation of privacy-preserving sounds to degrade automatic speaker verification performance," Technical Report of IEICE, vol. 115, no. 146, SP2015-49, pp. 79-84, Suwa, July 16-17, 2015.
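  18. Takuma Okamoto, Tetsuji Ogawa, Tsubasa Ochiai, Yosuke Kashiwagi, Hirokazu Kameoka, Keisuke Kinoshita, Tomoki Koriyama, Daisuke Saito, Takahiro Shinozaki, Shinji Takaki, Tetsuya Takiguchi, Yuuki Tachioka, Naohiro Tawara, Kei Hashimoto, Masakiyo Fujimoto, Shigeki Matsuda, Masato Mimura, Takuya Yoshioka, and Shinji Watanabe, "Report on participation in the international conference ICASSP 2015," IPSJ SIG Technical Report, vol. 2015-SLP-107, no. 3, pp. 1-7, Suwa, July 16-17, 2015. (in Japanese)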
  19. Koji Mushika, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A robust modeling technique against training data errors for HMM-based singing voice synthesis," IPSJ SIG Technical Report, vol. 2015-MUS-106, no. 13, pp. 1-6, Kofu, March 2-3, 2015.
  20. Akifumi Tsuge, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Speaker recognition based on log-linear models using feature generation by variational Bayesian method," Technical Report of IEICE, vol. 113, no. 404, pp. 13-18, Nagoya, January 23-24, 2014.
  21. Takaya Makino, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Extended separable lattice HMMs with state duration control for recognition of images with variations," Technical Report of IEICE, vol. 112, no. 441, pp. 149-154, Osaka, February 21-22, 2013.
  22. Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Image recognition based on hidden Markov eigen-image models with the variational Bayesian method," Technical Report of IEICE, vol. 112, no. 441, pp. 155-160, Osaka, February 21-22, 2013.
  23. Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Face recognition based on separable lattice 2-D HMMs with variational Bayesian method," Technical Report of IEICE, vol. 111, no. 317, pp. 125-130, Nagasaki, November 24-25, 2011.
  24. Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Bayesian speech recognition based on model structure integration," Technical Report of IEICE, vol. 111, no. 97, pp. 11-16, Nagoya, June 23-24, 2011. (IEICE ISS Young Researcher's Award in Speech Field)
  25. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, and Keiichi Tokuda, "Bayesian context clustering using cross validation for HMM-based speech synthesis," Technical Report of IEICE, vol. 108, no. 338, pp. 73-78, Tokyo, December 9-10, 2008.
  26. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Speech recognition based on statistical models including multiple decision trees," Technical Report of IEICE, vol. 108, no. 338, pp. 221-226, Tokyo, December 9-10, 2008.
  27. Tatsuya Ito, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Speaker recognition based on Gaussian mixture models using variational Bayesian method," Technical Report of IEICE, vol. 108, no. 338, pp. 185-190, Tokyo, December 9-10, 2008.
  28. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Acoustic modeling based on model structure annealing for speech recognition," Technical Report of IEICE, vol. 107, no. 165, pp. 67-72, Toyama, July 26-27, 2007.


Domestic Conference
  1. Hikaru Aohara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Consideration on periodic excitation signals in source-filter type neural vocoders," Proceedings of ASJ2024 spring meeting, pp. 813-816, March 6-8, 2024.
  2. Miduki Hodotsuka, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Speaker identification using semi-supervised learning with Noisy Student," Proceedings of ASJ2024 spring meeting, pp. 837-840, March 6-8, 2024.
  3. Sota Nakamura, Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Design of a control interface for text-to-speech synthesis with 55 selectable styles," Proceedings of ASJ2023 autumn meeting, pp. 1141-1144, September 26-28, 2023.
  4. Ikuya Hasegawa, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Active cancellation of self-vocalization sounds via bone conduction for real-time voice quality conversion," Proceedings of ASJ2023 autumn meeting, pp. 1291-1294, September 26-28, 2023.
  5. Hikaru Suzuki, Takato Fujimoto, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Real-time voice conversion in consideration of output delay and time warping transformation," Proceedings of ASJ2023 autumn meeting, pp. 1081-1084, September 26-28, 2023.
  6. Shion Hukuda, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A neural vocoder training method using a pitch extractor for fundamental frequency controllability," Proceedings of ASJ2023 autumn meeting, pp. 1065-1068, September 26-28, 2023.
  7. Suzuka Sato, Takato Fujimoto, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Neural vocoder based on disentangled representation learning to control fundamental frequency," Proceedings of ASJ2023 autumn meeting, pp. 1061-1064, September 26-28, 2023.
  8. Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "V2Coder: A neural vocoder based on hierarchical variational autoencoders," Proceedings of ASJ2023 autumn meeting, pp. 1051-1054, September 26-28, 2023.
  9. Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "PeriodGrad: A neural vocoder based on a diffusion probabilistic model with fundamental frequency controllability," Proceedings of ASJ2023 autumn meeting, pp. 1045-1048, September 26-28, 2023.
  10. Ryusei Tanaka, Atsushi Yamada, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice conversion with a small amount of training data using feature extraction by self-supervised learning and coarse-fine conversion," Proceedings of ASJ2023 spring meeting, pp. 705-708, March 15-17, 2023.
  11. Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A study on vocal timing modeling for sequence-to-sequence singing voice synthesis," Proceedings of ASJ2022 autumn meeting, pp. 1359-1362, September 14-16, 2022.
  12. Ryusei Ishida, Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Parameter sharing structures in speech synthesis using structured attention based on a hidden semi-Markov model," Proceedings of ASJ2022 autumn meeting, pp. 1199-1202, September 14-16, 2022.
  13. Yuya Shiraki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "End-to-end speech recognition with sequence discriminative training under constraints of decoding algorithms," Proceedings of ASJ2022 autumn meeting, pp. 1141-1144, September 14-16, 2022.
  14. Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A study on musical note position-aware attention mechanism for sequence-to-sequence singing voice synthesis," Proceedings of ASJ2022 autumn meeting, pp. 1589-1592, September 14-16, 2022.
  15. Takenori Yoshimura, Shinji Takaki, Kazuhiro Nakamura, Keiichiro Oura, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Embedding a differentiable mel-cepstral synthesis filter to an end-to-end speech synthesis system," Proceedings of ASJ2022 autumn meeting, pp. 1585-1588, September 14-16, 2022.
  16. Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Japanese end-to-end speech synthesis based on hierarchical generative models using semi-supervised learning," Proceedings of ASJ2022 autumn meeting, pp. 1579-1582, September 14-16, 2022.
  17. Yukiya Hono, Shinji Takaki, Kei Hashimoto, Kazuhiro Nakamura, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Neural vocoder training considering the aperiodic measure," Proceedings of ASJ2022 spring meeting, pp. 973-976, March 9-11, 2022. (Awaya Prize Young Researcher Award)
  18. Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Memory reduction methods for sequence-to-sequence speech synthesis using a hidden semi-Markov model based structured attention mechanism," Proceedings of ASJ2022 spring meeting, pp. 969-972, March 9-11, 2022.
  19. Kazumasa Sasaki, Takenori Yoshimura, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Neural vocoders which can control voice characteristics, average pitch and speaking rate," Proceedings of ASJ2022 spring meeting, pp. 935-938, March 9-11, 2022.
  20. Keisuke Hiramitsu, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Cross-modal speaker adaptation using face image information in speech synthesis based on deep learning," Proceedings of ASJ2022 spring meeting, pp. 905-906, March 9-11, 2022.
  21. Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Autoregressive variational autoencoder-based sequence-to-sequence speech synthesis using a hidden semi-Markov model based structured attention mechanism," Proceedings of ASJ2021 autumn meeting, pp. 915-918, September 7-9, 2021.
  22. Yukiya Hono, Taisei Kato, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Sequence-to-sequence singing voice synthesis considering vocal timing fluctuation," Proceedings of ASJ2021 autumn meeting, pp. 911-914, September 7-9, 2021.
  23. Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Automatic pitch correction of tuneless singing for DNN-based singing voice synthesis," Proceedings of ASJ2021 autumn meeting, pp. 907-910, September 7-9, 2021.
  24. Shinji Takaki, Koichi Ushida, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A factor analyzed voice model for sequence-to-sequence speech synthesis using a hidden semi-Markov model based structured attention mechanism," Proceedings of ASJ2021 autumn meeting, pp. 871-874, September 7-9, 2021.
  25. Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Variational autoencoder-based autoregressive sequence-to-sequence speech synthesis considering consistency between training and synthesis," Proceedings of ASJ2021 spring meeting, pp. 947-950, March 10-12, 2021. (Student Presentation Award)
  26. Kenta Sumiya, Takenori Yoshimura, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Sequence-to-sequence speech synthesis using a hidden semi-Markov model based structured attention mechanism," Proceedings of ASJ2021 spring meeting, pp. 943-946, March 10-12, 2021. (Student Presentation Award)
  27. Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "An investigation of modeling speech waveform by neural vocoder based on periodic/aperiodic decomposition," Proceedings of ASJ2021 spring meeting, pp. 861-864, March 10-12, 2021.
  28. Kohei Iwata, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "An investigation of speech synthesis based on gradient boosting decision trees," Proceedings of ASJ2021 spring meeting, pp. 813-814, March 10-12, 2021.
  29. 平光啓祐, 橋本佳, 南角吉彦, 徳田恵一, "深層学習に基づく音声合成における顔画像を用いた話者適応," 第18回情報学ワークショップ, November 28, 2020.
  30. 車田智哉, 木下耕介, 吉村建慶, 橋本佳, 南角吉彦, 徳田恵一, "生成モデルの構造を組み込んだ系列変分オートエンコーダに基づく話者認識," 第18回情報学ワークショップ, November 28, 2020.
  31. 西村愛理, 藤本崇人, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一, "出力遅延を考慮したアテンション機構に基づくリアルタイム声質変換," 第18回情報学ワークショップ, November 28, 2020.
  32. 久野宏彰, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一, "音声合成における特徴的な発話スタイルの転移学習," 第18回情報学ワークショップ, November 28, 2020.
  33. 成田哲郎, 吉村建慶, 橋本佳, 南角吉彦, 徳田恵一, "ニューラルボコーダを用いた音声符号化手法の検討," 第18回情報学ワークショップ, November 28, 2020.
  34. 大谷眞史, 佐藤優介, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一, "音声合成における敵対的生成ネットワークを用いた複数言語・複数話者モデリングの検討," 第18回情報学ワークショップ, November 28, 2020.
  35. 佐々木一匡, 吉村建慶, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一, "大規模音楽データを活用した汎用WaveNetボコーダ構成法の検討," 第18回情報学ワークショップ, November 28, 2020.
  36. 厚地俊哉, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一, "音声プライバシー保護のためのノンパラレル声質変換による話者匿名化の検討," 第18回情報学ワークショップ, November 28, 2020.
  37. 岩田康平, 高木信二, 橋本佳, 南角吉彦, 徳田恵一, "勾配ブースティング決定木を用いた高速な音声合成手法の検討," 第18回情報学ワークショップ, November 28, 2020.
  38. 前川遼太朗, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一, "深層学習に基づく楽器音合成における音響モデルの比較検討," 第18回情報学ワークショップ, November 28, 2020.
  39. 木村俊介, 橋本佳, 南角吉彦, 徳田恵一, "幾何学的変動に頑健な画像認識のための深層学習モデルの検討," 第18回情報学ワークショップ, November 28, 2020.
  40. 小林睦, 橋本佳, 南角吉彦, 徳田恵一, "統計モデルに基づくドライバ認知負荷推定の検討," 第18回情報学ワークショップ, November 28, 2020.
  41. Yukiya Hono, Kazuna Tsuboi, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Expressive speech synthesis using hierarchical multi-grained generative model," Proceedings of ASJ2020 autumn meeting, pp. 791-794, September 9-11, 2020.
  42. Takato Fujimoto, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Dirichlet VAE for emotional speech synthesis," Proceedings of ASJ2020 autumn meeting, pp. 789-790, September 9-11, 2020.
  43. Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "An investigation of modeling periodic and aperiodic components in speech vocoder based on deep neural networks," Proceedings of ASJ2020 autumn meeting, pp. 759-760, September 9-11, 2020.
  44. Masafumi Otani, Yusuke Sato, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Multi-language multi-speaker modeling for speech synthesis using generative adversarial networks," Proceedings of ASJ2020 autumn meeting, pp. 695-696, September 9-11, 2020.
  45. Takato Fujimoto, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "End-to-end speech synthesis based on hierarchical generative models for semi-supervised learning," Proceedings of ASJ2020 spring meeting, pp. 1039-1042, Saitama, March 16-18, 2020.
  46. Keiichiro Oura, Shinji Takaki, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Generative adversarial network based real-time speech vocoder with periodic/aperiodic inputs," Proceedings of ASJ2019 autumn meeting, pp. 997-998, Shiga, September 4-6, 2019. (Awaya Prize Young Researcher Award)
  47. Shumma Murata, Takato Fujimoto, Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A study on singing voice synthesis with attention mechanism using musical score time information," Proceedings of ASJ2019 autumn meeting, pp. 943-944, Shiga, September 4-6, 2019.
  48. Kazuhiro Nakamura, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Computational complexity reduction method for CNN-based singing voice synthesis," Proceedings of ASJ2019 autumn meeting, pp. 939-940, Shiga, September 4-6, 2019.
  49. Kenta Sumiya, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A study of adversarial learning for emotional speech synthesis based on deep neural networks," Proceedings of ASJ2019 spring meeting, pp. 1359-1360, Tokyo, March 5-7, 2019.
  50. Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "CNN-based speech parameter generation for singing voice synthesis," Proceedings of ASJ2019 spring meeting, pp. 1035-1038, Tokyo, March 5-7, 2019.
  51. Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice synthesis using generative adversarial networks," Proceedings of ASJ2019 spring meeting, pp. 1039-1040, Tokyo, March 5-7, 2019.
  52. Kei Sawada, Kazuna Tsuboi, Xianchao Wu, Zhan Chen, Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "AI Singer Rinna: a singing voice synthesis system using user's singing voice or musical score," Proceedings of ASJ2019 spring meeting, pp. 1041-1044, Tokyo, March 5-7, 2019.
  53. Keiichiro Oura, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Deep neural network based speech vocoder with periodic/aperiodic inputs," Proceedings of ASJ2019 spring meeting, pp. 1049-1052, Tokyo, March 5-7, 2019.
  54. Takato Fujimoto, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Impacts of input linguistic features on Japanese end-to-end speech synthesis," Proceedings of ASJ2019 spring meeting, pp. 1061-1062, Tokyo, March 5-7, 2019.
  55. Motoki Shimada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Reducing computational costs for speech synthesis based on deep neural networks using hidden semi-Markov model structures," Proceedings of ASJ2019 spring meeting, pp. 1071-1072, Tokyo, March 5-7, 2019.
  56. Yukiya Hono, Shumma Murata, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A DNN-based singing voice synthesis system -- Sinsy," Proceedings of ASJ2018 autumn meeting, pp. 1099-1102, Oita, September 12-14, 2018.
  57. Takato Fujimoto, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Periodic/aperiodic decomposition based speech synthesis using WaveNet vocoder," Proceedings of ASJ2018 autumn meeting, pp. 1125-1126, Oita, September 12-14, 2018.
  58. Takahiro Tsugui, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Sound masking for privacy protection when speaking to smart devices in public space," Proceedings of ASJ2018 autumn meeting, pp. 883-884, Oita, September 12-14, 2018.
  59. Takenori Yoshimura, Natsumi Koike, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Feature extraction based on sequential variational autoencoder for speaker recognition," Proceedings of ASJ2018 autumn meeting, pp. 1341-1344, Oita, September 12-14, 2018.
  60. Kei Sawada, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Overview of the NITech text-to-speech system for the Blizzard Challenge 2018," Proceedings of ASJ2018 autumn meeting, pp. 1091-1094, Oita, September 12-14, 2018.
  61. Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice synthesis based on neural network using a structure of a hidden semi-Markov model," Proceedings of ASJ2018 spring meeting, pp. 247-248, Saitama, March 13-15, 2018.
  62. Shumma Murata, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Singing voice synthesis using timing-lab model based on a deep neural network," Proceedings of ASJ2018 spring meeting, pp. 245-246, Saitama, March 13-15, 2018.
  63. Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Overview of the NITech text-to-speech system for the Blizzard Challenge 2017," Proceedings of ASJ2017 autumn meeting, pp. 287-290, Ehime, September 25-27, 2017.
  64. Yukiya Hono, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda, Daisuke Kondo, and Daisuke Ichikawa, "Singing voice conversion using post data in music SNS," Proceedings of ASJ2017 autumn meeting, pp. 209-210, Ehime, September 25-27, 2017.
  65. Jumpei Niwa, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "WaveNet-based voice conversion," Proceedings of ASJ2017 autumn meeting, pp. 207-208, Ehime, September 25-27, 2017.
  66. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Mel-cepstrum based quantization noise shaping applied for WaveNet," Proceedings of ASJ2017 autumn meeting, pp. 193-194, Ehime, September 25-27, 2017. (Student Presentation Award)
  67. Shiori Murase, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Investigation of conditions for acoustic feature extraction for speech synthesis based on deep neural networks," Proceedings of ASJ2017 spring meeting, pp. 263-264, Kanagawa, March 15-17, 2017.
  68. Yushi Ichikawa, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Voice conversion based on hybrid deep neural network - Gaussian mixture model," Proceedings of ASJ2017 spring meeting, pp. 233-234, Kanagawa, March 15-17, 2017. (Student Presentation Award)
  69. Kei Hashimoto, Junichi Yamagishi, and Isao Echizen, "Privacy-preserving sounds based on universal background models to degrade performance of automatic speaker verification systems," Proceedings of ASJ2016 spring meeting, pp. 131-132, Kanagawa, March 9-11, 2016.
  70. Keiichiro Oura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Prior probability distribution based on musical score for HMM-based singing voice synthesis," Proceedings of ASJ2016 spring meeting, pp. 245-246, Kanagawa, March 9-11, 2016.
  71. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A design of voice recording software tool to effectively collect speech data based on crowdsourcing," Proceedings of ASJ2016 spring meeting, pp. 307-308, Kanagawa, March 9-11, 2016.
  72. Tatsuya Suzuki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Investigation of voice fundamental frequency estimation based on conditional random fields," Proceedings of ASJ2016 spring meeting, pp. 279-280, Kanagawa, March 9-11, 2016.
  73. Masanari Nishimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Investigation of singing voice synthesis based on deep neural networks," Proceedings of ASJ2016 spring meeting, pp. 213-214, Kanagawa, March 9-11, 2016.
  74. Naoki Hosaka, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Trajectory training considering global variance for voice conversion based on neural network," Proceedings of ASJ2016 spring meeting, pp. 239-240, Kanagawa, March 9-11, 2016.
  75. Kei Sawada, Kazuki Igami, Chiaki Asai, Yusuke Sato, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Automatic construction of training corpus using audiobooks for statistical parametric speech synthesis," Proceedings of ASJ2016 spring meeting, pp. 219-220, Kanagawa, March 9-11, 2016. (Student Presentation Award)
  76. Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Trajectory model training considering global variance for speech synthesis based on neural network," Proceedings of ASJ2015 autumn meeting, pp. 237-238, Fukushima, September 16-18, 2015.
  77. Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A mel-cepstral analysis technique restoring missing high-frequency components of speech for HMM-based speech synthesis," Proceedings of ASJ2015 autumn meeting, pp. 233-234, Fukushima, September 16-18, 2015.
  78. Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Investigation of text-to-speech system construction in unknown-pronunciation language," Proceedings of ASJ2015 autumn meeting, pp. 231-232, Fukushima, September 16-18, 2015.
  79. Kei Hashimoto, Junichi Yamagishi, and Isao Echizen, "Investigation of privacy-preserving sounds to degrade performance of automatic speaker verification systems," Proceedings of ASJ2015 autumn meeting, pp. 27-28, Fukushima, September 16-18, 2015.
  80. Masaya Hashimoto, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Speaker recognition based on log linear models using multiple acoustic features," Proceedings of ASJ2015 autumn meeting, pp. 25-26, Fukushima, September 16-18, 2015.
  81. Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Investigation of neural network based speech synthesis using generative models," Proceedings of ASJ2014 autumn meeting, pp. 245-246, Hokkaido, September 3-5, 2014. (Awaya Prize Young Researcher Award)
  82. Takenori Yoshimura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A clustering technique for factor analyzed HMM-based speech synthesis," Proceedings of ASJ2014 autumn meeting, pp. 239-240, Hokkaido, September 3-5, 2014.
  83. Shota Kamiya, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Simultaneous high/low accent and acoustic model training for HMM-based speech synthesis," Proceedings of ASJ2014 autumn meeting, pp. 237-238, Hokkaido, September 3-5, 2014. (Student Presentation Award)
  84. Yusuke Sato, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Evaluation of a cross-lingual speaker adaptation technique using joint-eigenvoices with a perceptual characteristic space," Proceedings of ASJ2014 spring meeting, pp. 325-326, Tokyo, March 10-12, 2014.
  85. Koki Tsuruno, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "GMM-based voice conversion using a modified posterior probability function," Proceedings of ASJ2014 spring meeting, pp. 325-326, Tokyo, March 10-12, 2014.
  86. Koji Mushika, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A robust acoustic modeling technique against training data errors in HMM-based singing voice synthesis," Proceedings of ASJ2014 spring meeting, pp. 335-336, Tokyo, March 10-12, 2014.
  87. Takashi Aritaka, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Comparing spectrum representation methods related to LSPs for HMM-based speech synthesis," Proceedings of ASJ2014 spring meeting, pp. 337-338, Tokyo, March 10-12, 2014.
  88. Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "A mel-cepstral analysis technique restoring missing high-frequency components from low-sampling-rate speech," Proceedings of ASJ2014 spring meeting, pp. 339-340, Tokyo, March 10-12, 2014. (Student Presentation Award)
  89. Keiichiro Oura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "The use of state-level contexts in HMM-based speech synthesis," Proceedings of ASJ2014 spring meeting, pp. 341-342, Tokyo, March 10-12, 2014.
  90. Akifumi Tsuge, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Speaker recognition based on log-linear models using Bayesian statistics," Proceedings of ASJ2013 autumn meeting, pp. 73-74, Aichi, September 25-27, 2013.
  91. Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Cross-lingual speaker adaptation based on factor analysis using bilingual speech data," Proceedings of ASJ2013 spring meeting, pp. 267-268, Tokyo, March 13-15, 2013.
  92. Viviane de Franca Oliveira, Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Cross-lingual speaker adaptation for HMM-based speech synthesis using joint-eigenvoices with a space of perceptual characteristics," Proceedings of ASJ2013 spring meeting, pp. 269-270, Tokyo, March 13-15, 2013.
  93. Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Integration of acoustic modeling and mel-cepstral analysis for HMM-based speech synthesis," Proceedings of ASJ2013 spring meeting, pp. 289-290, Tokyo, March 13-15, 2013.
  94. Shuichi Kuwako, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Utterance adaptive training in factor analyzed acoustic models for HMM-based speech synthesis," Proceedings of ASJ2013 spring meeting, pp. 291-292, Tokyo, March 13-15, 2013.
  95. Shoto Kitamura, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, "Automatic correction of tuneless singing using pitch adaptive training for HMM-based singing voice synthesis," Proceedings of ASJ2013 spring meeting, pp. 337-338, Tokyo, March 13-15, 2013.
  96. Takafumi Hattori, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A Bayesian approach to speaker recognition based on GMMs using multiple model structures," Proceedings of ASJ2012 autumn meeting, pp. 39-40, Nagano, September 19-21, 2012.
  97. Kei Hashimoto, Junichi Yamagishi, Peter Bell, Simon King, Steve Renals, and Keiichi Tokuda, "Linear regression based on the variational Bayesian method for HMM-based speech synthesis," Proceedings of ASJ2012 spring meeting, pp. 403-404, Kanagawa, March 13-15, 2012.
  98. Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "A training algorithm based on variational Bayesian method using deterministic annealing process for separable lattice 2-D HMMs," The 74th National Convention of IPSJ, pp. 409-410, Kanagawa, March 13-15, 2012.
  99. Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Bayesian speech synthesis sharing prior distributions and model structures among multiple speakers," Proceedings of ASJ2011 autumn meeting, pp. 345-348, Shimane, September 20-22, 2011.
  100. Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, and Keiichi Tokuda, "An evaluation and analysis of machine translation and speech synthesis in speech-to-speech translation," Proceedings of ASJ2011 spring meeting, pp. 315-316, Tokyo, March 9-11, 2011.
  101. Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Acoustic modeling based on model structure annealing for Bayesian speech recognition," Proceedings of ASJ2011 spring meeting, pp. 21-24, Tokyo, March 9-11, 2011. (Student Presentation Award)
  102. Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Bayesian speech synthesis integrating training and synthesis processes," Proceedings of ASJ2010 autumn meeting, pp. 243-244, Osaka, September 14-16, 2010.
  103. Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Evaluation of HSMM-based speech synthesis based on Bayesian framework," Proceedings of ASJ2009 autumn meeting, pp. 257-258, Fukushima, September 15-17, 2009.
  104. Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Training algorithm based on deterministic annealing for Bayesian speech recognition," Proceedings of ASJ2009 autumn meeting, pp. 3-6, Fukushima, September 15-17, 2009.
  105. Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, "Hidden semi-Markov model based Bayesian speech synthesis," Proceedings of ASJ2009 spring meeting, pp. 303-304, Tokyo, March 17-19, 2009.
  106. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "HMM based speech synthesis using cross validation for Bayesian criterion," Proceedings of ASJ2008 autumn meeting, pp. 251-252, Nagano, September 10-12, 2008.
  107. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Speech recognition based on multiple phonetic decision tree structures," Proceedings of ASJ2008 autumn meeting, pp. 125-126, Nagano, September 10-12, 2008.
  108. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Context clustering using cross validation based on Bayesian criterion," Proceedings of ASJ2008 spring meeting, pp. 69-70, Chiba, March 17-19, 2008.
  109. Tatsuya Ito, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Speaker recognition based on variational Bayesian method," Proceedings of ASJ2008 spring meeting, pp. 143-144, Chiba, March 17-19, 2008.
  110. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Hyper-parameter tying structure for speech recognition based on variational Bayesian method," Proceedings of ASJ2007 autumn meeting, pp. 139-142, Yamanashi, September 19-21, 2007.
  111. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, "Acoustic modeling based on model structure annealing for speech recognition," Proceedings of ASJ2007 autumn meeting, pp. 143-146, Yamanashi, September 19-21, 2007.


Book
  1. 橋本佳(分担執筆), "人工知能学大辞典(音声合成(HMM 合成方式))," 人工知能学会編, 共立出版, pp. 785-787, 2017年7月7日. (ISBN:978-4-320-12420-2)
  2. Keiichi Tokuda, Akinobu Lee, Yoshihiko Nankaku, Keiichiro Oura, Kei Hashimoto, Daisuke Yamamoto, Ichi Takumi, Takahiro Uchiya, Shuhei Tsutsumi, Steve Renals, and Junichi Yamagishi (contributing authors), "User generated dialogue systems: uDialogue," Human Harmonized Information Technology, Volume 2, Springer, pp. 77-114, May, 2017. (ISBN:978-4-431-56533-8) (DOI: 10.1007/978-4-431-56535-2)
  3. 橋本佳(分担執筆), "音響学入門ペディア(Q34 統計的音声合成の仕組みを教えてください)," 日本音響学会編, コロナ社, pp. 136-139, 2017年3月15日. (ISBN:978-4-339-00895-1)


Thesis
  1. Kei Hashimoto, "Statistical models of machine translation, speech recognition, and speech synthesis for speech-to-speech translation," Doctoral Dissertation, February 2011.
  2. Kei Hashimoto, "Estimation of Prior Distribution for Speech Recognition Based on Bayesian Criterion," Master's Thesis, February 2008.
  3. Kei Hashimoto, "Investigation of Prior Distribution for Speech Recognition Based on Bayesian Criterion," Graduation Thesis, February 2006.


Talk
  1. 橋本佳, "音声合成における深層学習," 日本音響学会2017年秋季研究発表会ビギナーズセミナー, 愛媛, 2017年9月25日.