論文

  1. 中村 和寛, 大浦 圭一郎, 南角 吉彦, 徳田 恵一, “隠れマルコフモデ ルに基づく英語歌声合成,” 電子情報通信学会, vol.J97-D, no.10, pp.1572–1581, October 2014.

  2. Akira Tamamori, Yoshihiko Nankaku, Keiichi Tokuda, “Image recognition based on separable lattice trajectory 2-D HMMs,” IEICE Transactions on Information and Systems, vol.E97-D, no.7, pp.1842–1854, July 2014.

  3. Hongwu Yang, Keiichiro Oura, Haiyan Wang, Zhenye Gan, Keiichi Tokuda, “Using speaker adaptive training to realize Mandarin-Tibetan cross-lingual speech synthesis,” Springer, Multimedia Tools and Applications, June 2014.

  4. Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, “Integration of spectral feature extraction and modeling for HMM-based speech synthesis,” IEICE Transactions on Information and Systems, vol.E97-D, no.6, pp.1438–1448, June 2014.

  5. Shinji Takaki, Yoshihiko Nankaku and Keiichi Tokuda, “Contextual additive structure for HMM-based speech synthesis,” IEEE Journal of Selected Topics in Signal Processing, vol.8, issue 2, pp.229–238, April 2014.

  6. Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, “A Bayesian framework using multiple model structures for speech recognition,” IEICE Transactions on Information and Systems, vol.E96-D, no.4, pp.939–948, April 2013.

  7. John Dines, Hui Liang, Lakshmi Saheer, Matthew Gibson, William Byrne, Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, “Personalising speech-to-speech translation: unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis,” Computer Speech and Language, vol.27, pp.420–437, February 2013.

  8. Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, Keiichi Tokuda, “Impacts of machine translation and speech synthesis on speech-to-speech translation,” Speech Communication, vol.54, no.7, pp.857–866, September 2012.

  9. Akira Tamamori, Yoshihiko Nankuaku, Keiichi Tokuda, “An extension of separable lattice 2-D HMMs for rotational data variations,” IEICE Transactions on Information and Systems, vol.E95-D, no.8, pp.2074–2083, August 2012.

  10. Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, Keiichi Tokuda, “Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping,” Speech Communication, vol.54, no.6, pp.703–714, July 2012.

  11. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda, “Speech recognition based on statistical models including multiple phonetic decision trees” Acoustical Science and Technology, vol.32, no.6, pp.236-243, June 2011.

  12. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda, “Bayesian context clustering using cross validation for speech recognition” IEICE Transactions on Information and Systems, vol.E94-D, no.3, pp.668–678, March 2011.

  13. 寺嶌 立太, 全 炳河, 南角 吉彦, 徳田 恵一, “フレーム単位の コンテキスト依存構造に基づく音声認識のための音響モデル,” 電気 学会論文集C (電子・情報・システム部門誌), vol.130, no.10, pp.1856–1864, Nov. 2010.

  14. Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, “Continuous stochastic feature mapping based on trajectory HMMs,” IEEE Transactions on Audio, Speech, and Language Processing, vol.18, no.5, pp.417–430, Feb. 2011.

  15. 寺嶌立太, 吉村貴克, 脇田敏裕, 徳田恵一, 北村正, “HMM音声合 成に基づく音声認識率予測手法,”電気学会論文誌C (電子・情報・シ ステム部門誌), vol.130, no.4, pp.557–564, April 2010.

  16. Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda, “A covariance-tying technique for HMM-based speech synthesis,” IEICE Transactions on Information and Systems, vol.E93-D, no.3, pp.595–601, March 2010.

  17. Junichi Yamagishi, Bela Usabaev, Simon King, Oliver Watts, John Dines, Jilei Tian, Yong Guan, Rile Hu, Keiichiro Oura, Yi-Jian Wu, Keiichi Tokuda, Reima Karhila, Mikko Kurimo “Thousands of Voices for HMM-Based Speech Synthesis–Analysis and Application of TTS Systems Built on Various ASR Corpora,” IEEE Transactions on Audio, Speech, and Language Processing, vol.18, no.5, pp.984–1004, July 2010.

  18. Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro Sumita, Keiichi Tokuda, “A reordering model using a source-side parse-tree for statistical machine translation,” IEICE Transactions on Information and Systems, vol.E92-D, no.12, pp.2386–2393, December 2009.

  19. Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhen-Hua Ling, Tomoki Toda, Keiichi Tokuda, Simon King, Steve Renals, “A robust speaker-adaptive HMM-based text-to-speech synthesis,” IEEE Transactions on Audio, Speech, and Language Processing, vol.17, no.6, August 2009.

  20. Heiga Zen, Keiichi Tokuda, Alan W. Black, “Statistical parametric speech synthesis,” Speech Communication, vol.51, no.11, pp.1039-1154, November 2009.

  21. Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda, “A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System,” IEICE Transactions on Information and Systems, vol.E91-D, no.11, pp.2693–2700, Nov 2008.

  22. Heiga Zen, Tomoki Toda, Keiichi Tokuda, “The Nitech-NAIST HMM based speech synthesis system for the Blizzard Challenge 2006,” IEICE Transactions on Information and Systems, vol.E91-D, no.6, pp.1764–1773, June 2008.

  23. Tomoki Toda, Alan W. Black, and Keiichi Tokuda, “Statistical mapping between articulatory movements and acoustic spectrum with a Gaussian mixture model,” Speech Communication, vol.50, no.3, pp.215–227, March 2008.

  24. Tomoki Toda, Alan W. Black, and Keiichi Tokuda, “Voice conversion based on maximum likelihood estimation of speech parameter trajectory,” IEEE Transactions on Audio, Speech and Language Processing, vol.15, no.8, pp.2222–2235, November 2007.

  25. Ranniery Maia, Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, and Fernando G. V. Resende, “An HMM-based Brazilian Portuguese speech synthesizer and its characteristics,” Journal of Communication and Information and Systems, vol.21, no.2, pp.58–71, Aug. 2006.

  26. Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, “Hidden Semi-Markov Model Based Speech Synthesis System,” IEICE Transactions on Information and Systems, vol.E90-D, no.5, pp.825–834, May 2007.

  27. Tomoki Toda and Keiichi Tokuda, “Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis,” IEICE Transactions on Information and Systems, vol.E90-D, no.5, pp.816–824, May 2007.

  28. Heiga Zen, Tomoki Toda, Masaru Nakamura, and Keiichi Tokuda, “Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005,” IEICE Transactions on Information and Systems, vol.E90-D, no.1, pp.325–333, Jan. 2007.

  29. 河井 恒, 戸田智基, 山岸順一, 平井俊男, 倪 晋富, 西澤信行, 津崎 実, 徳田恵一, “大規模コーパスを用いた音声合成システム XIMERA,” 電子情報通信学会論文誌(D-II),J89-D-II, no.12, pp.2688-2698, Dec. 2006.

  30. Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, “Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences,” Computer Speech and Language, vol.21, no.1, pp.153–173, Jan. 2007.

  31. Amaro Lima, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, and Fernando G. Resende, “Applying sparse KPCA for feature extraction in speech recognition,” IEICE Transactions on Information and Systems, vol.E88-D, no.3, pp.401–409, March 2005.

  32. Hiroyuki Suzuki, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, and Tadashi Kitamura, “Continuous speech recognition based on general factor dependent acoustic models,” IEICE Transactions on Information and Systems, vol.E88-D, no.3, pp.410–417, March 2005.

  33. Hiroyoshi Yamamoto, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, and Tadashi Kitamura, “Parameter Sharing in Mixture of Factor Analyzers for Speaker Identification,” IEICE Transactions on Information and Systems, vol.E88-D, no.3, pp.419–424, March 2005.

  34. Yohei Itaya, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, and Tadashi Kitamura, “Deterministic annealing EM algorithm in acoustic modeling for speaker and speech recognition,” IEICE Transactions on Information and Systems, vol.E88-D, no.3, pp.425–431, March 2005.

  35. Amaro Lima, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura, “On the use of kernel PCA for feature extraction in speech recognition,” IEICE Transactions on Information and Systems, vol.E87-D, no.12, pp.2802–2811, Dec. 2004.

  36. 全 炳河, 徳田恵一, 北村 正, “決定木に基づく音素コンテキス ト・次元・状態位置の同時クラスタリングによる音響モデリング,” 電子情報通信学会論文誌(D-II), vol.87-D-II, no.8, pp.1593–1602, Aug. 2004.

  37. 吉村貴克, 徳田恵一, 益子貴史, 小林隆夫, 北村正, “HMMに基 づくテキスト音声合成への混合励振源モデルとポストフィルタの導入,” 電子情報通信学会論文誌(D-II), vol.87-D-II, no.8, pp.1565–1571, Aug. 2004.

  38. 酒向慎司, 宮島千代美, 徳田恵一, 北村 正, “隠れマルコフモ デルに基づいた歌声合成システム,” 情報処理学会論文誌, vol.45, no.3, pp.719-727, Mar. 2004.

  39. Toru Takahashi, Keiichi Tokuda, Takao Kobayashi, Tadashi Kitamura, “Mixture density models based on mel-cepstral representation of Gaussian process,” IEICE Trans. Fundamentals, vol.E86-A, no.8, pp.1971–1978, Aug. 2003.

  40. Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda and Takao Kobayashi, “A training method of average voice model for HMM-based speech synthesis,” IEICE Trans. Fundamentals of Electronics, Communications and Computer Sciences, E86-A, vol.8, pp.1956-1963 August 2003.

  41. Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, “A context clustering technique for average voice models,” IEICE Trans. Inf. & Syst., vol.E86-D, no.3, pp.534-542, Mar. 2003.

  42. 南角吉彦, 徳田恵一, 北村 正, 小林隆夫, “隠れマルコフモデ ルを用いた視覚音声認識のための正規化学習,” 電子情報通信学会論 文誌(D-II), vol.J86-D-II, no.2, pp.163–172, Feb. 2003.

  43. 益子貴史, 小林隆夫, 徳田恵一, “HMMに基づいた極低ビットレー ト音声符号化における不特定話者への対応,” 電子情報通信学会論文 誌(D-II), vol.J85-D-II, no.12, pp.1749–1759, Dec. 2002.

  44. 佐藤隆之, 益子貴史, 小林隆夫, 徳田恵一, “話者照合における HMM音声合成による合成音声の判別, ” 情報処理学会論文誌, vol.43, no.7, pp.2197–2204, July 2002.

  45. 川本真一, 下平 博, 新田恒雄, 西本卓也, 中村 哲, 伊藤克亘, 森島繁生, 四倉達夫, 甲斐充彦, 李 晃伸, 山下洋一, 小林隆夫, 徳 田恵一, 広瀬啓吉, 峯松信明, 山田 篤, 伝 康晴, 宇津呂武仁, 嵯峨 山茂樹, “カスタマイズ性を考慮した擬人化音声対話ソフトウェアツー ルキットの設計,” 情報処理学会論文誌, vol.43, no.7, pp.2249–2263, July 2002.

  46. 酒向慎司, 益子貴史, 徳田恵一, 小林隆夫, 北村 正, “HMMに基 づいた視聴覚テキスト音声合成 —画像ベースアプローチ,” 情報処 理学会論文誌, vol.43, no.7, pp.2169-2176, July 2002.

  47. 勝股 充, 徳田恵一, 北村正, “単調連続2次元DPアルゴリズム の階層化,” 電子情報通信学会論文誌, vol.J85-D-II, no.9, pp.1382–1391, Sep. 2002.

  48. 高橋徹, 徳田恵一, 小林隆夫, 北村正, “スペクトル分析のため の尺度を用いたメルケプストラム係数のベクトル量子化,” 電子情 報通信学会論文誌, vol.J85-D-II, no.8, pp.1273–1283, Aug. 2002.

  49. Keiichi Tokuda, Takashi Masuko, Noboru Miyazaki, Takao Kobayashi, “Multi-space probability distribution HMM (Invited paper),” IEICE Trans. Information and Systems, vol.E85-D, no.3, pp.455-464, Mar. 2002 (translation of 55).

  50. 田村正統,益子貴史, 徳田恵一, 小林隆夫,“HMMに基づく音声 合成におけるピッチ・スペクトルの話者適応,” 電子情報通信学会論 文誌, vol.J85-D-II, no.4, pp.545-553, Apr. 2002.

  51. Kazuhito Koishida, Keiichi Tokuda, Takashi Masuko and Takao Kobayashi, “Vector quantization of speech spectral parameters using statistics of static and dynamic features,” IEICE Trans. Inf. & Syst., vol.E84-D, no.10, pp.1427–1434, Oct. 2001.

  52. Chiyomi Miyajima, Yohsuke Hattori, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi and Tadashi Kitamura, “Text-independent speaker identification using Gaussian Mixture models based on multi-space probability distribution,” IEICE Trans. Inf. & Syst., vol.E84-D, no.7, pp.847-855, July 2001.

  53. Chiyomi Miyajima, Hideyuki Watanabe, Keiichi Tokuda, Tadashi Kitamura, Shigeru Katagiri, ”A New Approach to Designing a Feature Extractor in Speaker Identification Based on Discriminative Feature Extraction,” Speech Communication, vol.35, no.3-4, pp.203–218, Oct. 2001.

  54. 益子貴史, 田村 正統, 小林隆夫, 徳田恵一, “HMMに基づく音声 合成システムにおけるMAP-VFSを用いた声質変換,” 電子情報通信学 会論文誌(D-II), vol.J83-D-II, no.12. pp.2509–2516, Dec. 2000.

  55. Junibakti Sanubari and Keiichi Tokuda, “RLS-type two dimensional adaptive filter with a t-distribution assumption,” Signal Processing, vol.80, no.12, pp.2483–2495, Nov. 2000.

  56. 益子貴史, 小林隆夫, 徳田恵一, “話者照合システムに対する合 成音声による詐称,” 電子情報通信学会論文誌(D-II), vol.J83-D-II, no.11, pp.2283–2290, Nov. 2000.

  57. 吉村貴克, 徳田恵一, 益子貴史, 小林隆夫, 北村 正, “HMMに基 づく音声合成におけるスペクトル・ピッチ・継続長の同時モデル化,” 電子情報通信学会論文誌(D-II), vol.J83-D-II, no.11, pp.2099–2107, Nov. 2000.

  58. Oscar Vanegas, Keiichi Tokuda, Tadashi Kitamura, “Lip location normalized training for visual speech recognition,” IEICE Trans. Inf. & Syst., vol.E83-D, no.11, pp.1969–1977, Nov. 2000.

  59. Junibakti Sanubari and Keiichi Tokuda, “A new robust two dimensional spectral estimation based on an AR model excited by a t-distribution process and its QR-decomposition recursive algorithm,” Journal of Circuits, Systems, and Computers, vol.9, nos.1–2, pp.51–66, Jan. 1999.

  60. 益子貴史, 徳田恵一, 宮崎 昇, 小林隆夫, “多空間確率分布HMM によるピッチパタン生成,” 電子情報通信学会論文誌(D-II), vol.J83-D-II, no.7, pp.1600–1609, July 2000.

  61. 徳田恵一, 益子貴史, 宮崎 昇, 小林隆夫, “多空間上の確率分布に基づいたHMM,” 電子情報通信学会論文誌 (D-II), vol.J83-D-II, no.7, pp.1579–1589, July 2000.

  62. Kazuhito Koishida, G. Hirabayashi, Keiichi Tokuda, and Takao Kobayashi, “A 16kbit/s wideband CELP-based speech coder using mel-generalized cepstral analysis,” IEICE Trans. Inf. & Syst., vol.E83-D, no.4, pp.876–883, Apr. 2000.

  63. Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi and K. Kitamura, “Speaker interpolation for HMM-based speech synthesis system,” The Journal of the Acoustical Society of Japan (E), vol.21, no.4, pp.199–206, Apr. 2000.

  64. 若子武士, 徳田恵一, 益子貴史, 小林隆夫, 北村正, “対数スペ クトルの任意基底関数による展開に基づく音声のスペクトル推定,” 電子情報通信学会論文誌(D-II), vol.J82-D-II, no.12, pp.2203–2211, Dec. 1999.

  65. 広井順, 徳田恵一, 益子貴史, 小林隆夫, 北村正, “HMMに基づ いた極低ビットレート音声符号化,”電子情報通信学会論文誌(D-II), vol.J82-D-II, no.11, pp.1857-1864, Nov. 1999.

  66. Fernando G. Resende, Paulo S. R. Diniz, Keiichi Tokuda, Mineo Kaneko and Akinori Nishihara, “New adaptive algorithms based on multi-band decomposition of the error signal,” IEEE Trans. Circuits and Synstems II, vol.45, no.5, pp.592–599, May. 1998.

  67. 小石田 和人, 徳田 恵一, 小林 隆夫, 今井 聖, “メル一般化 ケプストラム分析に基づくCELP符号化,”電子情報通信学会論文誌 (A), vol.J81-A, no.2, pp.252–260, Feb. 1998.

  68. 小石田和人, 徳田恵一, 小林隆夫, 今井 聖, “メル一般化ケプ ストラム係数に基づく音声のスペクトル表現とその諸特性,,”電子情 報通信学会論文誌(A), vol.J80-A, no.11, pp.1999–2006, Nov. 1997.

  69. Fernando G. Resende, Paulo S. R. Diniz, Keiichi Tokuda, Mineo Kaneko and Akinori Nishihara, “LMS-based algorithms with multi-band decomposition of the esimation error applied to system identification” Trans. IEICE, vol.E80-A, no.8, pp.1376–1383, Aug. 1997.

  70. 徳田恵一, 益子貴史, 小林隆夫, 今井 聖, “動的特徴を用いた HMMからの音声パラメータ生成アルゴリズム,”日本音響学会誌, vol.53, no.3, pp.192–200, Mar. 1997.

  71. Fernando G. Resende, Keiichi Tokuda, Mineo Kaneko and Akinori Nishihara, “Multi-band decomposition of the linear prediction error applied to adaptive AR spectral estimation,” Trans. IEICE, vol.E80-A, no.2, pp.365–376, Feb. 1997.

  72. 益子貴史, 徳田恵一, 小林隆夫, 今井 聖, “動的特徴を用いた HMMに基づく音声合成,”電子情報通信学会論文誌(D-II), vol.J79-D-II, 12, pp.2184–2190, Dec. 1996.

  73. Fernando G. Resende, Keiichi Tokuda and Mineo Kaneko, “Adaptive AR spectrum estimation based on wavelet decomposition of linear prediction error,” Trans. IEICE, vol.E79-A, no.5, pp.665–673, May 1996.

  74. Keiichi Tokuda, Takao Kobayashi and Satoshi Imai, “Adaptive cepstral analysis of speech,” IEEE Trans. Speech and Audio Proces., vol.SA-3, no.6, pp.481–489, Nov. 1995.

  75. 内藤 幸宏, 徳田 恵一, 金子 峰雄, “非一様解像度フィルタによ る適応アルゴリズム,”電子情報通信学会論文誌(A), vol.J78-A, no.9, pp.1092–1102, Sep. 1995.

  76. 徳田 恵一, 小林 隆夫, 深田 俊明, 今井 聖, “適応メルケプス トラム分析を利用した音声の符号化とその評価,”電子情報通信学会 論文誌(A), vol.J77-A, no.11, pp.1443–1452, Nov. 1994.

  77. Junibakti Sanubari, Keiichi Tokuda, Mahoki Onoda, “Time series analysis based on exponential model excited by t-distribution process and its algorithm,” IEICE Trans., vol.E76-A, no.5, pp.808–819, May 1993.

  78. Junibakti Sanubari, Keiichi Tokuda, Mahoki Onoda, “Speech analysis based on AR model driven by t-distribution process,” IEICE Trans., vol.E75-A, no.9, pp.1159–1169, Sep. 1992.

  79. 徳田恵一, 小林隆夫, 千葉健司, 今井 聖, “メル一般化ケプス トラム分析による音声のスペクトル推定,”電子情報通信学会論文誌 (A), vol.J75-A, no.7, pp.1124–1134, July 1992.

  80. Takao Kobayashi, Kazuyoshi Fukushi, Keiichi Tokuda and Satoshi Imai, “2-D LMA filters —design of stable two-dimensional digital filters,” IEICE Trans., vol.E75-A, no.2, pp.240–246, Feb. 1992.

  81. 徳田恵一, 小林隆夫, 深田俊明, 今井 聖, “音声の適応メルケ プストラム分析,”電子情報通信学会論文誌(A), vol.J74-A, no.8, pp.1249–1256, Aug. 1991.

  82. 徳田恵一, 小林隆夫, 深田俊明, 斎藤博徳, 今井 聖, “メルケ プストラムをパラメータとする音声のスペクトル推定,”電子情報通 信学会論文誌(A), vol.J74-A, no.8, pp.1240–1248, Aug. 1991.

  83. 徳田恵一, 小林隆夫, 塩本祥司, 今井 聖, “適応ケプストラム 分析 —ケプストラムを係数とする適応フィルタ—,”電子情報通信 学会論文誌(A), vol.J73-A, no.7, pp.1207–1215, July 1990.

  84. 徳田恵一, 小林隆夫, 山本竜太郎, 今井聖, “一般化ケプストラ ムをパラメータとする音声のスペクトル推定,”電子情報通信学会論 文誌(A), vol.J72-A, no.3, pp.457–465, Mar. 1989.

  85. 徳田恵一, 小林隆夫, 徳田篤洋, 今井 聖, “最大・最小位相分 離によるディジタルフィルタの振幅・位相同時近似,”電子情報通信 学会論文誌(A), vol.J71-A, no.2, pp.260–267, Feb. 1988.

  86. 徳田恵一, 小林隆夫, 今井 聖, “スペクトル包絡抽出のための 非一様スペクトル荷重によるケプストラム分析,”電子情報通信学会 論文誌(A), vol.J70-A, no.6, pp.952–959, June 1987.