* PUBLICATIONS - 2010 [#n782a49e] #contents ** Journal [#da90e548] +Keiichi Tokuda, “Speech synthesis as a machine learning problem," The International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques(Oriental COCOSDA 2010), Kathmandu, Nepal, 2010 (keynote). (without proceeding paper) +Junichi Yamagishi, Bela Usabaev, Simon King, Oliver Watts, John Dines, Jilei Tian, Yong Guan, Rile Hu, Keiichiro Oura, Yi-Jian Wu, Keiichi Tokuda, Reima Karhila, and Mikko Kurimo “Thousands of voices for HMM-based speech synthesis-analysis and application of TTS systems built on various ASR corpora," IEEE Transactions on Audio, Speech, and Language Processing, vol.18, no.5, pp.984-1004, 2010. (Full paper peer reviewed) +徳田恵一,大浦圭一郎, “声質・歌い方を自動で学習・再現できる新しい歌声合成システム~ Sinsy~" DTM MAGAZINE. vol.191, pp.96-97, 2010.(解説論文) //英語タイトルなし +Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda, “A covariance- tying technique for HMM-based speech synthesis," IEICE Transactions on Information and Systems, vol.E93-D, no.3, pp.595-601, 2010. (Full paper peer reviewed) +大浦圭一郎, 全炳河, 酒向慎司, 徳田恵一, “HTS を用いた音声合成システムの構築," ヒュー マンインタフェース学会誌, vol.12, no.1, pp.35-40, 2010.(解説論文) //Keiichiro Oura, Heiga Zen, Shinji Sako, and Keiichi Tokuda, 〝Speech/Sound based Human Interfaces (1) Construction of Speech Synthesis Systems using HTS" Journal of Human Interface Society : human interface 12(1), 35-40, 2010. (Tutorial paper) +河井恒, 徳田恵一, “多言語音声の合成," 電気学会誌, vol.130, no.1, pp.16-19, 2010.(解説論文) //英語タイトル不詳 + 寺嶌立太, 全炳河, 南角吉彦, 徳田恵一, ``フレーム単位のコンテキスト依存構造に基づく音声認識のための音響モデル,'' 電気学会論文集C (電子・情報・システム部門誌), vol. 130, no. 10, pp. 1856-1864, 2010. //Ryuta Terashima, Heiga Zen, Yoshihiko Nankaku, and Keiichi Tokuda, ``A frame-based context-dependent acoustic modeling for speech recognition,'' IEEJ Transactions on Electronics, Information and Systems, vol. 130, no. 10, pp. 1856-1864, 2010. + 寺嶌立太, 吉村貴克, 脇田敏裕, 徳田恵一, 北村正, ``HMM音声合成に基づく音声認識率予測手法,'' 電気学会論文誌C (電子・情報・システム部門誌), vol. 130, no. 4, pp. 557-564, 2010. //Ryuta Terashima, Takayoshi Yoshimura, Toshihiro Wakita, Keiichi Tokuda, and Tadashi Kitamura, ``Prediction method of speech recognition performance based on HMM-based speech synthesis technique,'' IEEJ Transactions on Electronics, Information and Systems, vol. 130, no. 4, pp. 557-564, 2010. ** International Conference [#vb6f4953] +%%%Xianglin Peng%%%, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, “Cross-lingual speaker adaptation for HMM-based speech synthesis considering differences between language-dependent average voices,” Proc. of IEEE 10th International Conference on Signal Processing, pp.605-608, Beijing China, 2010. (Full paper peer reviewed) +%%%Akira Saito%%%, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, “Voice activity detection based on conditional random fields using multiple features,” Interspeech 2010, pp.2086- 2089, Chiba Japan, 2010. (Full paper peer reviewed) +%%%Ayami Mase%%%, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, “HMM-based singing voice synthesis system using pitch-shifted pseudo training data,” Interspeech 2010, pp.845- 848, Chiba Japan, 2010. (Full paper peer reviewed) +%%%Toyohiro Hayashi%%%, Yoshihiko Nankaku, Akinobu Lee and Keiichi Tokuda, “Speaker Adaptation Based on Nonlinear Spectral Transform for Speech Recognition,” Interspeech 2010, pp.542-545, Chiba Japan, 2010. (Full paper peer reviewed) +%%%Keiichiro Oura%%%, Kei Hashimoto, Sayaka Shiota, and Keiichi Tokuda, “Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2010,” Blizzard Challenge 2010 Workshop, Kyoto, Japan, 2010. (web proceedings) +%%%Keiichiro Oura%%%, Ayami Mase, Tomohiko Yamada, Satoru Muto, Yoshihiko Nankaku, and Keiichi Tokuda, “Recent development of the HMM-based singing voice synthesis system – Sinsy,” Proc. of 7th ISCA Speech Synthesis Workshop, pp.211-216, Kyoto, Japan, 2010. (Full paper peer reviewed) +%%%Mirjam Wester%%%, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimaki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, and Junichi Yamagishi, “Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project,” Proc. of 7th ISCA Speech Synthesis Workshop, pp.192-197, Kyoto, Japan, 2010. (Full paper peer reviewed) +%%%Kei Hashimoto%%%, Yoshihiko Nankaku, and Keiichi Tokuda,“Bayesian speech synthesis framework integrating training and synthesis processes,” Proc. of 7th ISCA Speech Synthesis Workshop, pp.106-111, Kyoto, Japan, 2010. (Full paper peer reviewed) +%%%Shinji Takaki%%%, Yoshihiko Nankaku, and Keiichi Tokuda, “Spectral modeling with contextual additive structure for HMM-based speech synthesis,” Proc. of 7th ISCA Speech Synthesis Workshop, pp.100-105, Kyoto, Japan, 2010. (Full paper peer reviewed) +%%%Akira Tamamori%%%, Yoshihiko Nankaku, and Keiichi Tokuda, “An extension of separable lattice 2- D HMMs for rotational data variations,” 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.2206–2209, Dallas, Texas, U.S.A., 2010. (Full paper peer reviewed) +%%%Keiichiro Oura%%%, Keiichi Tokuda, Junichi Yamagishi, Simon King, and Mirjam Wester, “Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis,” 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.4594–4597, Dallas, Texas, U.S.A., 2010. (Full paper peer reviewed) +%%%Heiga Zen%%%, Mark Gales, Yoshihiko Nankaku, and Keiichi Tokuda, “Statistical parametric speech synthesis based on product of experts,” 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.4242–4245, Dallas, Texas, U.S.A., 2010. (Full paper peer reviewed) +%%%Kyosuke Kazumi%%%, Yoshihiko Nankaku, and Keiichi Tokuda, “Factor analyzed voice models for HMM-based speech synthesis,” 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.4234–4237, Dallas, Texas, U.S.A., 2010. (Full paper peer reviewed) +%%%Yoshiaki Takahashi%%%, Akira Tamamori, Yoshihiko Nankaku, and Keiichi Tokuda, “Face recognition based on separable lattice 2-D HMM with state duration modeling,” 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.2162–2165, Dallas, Texas, U.S.A., 2010. (Full paper peer reviewed) ** Workshop [#m11f486e] +%%%大浦圭一郎%%%, 間瀬絢美, 山田知彦, 徳田恵一, 後藤真孝, ``Sinsy: 「あの人に歌ってほしい」をかなえるHMM歌声合成システム,'' 第86回音楽情報科学研究会(SIGMUS), Vol. 2010-MUS-86, No. 1, pp. 1-8, 茨城, 日本, 2010. //%%%Keiichiro Oura%%%, Ayami Mase, Tomohiko Yamada, Keiichi Tokuda, and Masataka Goto, ``Sinsy — An HMM-based singing voice synthesis system which can realize your wish “I want this person to sing my song",'' SIGMUS, Vol. 2010-MUS-86, No. 1, pp. 1-8, Ibaraki, Japan, 2010. +%%%熊木慶介%%%, 南角吉彦, 徳田恵一, ``拡張分離型格子HMMに基づく顔画像認識,'' パターン認識・メディア理解研究会(PRMU), vol. 110, no. 97, PRMU2010-46, pp. 45-50, 青森, 日本, 2010. //%%%Keisuke Kumaki%%%, Yoshihiko Nankaku, and Keiichi Tokuda, ``Face recognition based on extended separable lattice HMMs,'' PRMU, vol. 110, no. 97, PRMU2010-46, pp. 45-50, Aomori, Japan, 2010. ** National Convention [#lc2ab40e] + %%%山田誠%%%, 西澤信行, 加藤恒夫, 大浦圭一郎, 徳田恵一, ``遅い話速の音響モデルを用いた話速制御合成音声の主観評価,'' 日本音響学会2010年秋季研究発表会, pp. 315-316, 大阪, 日本, 2010. //%%%Makoto Yamada%%%, Nobuyuki Nishizawa, Tsuneo Kato, Keiichiro Oura, and Keiichi Tokuda, `` Subjective evaluations for speech synthesizers trained only from slow speech sound,'' Acoustical Society of Japan 2010 Autumn Meeting , pp. 315-316, Osaka, Japan, 2010. + %%%橋本佳%%%, 南角吉彦, 徳田恵一, ``学習・合成過程が統合されたベイズ音声合成,'' 日本音響学会2010年秋季研究発表会, pp. 243-244, 大阪, 日本, 2010. //%%%Kei Hashimoto%%%, Yoshihiko Nankaku, and Keiichi Tokuda, ``Bayesian speech synthesis integrating training and synthesis processes,'' Acoustical Society of Japan 2010 Autumn Meeting , pp. 243-244, Osaka, Japan, 2010. + %%%高木信二%%%, 大浦圭一郎, 南角吉彦, 徳田 恵一, ``HMM音声合成における平均・分散パラメータ共有に関する検討,'' 日本音響学会2010年秋季研究発表会, pp. 241-242, 大阪, 日本, 2010. //%%%Shinji Takaki%%%, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``An examination of tied-structure in mean and variance parameters for HMM-based speech synthesis,'' Acoustical Society of Japan 2010 Autumn Meeting , pp. 241-242, Osaka, Japan, 2010. + %%%伊藤直晃%%%, 南角吉彦, 李晃伸, 徳田恵一, ``百万超語彙の連続音声認識におけるツリートレリス探索法の分析および評価,'' 日本音響学会2010年秋季研究発表会, pp. 155-156, 大阪, 日本, 2010. //%%%Naoaki Ito%%%, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, ``Analysis and evaluation of tree-trellis algorithm in over million vocabulary CSR,'' Acoustical Society of Japan 2010 Autumn Meeting , pp. 241-242, Osaka, Japan, 2010. + %%%花園正也%%%, 渡辺英樹, 西山高史, 徳田恵一, ``HMMに基づく感情音声合成のための収録テキスト設計とコーパス構築,'' システム制御情報学会第54回研究発表講演会, 京都, 日本, 2010. ///英語タイトル不明 + %%%加藤杏樹%%%, 南角吉彦, 李晃伸, 徳田恵一, ``音声対話システムのための複数キーワードを制約とするスポッティングアルゴリズム,'' 日本音響学会2010年春季研究発表会, pp. 141-142, 東京, 日本, 2010. //%%%Aki Kato%%%, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, ``Spotting algorithm constrained by keyword co-occurance for spoken dialogue system,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 141-142, Tokyo, Japan, 2010. + %%%吉見孔孝%%%, 南角吉彦, 李晃伸, 徳田恵一, ``音声対話システムのためのタスク非依存言語モデルを用いたキーワードからの質問文生成,'' 日本音響学会2010年春季研究発表会, pp. 139-140, 東京, 日本, 2010. //%%%Yoshitaka Yoshimi%%%, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, ``Question sentence generation from keywords using task independent language model for spoken dialog system,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 139-140, Tokyo, Japan, 2010. + %%%武藤聡%%%, 大浦圭一郎, 南角吉彦, 徳田恵一, ``HMM歌声合成における話者適応および楽譜情報を用いたモデル学習高速化,'' 日本音響学会2010年春季研究発表会, pp. 347-348, 東京, 日本, 2010. //%%%Satoru Muto%%%, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Reducing computational cost of training for HMM-based singing voice synthesis using note boundaries,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 347-348, Tokyo, Japan, 2010. + %%%間瀬絢美%%%, 大浦圭一郎, 南角吉彦, 徳田恵一, ``音高シフトによる疑似学習データを用いたHMM歌声合成の高精度化,'' 日本音響学会2010年春季研究発表会, pp. 345-346, 東京, 日本, 2010. //%%%Ayami Mase%%%, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``HMM-based singing voice synthesis system trained by using pitch-shifted pseudo data,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 345-346, Tokyo, Japan, 2010. + %%%大浦圭一郎%%%, 酒向慎司, 徳田恵一, ``日本語テキスト音声合成システムOpen JTalk,'' 日本音響学会2010年春季研究発表会, pp. 343-344, 東京, 日本, 2010. //%%%Keiichiro Oura%%%, Shinji Sako, and Keiichi Tokuda, ``Japanese Text-to-Speech system: Open JTalk,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 343-344, Tokyo, Japan, 2010. + %%%高木信二%%%, 南角吉彦, 徳田恵一, ``HMM音声合成のためのコンテキストの加算的構造に基づくスペクトルモデリング,'' 日本音響学会2010年春季研究発表会, pp. 335-338, 東京, 日本, 2010. //%%%Shinji Takaki%%%, Yoshihiko Nankaku, and Keiichi Tokuda, ``Spectral modeling with contextual additive structure for HMM-based synthesis,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 335-338, Tokyo, Japan, 2010. + %%%鹿住恭介%%%, 南角吉彦, 徳田恵一, ``多様な声質を表現するための因子分析に基づくHMM音声合成,'' 日本音響学会2010年春季研究発表会, pp. 331-334, 東京, 日本, 2010. //%%%Kyosuke Kazum%%%, Yoshihiko Nankaku, and Keiichi Tokuda, ``Factor analyzed acoustic models representing various voice characteristics for HMM-based speech synthesis,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 331-334, Tokyo, Japan, 2010. + %%%大野博之%%%, 南角吉彦, 李晃伸, 徳田恵一, ``音声認識における発話終了前確定のアルゴリズムの評価および改善,'' 日本音響学会2010年春季研究発表会, pp. 67-68, 東京, 日本, 2010. //%%%Hiroyuki Ono%%%, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, ``Evaluation and improvement of algorithm for hypothesis determination before end of speech for speech recognition,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 67-68, Tokyo, Japan, 2010. + %%%林豊大%%%, 南角吉彦, 李晃伸, 徳田恵一, ``音声認識のためのスペクトル変換を統合した音響モデルに基づく話者適応,'' 日本音響学会2010年春季研究発表会, pp. 155-158, 東京, 日本, 2010. //%%%Toyohiro Hayashi%%%, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, ``Speaker adaptation based on acoustic model combining spectral transform for speech recognition,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 155-158, Tokyo, Japan, 2010. + %%%山田誠%%%, 西澤信行, 加藤恒夫, 大浦圭一郎, 徳田 恵一, ``HMM音声合成における話速制御手法の評価,'' 日本音響学会2010年春季研究発表会, pp. 405-406, 東京, 日本, 2010. //%%%Makoto Yamada%%%, Nobuyuki Nishizawa, Tsuneo Kato, Keiichiro Oura, and Keiichi Tokuda, `` Subjective evaluations of speech rate control methods for HMM-based speech synthesis,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 405-406, Tokyo, Japan, 2010. + %%%彭湘琳%%%, 大浦圭一郎, 南角吉彦, 徳田恵一, ``言語依存平均声の差異を考慮したクロスリンガル話者適応,'' 日本音響学会2010年春季研究発表会, pp. 325-326, 東京, 日本, 2010. //%%%Peng Xianglin%%%, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, ``Cross-lingual speaker adaptation considering differences between language-dependent average voices,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 325-326, Tokyo, Japan, 2010. + %%%李蕾%%%, 油谷かおり, 南角吉彦, 徳田恵一, ``複数のモデル構造を用いたGMMに基づく声質変換,'' 日本音響学会2010年春季研究発表会, pp. 323-324, 東京, 日本, 2010. //%%%Lei Li%%%, Kaori Yutani, Yoshihiko Nankaku, and Keiichi Tokuda, ``Voice conversion based on GMM using multiple model structure,'' Acoustical Society of Japan 2010 Spring Meeting , pp. 323-324, Tokyo, Japan, 2010. ** Master's and Bachelor's Theses [#o55d521e] + %%%加藤杏樹%%%, ``音声対話システムのための複数キーワードの共起制約に基づくスポッティングアルゴリズム,'' 卒業論文, 名古屋工業大学, 2010. //%%%Aki Kato%%%, ``Spotting algorithm constrained by keyword co-occurance for spoken dialogue systems,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%彭湘琳%%%, ``音声翻訳システムのための話者適応手法の改善に関する検討,'' 卒業論文, 名古屋工業大学, 2010. //%%%Xianglin Peng%%%, ``Improvements of speaker adaptation for speech-to-speech translation system,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%間瀬絢美%%%, ``音高シフトによる疑似学習データを用いたHMM歌声合成の高精度化,'' 卒業論文, 名古屋工業大学, 2010. //%%%Ayami Mase%%%, ``An HMM-based singing voice synthesis system trained by using pitch-shifted pseudo data,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%笠松幹郎%%%, ``HMM音声合成におけるモデルパラメータの共有構造に関する検討,'' 卒業論文, 名古屋工業大学, 2010. //%%%Mikio Kasamatsu%%%, ``Investigation of parameter tying structures for HMM-based speech synthesis,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%李蕾%%%, ``複数のモデル構造を用いたGMMに基づく声質変換,'' 卒業論文, 名古屋工業大学, 2010. //%%%Lei Li%%%, ``Voice conversion based on GMM using multiple model structures,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%熊木慶介%%%, ``拡張分離型格子HMMに基づく顔画像認識,'' 卒業論文, 名古屋工業大学, 2010. //%%%Keisuke Kumaki%%%, ``Face recognition based on extended separable lattice HMMs,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%青山峻己%%%, ``スペクトル・状態継続長の同時モデル化による話者認識,'' 卒業論文, 名古屋工業大学, 2010. //%%%Shunki Aoyama%%%, ``Speaker identification using spectrum and duration models,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%澤田俊彦%%%, ``HMM音声認識におけるパラメータ共有構造に関する検討,'' 卒業論文, 名古屋工業大学, 2010. //%%%Toshihiko Sawada%%%, ``An examination of tied-structures for HMM-based speech recognition,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%大野博之%%%, ``単語辞書の共有構造に基づく早期確定法の評価および改善,'' 卒業論文, 名古屋工業大学, 2010. //%%%Hiroyuki Ono%%%, ``Evaluation and improvements of rapid determination based on sharing structure of word pronunciation dictionary,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%平井良佑%%%, ``対話的コンテンツのための時系列状態制御を用いた音声情報案内システム,'' 卒業論文, 名古屋工業大学, 2010. //%%%Ryosuke Hirai%%%, ``A spoken dialog system with time-course state control for interactive contents,'' Bachelor thesis, Nagoya institute of technology, 2010. + %%%玉森聡%%%, ``回転変動データのための分離型格子2次元HMMの拡張,'' 修士論文, 名古屋工業大学, 2010. //%%%Akira Tamamori%%%, ``An extension of separable lattice 2-D HMMs for rotational data variations,'' Master thesis, Nagoya institute of technology, 2010. + %%%吉見孔孝%%%, ``音声対話システムにおける確率モデルを用いた応答選択および質問文生成,'' 修士論文, 名古屋工業大学, 2010. //%%%Yoshitaka Yoshimi%%%, ``Answer selection and question sentence generation using statistical model in spoken dialog system,'' Master thesis, Nagoya institute of technology, 2010. + %%%小島弘%%%, ``木構造化辞書の単語間非共有部の尤度に基づく認識結果の早期確定,'' 修士論文, 名古屋工業大学, 2010. //%%%Hiroshi Kojima%%%, ``Rapid hypothesis determination using unshared nodes in tree lexicon for speech recognition,'' Master thesis, Nagoya institute of technology, 2010. + %%%伊藤達也%%%, ``変分ベイズ法による話者認識のための事前分布推定,'' 修士論文, 名古屋工業大学, 2010. //%%%Tatsuya Ito%%%, ``Hyperparameter estimation for speaker recognition based on variational bayesian method,'' Master thesis, Nagoya institute of technology, 2010. + %%%油谷かおり%%%, ``声質変換のためのスペクトル・継続長・F0の同時変換,'' 修士論文, 名古屋工業大学, 2010. //%%%Kaori Yutani%%%, ``Simultaneous conversion of spectrum, duration and F0 for voice conversion,'' Master thesis, Nagoya institute of technology, 2010. + %%%武藤聡%%%, ``HMM歌声合成における話者適応および楽譜情報を用いたモデル学習高速化,'' 修士論文, 名古屋工業大学, 2010. //%%%Satoru Muto%%%, ``Reducing computational cost of training for HMM-based singing voice synthesis using note boundaries,'' Master thesis, Nagoya institute of technology, 2010. ** Past Publications [#d97906db] #ls2(HOME/PUBLICATIONS,reverse);