- Shinji Takaki, Kei Sawada, Kei Hashimoto, Keiichiro Oura,
Keiichi Tokuda, ``Overview of NIT HMM-based speech synthesis
system for Blizzard Challenge 2012,'' Blizzard Challenge 2012
Workshop, Portland, USA, September 14, 2012 (web proceedings).
- Takafumi Hattori, Kei Hashimoto, Yoshihiko Nankaku,
Keiichi Tokuda, ``A Bayesian approach to speaker recognition
based on GMMs using multiple model structures,'' Interspeech
2012, Portland, USA, September 9-13, 2012.
- Viviane de Franca Oliveira, Sayaka Shiota, Yoshihiko
Nankaku, Keiichi Tokuda, ``Cross-lingual speaker adaptation
for HMM-based speech synthesis based on perceptual
characteristics and spaker interpolation,'' Interspeech
2012, Portland, USA, September 9-13, 2012.
- Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko
Nankaku, Keiichi Tokuda, ``Face recognition based on
separable lattice 2-D HMMs using variational Bayesian
method,'' 2012 IEEE International Conference on Acoustics,
Speech, and Signal Processing (ICASSP 2012), pp.2205-2208,
Kyoto, Japan, March 25-30, 2012.
- Keisuke Kumaki, Yoshihiko Nankaku, Keiichi Tokuda, ``Face
recognition based on separable lattice 2-D HMMs using
variational Bayesian method,'' 2012 IEEE International
Conference on Acoustics, Speech, and Signal Processing
(ICASSP 2012), pp.2209-2212, Kyoto, Japan, March 25-30,
2012.
- Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi
Tokuda, ``A model structure integration based on a Byesian
framework for speech recognition,'' 2012 IEEE International
Conference on Acoustics, Speech, and Signal Processing
(ICASSP 2012), pp.4813-4816, Kyoto, Japan, March 25-30,
2012.
- Keiichiro Oura, Ayami Mase, Yoshihiko Nankaku, Keiichi
Tokuda, ``Pitch adaptive training for HMM-based singing voice
synthesis,'' 2012 IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP 2012),
pp.5377-5380, Kyoto, Japan, March 25-30, 2012.
- Kei Hashimoto, Shinji Takaki, Keiichiro Oura, Keiichi
Tokuda, ``Overview of NIT HMM-based speech synthesis system
for Blizzard Challenge 2011,'' Blizzard Challenge 2011
Workshop, Turin, Italy, September 2, 2011 (web proceedings).
- Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda,
``Multi-speaker modeling with shared prior distributions and
model structures for Bayesian speech synthesis,'' Interspeech
2011, pp.113-116, Florence, Italy, August 28-31, 2011.
- Minoru Tsuzaki, Keiichi Tokuda, Hisashi Kawai, Jinfu Ni,
``Estimation of perceptual spaces for speaker identities based
on the cross-lingual discrimination task,'' Interspeech 2011,
pp.157-160, Florence, Italy, August 28-31, 2011.
- Lei Li, Yoshihiko Nankaku, Keiichi Tokuda, ``A Bayesian
approach to voice conversion based on GMMs using multiple
model structures,'' Interspeech 2011, pp.661-664, Florence,
Italy, August 28-31, 2011.
- Ulpu Remes, Yoshihiko Nankaku, Keiichi Tokuda, ``GMM-based
missing-feature reconstruction on multi-frame windows,''
Interspeech 2011, pp.1665-1668, Florence, Italy, August
28-31, 2011.
- Ling-Hui Chen, Yoshihiko Nankaku, Heiga Zen, Keiichi
Tokuda, Zhen-Hua Ling, Li-Rong Dai, ``Estimation of window
coefficients for dynamic feature extraction for HMM-based
speech synthesis,'' Interspeech 2011, pp.1801-1804, Florence,
Italy, August 28-31, 2011.
- Tsuneo Kato, Makoto Yamada, Nobuyuki Nishizawa, Keiichiro
Oura, Keiichi Tokuda, ``Large-scale subjective evaluations of
speech rate control methods for HMM-based speech
synthesizers,'' Interspeech 2011, pp.1845-1848, Florence,
Italy, August 28-31, 2011.
- Naoaki Ito, Yoshihiko Nankaku, Akinobu Lee, Keiichi
Tokuda, ``Evaluation of tree-trellis based decoding in
over-million LVCSR,'' Interspeech 2011, pp.1937-1940,
Florence, Italy, August 28-31, 2011.
- Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon
King, and Keiichi Tokuda, ``An analysis of machine translation
and speech synthesis in speech-to-speech translation system,''
2011 IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP 2011), pp.5108-5111, Prague, Czech
Republic, May 22-27, 2011.
- Shinji Takaki, Keiichiro Oura, Yoshihiko Nankaku, and
Keiichi Tokuda, ``An Optimization Algorithm of Independent
Mean and Variance Parameter Tying Structures for HMM-based
speech synthesis,'' 2011 IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP 2011),
pp.5108-5111, Prague, Czech Republic, May 22-27, 2011.
- Shifeng Pan, Yoshihiko Nankaku, Keiichi Tokuda, Jianhua
Tao, ``Global variance modeling on frequency domain delta LSP
for HMM-based speech synthesis,'' 2011 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2011), pp.4716-4719, Prague, Czech Republic, May 22-27,
2011.
- Xianglin Peng, Keiichiro Oura, Yoshihiko Nankaku, Keiichi
Tokuda, ``Cross-Lingual Speaker Adaptation for HMM-Based
Speech Synthesis Considering Differences Between
Language-Dependent Average Voices,'' Proc. of IEEE 10th
International Conference on Signal Processing, pp.605-608,
Beijing China, 24-28 Oct. 2010.
- Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee and
Keiichi Tokuda, ``Speaker Adaptation Based on Nonlinear
Spectral Transform for Speech Recognition,'' Interspeech 2010,
pp.542-545, Chiba Japan, 26-30 Sep. 2010.
- Ayami Mase, Keiichiro Oura, Yoshihiko Nankaku, Keiichi
Tokuda, ``HMM-based singing voice synthesis system using
pitch-shifted pseudo training data,'' Interspeech 2010,
pp.845-848, Chiba Japan, 26-30 Sep. 2010.
- Akira Saito, Yoshihiko Nankaku, Akinobu Lee, Keiichi
Tokuda, ``Voice activity detection based on conditional random
fields using multiple features,'' Interspeech 2010,
pp.2086-2089, Chiba Japan, 26-30 Sep. 2010.
- Keiichiro Oura, Kei Hashimoto, Sayaka Shiota, and Keiichi
Tokuda, ``Overview of NIT HMM-based speech synthesis system
for Blizzard Challenge 2010,'' Blizzard Challenge 2010
Workshop, Kyoto, Japan, September 25, 2010.
- Shinji Takaki, Yoshihiko Nankaku, and Keiichi Tokuda,
``Spectral modeling with contextual additive structure for
HMM-based speech synthesis,'' Proc. of 7th ISCA Speech
Synthesis Workshop, pp.100-105, Kyoto, Japan, Sep. 22-24,
2010.
- Keiichiro Oura, Ayami Mase, Tomohiko Yamada, Satoru Muto,
Yoshihiko Nankaku, and Keiichi Tokuda, ``Recent Development of
the HMM-based Singing Voice Synthesis System - Sinsy,''
Proc. of 7th ISCA Speech Synthesis Workshop, pp.211-216,
Kyoto, Japan, Sep. 22-24, 2010.
- Kei Hashimoto, Yoshihiko Nankaku, and Keiichi
Tokuda,``Bayesian speech synthesis framework integrating
training and synthesis processes,'' Proc. of 7th ISCA Speech
Synthesis Workshop, pp.106-111, Kyoto, Japan, Sep. 22-24,
2010.
- Mirjam Wester, John Dines, Matthew Gibson, Hui Liang,
Yi-Jian Wu, Lakshmi Saheer, Simon King, Keiichiro Oura, Philip
N. Garner, William Byrne, Yong Guan, Teemu Hirsimaki, Reima
Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei
Tian, Keiichi Tokuda, Junichi Yamagishi, ``Speaker adaptation
and the evaluation of speaker similarity in the EMIME
speech-to-speech translation project,'' Proc. of 7th ISCA
Speech Synthesis Workshop, pp.192-197, Kyoto, Japan,
Sep. 22-24, 2010.
- Yoshiaki Takahashi, Akira Tamamori, Yoshihiko Nankaku,
Keiichi Tokuda, ``Face recognition based on separable lattice
2-D HMM with state duration modeling,'' 2010 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2010), pp.2162-2165, Dallas, Texas,
U.S.A., March 14-19, 2010.
- Kyosuke Kazumi, Yoshihiko Nankaku, Keiichi Tokuda,
``Factor analyzed voice models for HMM-based speech
synthesis,'' 2010 IEEE International Conference on Acoustics,
Speech, and Signal Processing (ICASSP 2010), pp.4234-4237,
Dallas, Texas, U.S.A., March 14-19, 2010.
- Akira Tamamori, Yoshihiko Nankaku, Keiichi Tokuda, ``An
extension of separable lattice 2-D HMMs for rotational data
variations,'' 2010 IEEE International Conference on Acoustics,
Speech, and Signal Processing (ICASSP 2010), pp.2206-2209,
Dallas, Texas, U.S.A., March 14-19, 2010.
- Heiga Zen, Mark Gales, Yoshihiko Nankaku, Keiichi Tokuda,
``Statistical parametric speech synthesis based on product of
experts,'' 2010 IEEE International Conference on Acoustics,
Speech, and Signal Processing (ICASSP 2010), pp.4242-4245,
Dallas, Texas, U.S.A., March 14-19, 2010.
- Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon
King, Mirjam Wester, ``Unsupervised cross-lingual speaker
adaptation for HMM-based speech synthesis,'' 2010 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2010), pp.4594-4597, Dallas, Texas,
U.S.A., March 14-19, 2010.
- Junichi Yamagishi, Bela Usabaev, Simon King, Oliver Watts,
John Dines, Jilei Tian, Rile Hu, Yong Guan, Keiichiro Oura,
Keiichi Tokuda, Reima Karhila, Mikko Kurimo ``Thousands of
Voices for HMM-based Speech Synthesis,'' Interspeech 2009,
pp.420-423, Brighton, U.K., September 6-10, 2009.
- Yi-Jian Wu, Yoshihiko Nankaku, Keiichi Tokuda, ``State
mapping based method for cross-lingual speaker adaptation in
HMM-based speech synthesis,'' Interspeech 2009, pp.528-531,
Brighton, U.K., September 6-10, 2009.
- Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi
Tokuda, ``Deterministic Annealing Based Training Algorithm for
Bayesian Speech Recognition,'' Interspeech 2009, pp.680-683,
Brighton, U.K., September 6-10, 2009.
- Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, ``A
Bayesian Approach to Hidden Semi-Markov Model Based Speech
Synthesis,'' Interspeech 2009, pp.1751-1754, Brighton, U.K.,
September 6-10, 2009.
- Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee,
Keiichi Tokuda, ``Tying covariance matrices to reduce the
footprint of HMM-based speech synthesis systems,'' Interspeech
2009, pp.1751-1762, Brighton, U.K., September 6-10, 2009.
- Ranniery Maia, Tomoki Toda, Keiichi Tokuda, Shinsuke
Sakai, Satoshi Nakamura, ``A decision tree-based clustering
approach to state definition in an excitation modeling
framework for HMM-based speech synthesis,'' Interspeech 2009,
pp.1783-1786, Brighton, U.K., September 6-10, 2009.
- Yi-Jian Wu, Long Qin, Keiichi Tokuda, ``An improved
minimum generation error based model adaptation for HMM-based
speech synthesis,'' Interspeech 2009, pp.1787-1790, Brighton,
U.K., September 6-10, 2009.
- Heiga Zen, Keiichiro Oura, Takashi Nose, Junichi
Yamagishi, Shinji Sako, Tomoki Toda, Takashi Masuko, Alan W.
Black, Keiichi Tokuda, ``Recent development of the HMM-based
speech synthesis system (HTS),'' Asia-Pacific Signal and
Information Processing Association 2009 Annual Summit and
Conference (APSIPA ASC 2009), pp.121-130, Sapporo, Japan,
October 4-7, 2009.
- Keiichiro Oura, Yi-Jian Wu , Keiichi Tokuda, ``Overview of
NIT HMM-based speech synthesis system for Blizzard Challenge
2009,'' 2009 Blizzard Challenge Workshop, September 4, 2009
(web proceedings).
- Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro
Sumita, Keiichi Tokuda, ``Reordering model using syntactic
information of a source tree for statistical machine
translation,'' NAACL HLT 2009 Workshop: Third Workshop on
Syntax and Structure in Statistical Translation (SSST-3),
pp.69-77, Boulder, Colorado, June 5, 2009.
- Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Keiichi
Tokuda, ``A Bayesian approach to HMM-based speech synthesis,''
2009 IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP 2009), pp.4029-4032, Taipei,
Taiwan, April 19-24, 2009.
- Yi-Jian Wu, Keiichi Tokuda ``Minimum generation error
training by using original spectrum as reference for log
spectral distortion measure,'' 2009 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2009), pp.4013-4016, Taipei, Taiwan, April 19-24, 2009.
- Kaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Akinobu Lee,
Keiichi Tokuda, ``Voice conversion based on simultaneous
modeling of spectrum and F0,'' 2009 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2009), pp.3897-3900, Taipei, Taiwan, April 19-24, 2009.
Stereo-based stochastic noise compensation based on trajectory
GMMs
- Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda,
``Stereo-based stochastic noise compensation based on
trajectory GMMs,'' 2009 IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP 2009),
pp.4577-4580, Taipei, Taiwan, April 19-24, 2009.
- Lu Heng, Wu Yi-Jian, Tokuda Keiichi, Dai Li-Rong, Wang
Ren-Hua, ``FULL covariance state duration modeling for
HMM-based speech synthesis,'' 2009 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2009), pp.4033-4036, Taipei, Taiwan, April 19-24, 2009.
- Keiichiro Oura, Yoshihiko Nankaku, Tomoki Toda, Keiichi
Tokuda, Rannierry Maia, Shinsuke Sakai, Satoshi Nakamura,
``Simultaneous Acoustic, Prosodic, and Phrasing Model Training
for TTS Conversion Systems,'' International Symposium on
Chinese Spoken Language Processing (ISCSLP2008), SPE1.1,
pp.1-4, Kunming, China, December 16-19, 2008 (Best Student
Paper Award).
- Yi-Jian Wu, Simon King and Keiichi Tokuda, ``Cross-Lingual
Speaker Adaptation for HMM-based Speech Synthesis,''
International Symposium on Chinese Spoken Language Processing
(ISCSLP2008), SPE1.1, pp.9-12, Kunming, China, December
16-19, 2008.
- Zhi-Peng Yu, Yi-Jian Wu, Heiga Zen, Yoshihiko Nankaku,
Keiichi Tokuda, ``Analysis of stream-dependent tying structure
for HMM-based speech synthesis,'' International Conference on
Signal Processing (ICSP'08), pp.655-658, Beijing, China,
October 26-29, 2008.
- Junichi Yamagishi, Heiga Zen, Yi-Jian Wu, Tomoki Toda,
Keiichi Tokuda, ``HTS-2008: Yet another evaluation of speaker
adaptive HMM-based speech synthesis system,'' Blizzard
Challenge Workshop 2008, Brisbane, Australia, September 2008
(web proceedings).
- Ranniery Maia, Jinfu Ni, Shinsuke Sakai, Tomoki Toda,
Keiichi TokudaTohru Shimizu, Satoshi Nakamura, ``The NICT/ATR
speech synthesis system for the Blizzard Challenge 2008'',
Blizzard Challenge Workshop 2008, Brisbane, Australia,
September 2008 (web proceedings).
- Yoshitaka Yoshimi, Ryota Kakitsuba, Yoshihiko Nankaku,
Akinobu Lee, Keiichi Tokuda, ``Probabilistic Answer Selection
Based on Conditional Random Fields for Spoken Dialog System,''
Interspeech 2008, pp.215-218, Brisbane, Australia, September
22-26, 2008.
- Yi-Jian Wu, Keiichi Tokuda, ``Minimum Generation Error
Training with Direct Log Spectral Distortion on LSPs for
HMM-Based Speech Synthesis,'' Interspeech 2008, pp.577-580,
Brisbane, Australia, September 22-26, 2008.
- Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko
Nankaku, Akinobu Lee, Keiichi Tokuda, ``Acoustic Modeling
Based on Model Structure Annealing for Speech Recognition,''
Interspeech 2008, pp.932-935, Brisbane, Australia, September
22-26, 2008.
- Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee,
Keiichi Tokuda, ``Bayesian Context Clustering Using Cross
Valid Prior Distribution for HMM-Based Speech Recognition,''
Interspeech 2008, pp.936-939, Brisbane, Australia, September
22-26, 2008.
- Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda,
``Probabilistic Feature Mapping Based on Trajectory HMMs,''
Interspeech 2008, pp.1068-1071, Brisbane, Australia,
September 22-26, 2008.
- Kaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda,
Keiichi Tokuda, ``Simultaneous Conversion of Duration and
Spectrum Based on Statistical Models Including Time-Sequence
Matching,'' Interspeech 2008, pp.1072-1075, Brisbane,
Australia, September 22-26, 2008.
- Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu
Lee, Keiichi Tokuda, ``Speaker Recognition Based on
Variational Bayesian Method,'' Interspeech 2008,
pp.1417-1420, Brisbane, Australia, September 22-26, 2008.
- Simon King, Keiichi Tokuda, Heiga Zen, Junichi Yamagishi,
``Unsupervised Adaptation for HMM-Based Speech Synthesis,''
Interspeech 2008, pp.1869-1872, Brisbane, Australia,
September 22-26, 2008.
- Yoshihiko Nankaku, Kazuhiro Nakamura, Heiga Zen, Keiichi
Tokuda, ``Acoustic modeling with contextual additive structure
for HMM-based speech recognition,'' 2008 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2008), pp.4469-4472, Las Vegas, Nevada, U.S.A., March
30-April 4, 2008.
- Junichi Yamagishi, Takashi Nose, Heiga Zen, Tomoki Toda,
Keiichi Tokuda, ``Performance evaluation of the
speaker-independent HMM-based speech synthesis system HTS-2007
for the Blizzard Challenge 2007,'' 2008 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2008), pp.3957-3960, Las Vegas, Nevada, U.S.A., March
30-April 4, 2008.
- Tomoki Toda, Keiichi Tokuda, ``Statistical approach to
vocal tract transfer function estimation based on factor
analyzed trajectory HMM,'' 2008 IEEE International Conference
on Acoustics, Speech, and Signal Processing (ICASSP 2008),
pp.3925-3928, Las Vegas, Nevada, U.S.A., March 30-April 4,
2008.
- Ranniery Maia, Tomoki Toda, Keiichi Tokuda, Shinsuke
Sakai, Satoshi Nakamura, ``On the state definition for a
trainable excitation model in HMM-based speech synthesis,''
2008 IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP 2008), pp.3965-3968, Las Vegas,
Nevada, U.S.A., March 30-April 4, 2008.
- Yi-Jian Wu, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda,
``Minimum generation error criterion considering global/local
variance for HMM-based speech synthesis,'' 2008 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2008), pp.4621-4624, Las Vegas, Nevada,
U.S.A., March 30-April 4, 2008.
- Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda,
``Model-Space MLLR for Trajectory HMMs,'' Interspeech 2007 -
EUROSPEECH, pp.2065-2068, Antwerp, Belguim, August 27-31,
2007.
- Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku,
Keiichi Tokuda, ``A trainable excitation model for HMM-based
speech synthesis,'' Interspeech 2007 - EUROSPEECH,
pp,1909-1912, Antwerp, Belguim, August 27-31, 2007.
- Junichi Yamagishi, Heiga Zen, Tomoki Toda, Keiichi Tokuda,
``Speaker-independent HMM-based speech synthesis system -
HTS-2007 system for the Blizzard Challenge 2007,'' Proc. of
Blizzard Challenge 2007, Bonn, Germany, August 25, 2007
(CD-ROM Proceedings,
http://festvox.org/blizzard/bc2007/index.html).
- Jinfu Ni, Toshio Hirai, Hisashi Kawai, Tomoki Toda,
Keiichi Tokuda, Minoru Tsuzaki, Shinsuke Sakai, Ranniery Maia,
Satoshi Nakamura, ``ATRECSS - ATR English speech corpus for
speech synthesis,'' Proc. of Blizzard Challenge 2007, Bonn,
Germany, August 25, 2007 (CD-ROM Proceedings,
http://festvox.org/blizzard/bc2007/index.html).
- Yoshihiko Nankaku, Kenichi Nakamura, and Keiichi Tokuda,
``Spectral conversion based on statistical models including
time-frequency matching,'' Proc. of 6th ISCA Speech Synthesis
Workshop, Bonn, Germany, August 22-24, 2007 (CD-ROM
proceedings).
- Shinsuke Sakai, Jinfu Ni, Ranniery Maia, Keiichi Tokuda,
Minoru Tsuzaki, Tomoki Toda, Hisashi Kawai, and Satoshi
Nakamura, ``Communicative Speech Synthesis with XIMERA: a
first step,'' Proc. of 6th ISCA Speech Synthesis Workshop,
Bonn, Germany, August 22-24, 2007 (CD-ROM proceedings).
- Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku,
Keiichi Tokuda, ``An excitation model for HMM-based speech
synthesis based on residual modeling,'' Proc. of 6th ISCA
Speech Synthesis Workshop, Bonn, Germany, August 22-24, 2007
(CD-ROM proceedings).
- Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon
King, Heiga Zen, Tomoki Toda, Keiichi Tokuda, ``Improved
average-voice-based speech synthesis using gender-mixed
modeling and a parameter generation algorithm considering
GV,'' Proc. of 6th ISCA Speech Synthesis Workshop, Bonn,
Germany, August 22-24, 2007 (CD-ROM proceedings).
- Heiga Zen, Takashi Nose, Junichi Yamagishi, Shinji Sako,
Takashi Masuko, Alan W. Black, Keiichi Tokuda, ``The HMM-based
speech synthesis system (HTS) version 2.0,'' Proc. of 6th ISCA
Speech Synthesis Workshop, Bonn, Germany, August 22-24, 2007
(CD-ROM proceedings).
- Yoshihiko Nankaku, and Keiichi Tokuda, ``Face recognition
using hidden Markov eigenface models,'' 2007 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2007), vol.2, pp.II-469-II-472, Hawaii,
USA, April 15-20, 2007.
- Alan W. Black, Heiga Zen, Keiichi Tokuda, ``Statistical
parametric speech synthesis,'' 2007 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2007), vol.4, pp.IV-1229-IV-1232, Hawaii, USA, April 15-20,
2007.
- Keiichi Tokuda, ``Hidden Markov model-based speech
synthesis as a tool for constructing comunicative spoken
dialog systems,'' Proc. of The 4th Joint Meeting of The
Acoustical Society of America and The Acoustical Society of
Japan, Honolulu, Hawaii, 28 November-2 December 2006 (in
J. Acoust. Soc. Am., vol.120, no.5, Part.2, p.3006, November
2006) (invited paper only with abstract).
- Yoshihiro Itogawa, Heiga Zen, Yoshihiko Nankaku, Akinobu
Li, and Keiichi Tokuda, ``Decision-tree-based F0 quantization
for hidden Markov model-based speech coding at 100 bit/s,''
Proc. of The 4th Joint Meeting of ASA/ASJ, Honolulu, Hawai,
Nov. 28-Dec. 2, 2006 (in J. Acoust. Soc. Am., vol.120, no.5,
Part.2, p.3038, November 2006) (abstract paper).
- Kei Hashimoto, Heiga Zen, Nankaku Yoshihiko, Lee Akinobu,
and Keiichi Tokuda, ``Hyperparameter estimation for speech
recognition based on variational Bayesian approach,'' Proc. of
The 4th Joint Meeting of ASA/ASJ, Honolulu, Hawai,
Nov. 28-Dec. 2, 2006 (in J. Acoust. Soc. Am., vol.120, no.5,
Part.2, p.3042, November 2006) (abstract paper).
- Kazuhiro Nakamura, Heiga Zen, Yoshihiro Nankaku, and
Keiichi Tokuda, ``Acoustic modeling with contextual additive
structure for hidden Markov model-based speech recognition,''
Proc. of The 4th Joint Meeting of ASA/ASJ, Honolulu, Hawai,
Nov. 28-Dec. 2, 2006 (in J. Acoust. Soc. Am., vol.120, no.5,
Part.2, p.3042, November 2006) (abstract paper).
- Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi
Kitamura, ``Speaker adaptation of trajectory HMMs using
feature-space MLLR,'' Interspeech 2006 - ICLSP, pp.1141-1144,
Pittsburgh, PA, Sept. 17-21, 2006.
- Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee,
Keiichi Tokuda, ``HMM-based singing voice synthesis system,''
Interspeech 2006 - ICSLP, pp.2274-2277, Pittsburgh, PA,
Sept. 17-21, 2006.
- Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee,
Keiichi Tokuda, ``Voice conversion based on mixtures of factor
analyzers,'' Interspeech 2006 - ICSLP, pp.2278-2281,
Pittsburgh, PA, Sept. 17-21, 2006.
- Tomohiro Hakamata, Akinobu Lee, Yoshihiko Nankaku, Keiichi
Tokuda, ``Reducing computation on Parallel decoding using
frame-wise confidence scores,'' Interspeech 2006 - ICSLP,
pp.1638-1641, Pittsburgh, PA, Sept. 17-21, 2006.
- Heiga Zen, Tomoki Toda, Keiichi Tokuda, ``The Nitech-NAIST
HMM-based speech synthesis system for the Blizzard Challenge
2006,'' Blizzard Challenge 2006 Workshop, Pittsburgh,
PA, 2006 (http://festvox.org/blizzard/blizzard2006.html).
- Tomoki Toda, Hisashi Kawai, Toshio Hirai, Jinfu Ni,
Nobuyuki Nishizawa, Junichi Yamagishi, Minoru Tsuzaki, Keiichi
Tokuda, Satoshi Nakamura, ``Developing a Test Bed of English
Text-to-Speech System XIMERA for the Blizzard Challenge
2006,'' Blizzard Challenge 2006 Workshop, Pittsburgh, PA, 2006
(http://festvox.org/blizzard/blizzard2006.html).
- Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi
Kitamura, ``Estimating trajectory HMM parameters using Monte
Carlo EM with Gibbs sampler,'' 2006 IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP
2006), vol.1, pp.I-1173-I-1176, Toulouse, France, May 14-19,
2006.
- Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee,
Keiichi Tokuda, ``Hidden semi-Markov model based speech
recognition system using weighted finite-state transducer,''
2006 IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP 2006), vol.1, pp.I-33-I-36,
Toulouse, France, May 14-19, 2006.
- Kenichi Nakamura, Tomoki Toda, Yoshihiko Nankaku, Keiichi
Tokuda, ``On the use of phonetic information for mapping from
articulatory movements to vocal tract spectrum,'' 2006 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2006), vol.1, pp.I-93-I-96, Toulouse,
France, May 14-19, 2006.
- Daisuke Kurata, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi
Kitamura, Zoubin Ghahramani, ``Face recognition based on
separable lattice HMMs,'' 2006 IEEE International Conference
on Acoustics, Speech, and Signal Processing (ICASSP 2006),
vol.5, pp.V-737-V-740, Toulouse, France, May 14-19, 2006
(Student Paper Award).
- Alan W. Black, Keiichi Tokuda, ``The Blizzard Challenge -
2005: Evaluating corpus-based speech synthesis on common
datasets,'' INTERSPEECH 2005 - EUROSPEECH, pp.77-80, Lisbon,
Portugal, September 4-8, 2005.
- Maria João Barros, Ranniery Maia, Keiichi Tokuda,
Fernando Gil Resende, Diamantino Freitas, ``HMM-based European
Portuguese TTS system,'' INTERSPEECH 2005 - EUROSPEECH,
pp.2581-2584, Lisbon, Portugal, September 4-8, 2005.
- Tomoki Toda, Keiichi Tokuda, ``Speech parameter generation
algorithm considering global variance for HMM-based speech
synthesis,'' INTERSPEECH 2005 - EUROSPEECH, pp.2801-2804,
Lisbon, Portugal, September 4-8, 2005.
- Yusuke Kida, Hiroyoshi Yamamoto, Chiyomi Miyajima, Keiichi
Tokuda, Tadashi Kitamura, ``Minimum classification error
interactive training for speaker identification,'' 2005 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2005), pp.I-641-I-644, Philadelphia, USA,
March 18-23, 2005.
- Amaro Azevedo de Lima, Heiga Zen, Yoshihiko Nankaku,
Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura, ``Sparse
KPCA for feature extraction in speech recognition,'' 2005 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2005), pp.I-353-I-356, Philadelphia, USA,
March 18-23, 2005.
- Tomoki Toda, Alan W. Black, Keiichi Tokuda, ``Spectral
conversion based on maximum likelihood estimation considering
global variance of converted parameter,'' 2005 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2005), pp.I-9-I-12, Philadelphia, USA,
March 18-23, 2005.
- Keiichi Tokuda, Heiga Zen, and Tadashi Kitamura,
``Reformulating the HMM as a Trajectory Model,'' Workshop on
Statictical modeling Approach for Speech Recognition (Beyond
HMM), Kyoto, Japan, Dec. 20, 2004.
- Shunsuke Kataoka, Nobuaki Mizutani, Keiichi Tokuda,
Tadashi Kitamura, ``Decision-tree backing-off in HMM-based
speech synthesis,'' International Conference on Spoken
Language Processing (INTERSPEECH2000-ICSLP2000), vol.2,
pp.II-1205-II-1208, Jeju, Korea, Oct. 4-8, 2004.
- Ryosuke Tsuzuki, Heiga Zen, Keiichi Tokuda, Tadashi
Kitamura, Murtaza Bulut, Shrikanth S. Narayanan,
``Constructing emotional speech synthesizers with limited
speech database,'' International Conference on Spoken Language
Processing (INTERSPEECH2004-ICSLP2004), vol.2,
pp.II-1185-II-1188, Jeju, Korea, Oct. 4-8, 2004.
- Tomoki Toda, Alan W. Black, Keiichi Tokud
``Acoustic-to-articulatory inversion mapping with Gaussian
mixture model,'' International Conference on Spoken Language
Processing (INTERSPEECH2004-ICSLP2004), vol.2,
pp.II-1129-II-1132, Jeju, Korea, Oct. 4-8, 2004.
- Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao
Kobayashi, Tadashi Kitamura, ``Hidden semi-Markov model based
speech synthesis,'' International Conference on Spoken
Language Processing (INTERSPEECH2004-ICSLP2004), vol.2
pp.II-1393-1II-386, Jeju, Korea, Oct. 4-8, 2004.
- Yohei Itaya, Heiga Zen, Yoshihiko Nankaku, Chiyomi
Miyajima, Keiichi Tokuda, Tadashi Kitamura, ``Deterministic
annealing EM algorithm in parameter estimation for acoustic
model,'' International Conference on Spoken Language
Processing (INTERSPEECH2004-ICSLP2004), vol.1,
pp.I-437--I-440, Jeju, Korea, Oct. 4-8, 2004.
- T. Nitta, S. Sagayama, Y. Yamashita, T. Kawahara,
S. Morishima, S. Nakamura, A. Yamada, K. Ito, M. Kai, A. Li,
M. Mimura, K. Hirose, T. Kobayashi, K. Tokuda, N. Minematsu,
Y. Den, T. Utsuro, T. Yotsukura, H. Shimodaira, M. Araki,
T. Nishimoto, N. Kawaguchi, H. Banno, K. Katsurada,
``Activities of Interactive Speech Technology Consortium
(ISTC) Targeting Open Software Development for MMI Systems,''
13th IEEE International Workshop on Robot and Human
Interactive Communication (RO-MAN 2004), Kurashiki, Okayama,
Japan, September 20-22, 2004 (CD-ROM proceedings).
- Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, ``An
introduction of trajectory model into HMM-based speech
synthesis,'' Proc. of 5th ISCA Speech Synthesis Workshop,
Pittsburgh, U.S.A., June 2004 (CD-ROM proceedings).
- Tomoki Toda, Alan W. Black, and Keiichi Tokuda, ``Mapping
from articulatory movements to vocal tract spectrum with
gaussian mixture model for articulatory speech synthesis,''
Proc. of 5th ISCA Speech Synthesis Workshop, Pittsburgh,
U.S.A., June 2004 (CD-ROM proceedings).
- Hisashi Kawai, Tomoki Toda, Jinfu Ni, Minoru Tsuzaki, and
Keiichi Tokuda, ``Ximera: A New TTS from ATR Based on
Corpus-Based Technologies,'' ISCA 5th Speech Synthesis
Workshop, pp.179-184, Pittsburgh, U.S.A., June 2004.
- Hiroyoshi Yamamoto, Yoshihoko Nankaku, Chiyomi Miyajima,
Keiichi Tokuda, Tadashi Kitamura, ``Parameter sharing and
minimum classification error training of mixtures of factor
analyzers for speaker identification,'' 2004 IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2004), vol.1, pp.29-32, Montreal, Canada,
May 17-21, 2004.
- Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, ``A Viterbi
algorithm for a trajectory model derived from HMM with
explicit relationship between static and dynamic features,''
Proceedings of 2004 IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP 2004), vol.1,
pp.837-840, Montreal, Canada, May 17-21, 2004.
- Takahiro Hoshiya, Heiga Zen, Shinji Sako, Keiichi Tokuda,
Takashi Masuko, Takao Kobayashi, Tadashi Kitamura, ``An
HMM-based approach to speaker-dependent 100 bit/s speech
coding,'' Special Workshop in MAUI (SWIM), Lectures by
Masters in Speech Processing, Maui, Hawaii, Jan. 12-14, 2004
(invited session).
- Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, ``Decision
Tree-based Simultaneous Clustering of Phonetic Contexts,
Dimensions, and State Positions for Acoustic Modeling,''
Proceedings of European Conference on Speech Communication and
Technology, pp.3189-3192, Geneva, Switzerland, Sep. 1-4,
2003.
- Amaro Azevedo de Lima, Heiga Zen, Yoshihiko Nankaku,
Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura, ``On the
Use of Kernel PCA for Feature Extraction in Speech
Recognition,'' Proceedings of European Conference on Speech
Communication and Technology, pp.2625-2628, Geneva,
Switzerland, Sep. 1-4, 2003.
- Ranniery da Silva Maia, Heiga Zen, Keiichi Tokuda, Tadashi
Kitamura, Fernando Gil Vianna Resende Junior, ``Towards the
development of a Brazilian Portuguese text-to-speech system
based on HMM,'' Proceedings of European Conference on Speech
Communication and Technology, pp.2465-2468, Geneva,
Switzerland, Sep. 1-4, 2003.
- Keiichi Tokuda, Heiga Zen, Tadashi Kitamura, ``Trajectory
modeling based on HMMs with the explicit relationship between
static and dynamic features,'' Proceedings of European
Conference on Speech Communication and Technology,
pp.865-868, Geneva, Switzerland, Sep. 1-4, 2003.
- Junichi Yamagishi, Takashi Masuko, Keiichi Tokuda, Takao
Kobayashi, ``A training method for average voice model based
on shared decision tree context clustering and speaker
adaptive training,'' Proceedings of IEEE International
Conference on Acoustics, Speech, and Signal Processing
(ICASSP), vol.1, pp.716-719, Hong Kong, April 6-10, 2003.
- Hiroyuki Suzuki, Heiga Zen, Yoshihiko Nankaku, Chiyomi
Miyajima, Keiichi Tokuda, Tadashi Kitamura, ``Speech
recognition using voice-characteristic-dependent acoustic
models,'' Proceedings of IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP), vol.1,
pp.740-743, Hong Kong, April 6-10, 2003.
- Takahiro Hoshiya, Shinji Sako, Keiichi Tokuda, Takashi
Masuko, Takao Kobayashi, Tadashi Kitamura, Heiga Zen, ``
Improving the performance of HMM-based very low bitrate speech
coding,'' Proceedings of IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP), vol.1,
pp.800-803, Hong Kong, April 6-10, 2003.
- Keiichi Tokuda, Zen Heiga, Alan W. Black, ``An HMM-based
speech synthesis system applied to English,'' 2002 IEEE Speech
Synthesis Workshop, Santa Monica, California, Sep. 11-13,
2002 (CD-ROM proceedings).
- Shin-ichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta,
Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo
Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi
Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose,
Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito
Utsuro, Shigeki Sagayama, ``Open-source Software for
Developing Anthropomorphic Spoken Dialog Agents,''
International Workshop on LIFELIKE ANIMATED AGENTS Tools,
Affective Functions, and Applications, pp.64-69, Tokyo,
Japan, August 19, 2002.
- Junichi Yamagishi, Masatsune Tamura, Takashi Masuko,
Keiichi Tokuda, Takao Kobayashi, ``A context clustering
technique for average voice model in HMM-based speech
synthesis,'' 8th International Conference on Spoken Language
Processing, pp.133-136, Sep. 16-20, Denver, Colorado, 2002.
- Kengo Shichiri, Atsushi Sawabe, Keiichi Tokuda, Takashi
Masuko, Takao Kobayashi, Tadashi Kitamura, ``Eigenvoices for
HMM-based speech synthesis,'' 8th International Conference on
Spoken Language Processing, pp.1269-1272, Sep. 16-20,
Denver, Colorado, 2002.
- Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, ``Decision
tree distribution tying based on a dimensional split
technique,'' 8th International Conference on Spoken Language
Processing, pp.1257-1260, Sep. 16-20, Denver, Colorado, 2002.
- Shin-ichi Kawamoto, Hiroshi Shimodaira, Shigeki Sagayama,
Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katunobu
Itou, Shigeo Morishima, Tatsuo Yotsukura, Akihiko Kai, Akinobu
Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda,
Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu
Den, Takehito Utsuro, ``Developments of anthropomorphic dialog
agent: A plan and development and its significance,''
International Workshop on Information Presentation and Natural
Multimodal Dialogue, Verona, Italy, 14-15 Dec. 2001.
- Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao
Kobayashi, Tadashi Kitamura, ``Mixed excitation for HMM-based
speech Synthesis,'' Proceedings of European Conference on
Speech Communication and Technology, vol.3, pp.2263-2266,
Aalborg, Denmark, Sep. 3-7, 2001.
- Takayuki Satoh, Takashi Masuko, Takao Kobayashi, Keiichi
Tokuda, ``A robust speaker verification system against
imposture using an HMM-based speech synthesis system,''
Proceedings of European Conference on Speech Communication and
Technology, vol.2, pp.759-762, Aalborg, Denmark, Sep. 3-7,
2001.
n
- Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao
Kobayashi, ``Text-to-speech synthesis with arbitrary speaker's
voice from average voice,'' Proceedings of European Conference
on Speech Communication and Technology, vol.1, pp.345-348,
Aalborg, Denmark, vol.1 pp.345-348, Sep. 3-7, 2001.
- Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura,
``Minimum classification error training for speaker
identification using Gaussian mixture models based on
multi-space probability distribution,'' Proceedings of
European Conference on Speech Communication and Technology,
Aalborg, Denmark, vol.4, pp.2347-2350, Sep. 3-7, 2001.
- Masatsune Tamura, Takashi Masuko, Keiichi Tokuda and Takao
Kobayashi, ``Adaptation of pitch and spectrum for HMM-based
speech synthesis using MLLR,'' Proceedings of IEEE
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP), vol.2, pp.805-808, Salt Lake City, Utah,
USA, May 7-11, 2001.
- Chiyomi Miyajima, Yosuke Hattori, Keiichi Tokuda, Takashi
Masuko, Takao Kobayashi, and Tadashi Kitamura, ``Speaker
identification using Gaussian mixture models based on
multi-space probability distribution,'' Proceedings of
International Conference on Acoustics, Speech, and Signal
Processing (ICASSP), Salt Lake City, Utah, USA, pp.433-436,
May 7-11, 2001.
- Junibakti Sanubari, Keiichi Tokuda, ``Fast convergence
transversal adaptive filtering algorithm for impulsive
environment based on t-distribution,'' 2001 IEEE International
Symposium on Circuits and Systems, Sydney, Australia, May 6 -
9, 2001.
- Takahiro Nakanishi, Keiichi Tokuda, Tadashi Kitamura,
``Simultaneous determination of model-order and frame
partitioning for time series analysis based on MDL
criterion,'' International Symposium on Information Theory and
Its Applications (ISITA-2000), Honolulu, Hawaii, November 5-8,
2000 (accepted).
- Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura,
``Audio-visual speech recognition based on minimum
classification error discriminative training,'' 2000 IEEE
International Workshop on Neural Networks for Signal
Processing, Syndney, Australia, pp.3-12, 11-13 Dec. 2000
(invited session).
- Shinji Sako, Keiichi Tokuda, Takashi Masuko, Takao
Kobayashi, Tadashi Kitamura, ``HMM-based text-to-audio-visual
speech synthesis,'' International Conference on Spoken
Language Processing (ICSLP2000/INTERSPEECH2000), vol.III,
pp.25-28, Beijing, China, Oct. 16-20, 2000 (invited session).
- Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura,
``Audio-visual speech recognition using MCE-based HMMs and
model-dependent stream weights,'' International Conference on
Spoken Language Processing (ICSLP2000/INTERSPEECH2000),
vol.II, pp.1023-1026, Beijing, China, Oct. 16-20, 2000.
- Takashi Masuko, Keiichi Tokuda and Takao Kobayashi,
``Imposture using synthetic speech against speaker
verification based on spectrum and pitch,'' International
Conference on Spoken Language Processing
(ICSLP2000/INTERSPEECH2000), vol.III, pp.302-305, Beijing,
China, Oct. 16-20, 2000.
- Toru Takahashi, Keiichi Tokuda, Takao Kobayashi, Tadashi
Kitamura, ``Vector quantization of mel-cepstral coefficients
based on a statistical measure,'' IEEE International Synposium
on Intelligent Signal Processing and Communication Synstems,
Honolulu, Hawaii, pp.692-695, Nov. 5-8, 2000.
- Junibakti Sanubari, Keiichi Tokuda, ``Robust estimation of
an AR multi-channel model by using t-distribution
assumption,'' Proc. The 10th European Signal Processing
Conference (EUSIPCO-2000), vol.3, pp.1783-1786, Tempere,
Finland, 4-8, Sep. 2000.
- Yoshihiko Nankaku, Keiichi Tokuda, Takao Kobayashi,
Tadashi Kitamura, ``Normalized training for HMM-based visual
speech recognition,'' Proc. of IEEE International Conferece on
Image Processing, Vancouver, Canada, vol.3, pp.234-237,
Sep. 2000.
- Keiichi Tokuda, Takayoshi Yoshimura, Takashi Masuko, Takao
Kobayashi, Tadashi Kitamura, ``Speech parameter generation
algorithms for HMM-based speech synthesis,'' Proceedings of
IEEE International Conference on Acoustics, Speech, and Signal
Processing, Istanbul, Turkey, vol.3, pp.1315-1318, June 2000.
- Junibakti Sanubari, Keiichi Tokuda, ``Image modeling using
two dimentional exponential systems,'' IEEE International
Conference on Image Processing, Kobe, Japan,
pp.28AP4.8.1-28AP4.8.4, Oct. 1999.
- Junibakti Sanubari, Keiichi Tokuda, ``Two dimensional
adaptive filter based on t-distribution assumption and
full-plane support,'' Proceedings of Thirty-Third Asilomar
Conference on Signal, Systems, and Computers, Pcific Groove,
California, Oct. 24-27, pp.815-819, 1999.
- Oscar Vanegas, Keiichi Tokuda, Tadashi Kitamura,
``Location normalization of HMM-based lip reading: Experiments
for the M2VTS Database,'' IEEE International Conference on
Image Processing, Kobe, Japan, Oct. 1999 (accepted).
- Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao
Kobayashi and Tadashi Kitamura, ``Simultaneous modeling of
spectrum, pitch and duration in HMM-based speech synthesis,''
Proceedings of European Conference on Speech Communication and
Technology, Budapest, Hungary, vol.5, pp.2347-2350, Sep. 1999.
- Yoshihiko Nankaku, Keiichi Tokuda and Tadashi Kitamura,
``Intensity- and location-normalized training for HMM-based
visual speech recognition,'' Proceedings of European
Conference on Speech Communication and Technology, Budapest,
Hungary, vol.3, pp.1287-1290, Sep. 1999.
- Takashi Masuko, Takafumi Hitotsumatsu, Keiichi Tokuda and
Takao Kobayashi, ``On the security of HMM-based speaker
verification systems against imposture using synthetic
speech,'' Proceedings of European Conference on Speech
Communication and Technology, Budapest, Hungary, vol.3,
pp.1223-1226, Sep. 1999.
- Junibakti Sanubari, Keiichi Tokuda, ``LMS-like two
dimensional adaptive filter with t-distribution assumption
and non-symmetral half plane support,'' Proceedings of
IEEE-EURASIP Workshop on Nonlinear and Image Processing,
Antalaya, Turkey, pp.419-423, June 1999.
- Keiichi Tokuda, Takashi Masuko, Noboru Miyazaki and Takao
Kobayashi, ``Hidden Markov models based on multi-space
probability distribution for pitch pattern modeling,''
Proceedings of IEEE International Conference on Acoustics,
Speech, and Signal Processing, vol.1, pp.229-232, Phoenix,
USA, Mar. 1999.
- Masatsune Tamura, Takashi Masuko, Takao Kobayashi and
Keiichi Tokuda, ``Visual speech synthesis based on parameter
generation from HMM: speech-driven and text-and-speech-driven
approach,'' Proc. International Conference of Auditory-Visual
Speech Processing, pp.219-224, Terrigal, Australia, Dec. 1998.
- Kazuhito Koishida, Goh Hirabayashi, Keiichi Tokuda, and
Takao Kobayashi, ``A 16kbit/s wideband CELP coder using
mel-generalized cepstral analysis and its subjective
evaluation,'' Proc. of International Conference on Spoken
Language Processing (ICLSP-98), vol.6., pp.2583-2586, Sydney,
Australia, Nov.-Dec., 1998.
- Oscar Vanegas, Akiji Tanaka, Keiichi Tokuda, Tadashi
Kitamura, ``HMM-based visual speech recognition using
intensity and location normalization,'' Proc. of International
Conference on Spoken Language Processing (ICLSP-98), vol.2,
pp.789-792, Sydney, Australia, Nov.-Dec., 1998.
- Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao
Kobayashi and Tadashi Kitamura, ''Duration modeling for
HMM-based speech synthesis,'' Proc. of International
Conference on Spoken Language Processing (ICLSP-98), vol.2,
Sydney, Australia, pp.29-32, Nov.-Dec., 1998.
- Takashi Masuko, Takao Kobayashi and Keiichi Tokuda, ''A
very low bit rate speech coder using HMM with speaker
adaptation,'' Proc. of International Conference on Spoken
Language Processing (ICLSP-98), vol.2, pp.507-510, Sydney,
Australia, Nov.-Dec., 1998.
- Masatsune Tamura, Takashi Masuko, Keiichi Tokuda and Takao
Kobayashi, ''Speaker adaptation for HMM-based speech synthesis
system using MLLR,'' Proc. ESCA/COCOSDA Workshop on Speech
Synthesis, pp.273-276, Blue Mountains, Australia, Nov. 1998.
- Oscar Vanegas, Akiji Tanaka, Keiichi Tokuda, Tadashi
Kitamura, ``Intensity/location normalization for automatice
lipreading,'' Proc. International Conference on Signal
Processing, vol.2, pp.920-923, Oct. 1998.
- Junibakti Sanubari, Keiichi Tokuda, ``Recursive two
dimensional spectral estimation based on an AR model excited
by a t-distribution process using AR decomposition
approach,'' Proceedings of IEEE Asia Pacific Conference on
Circuits and Systems, Chiang Mai, Thailand, pp.447-450,
Nov. 1998.
- Junibakti Sanubari, Keiichi Tokuda, ``Adaptive two
dimensional filter based on an AR model excited by a
t-distribution process,'' Proceedings of IESTED
International Conference on Signal and Image Processing, Las
Vegas, Nevada-USA, pp.679-683, Oct. 1998.
- Junibakti Sanubari, Keiichi Tokuda, ``Adaptive spectral
estimation based on an exponential model,'' Proceedings of
IEEE International Conference on Circuits and Systems,
Monterey, California, pp.TAA13-11.1-TAA13-11.4, May 1998.
- Keiichi Tokuda, Takashi Masuko, Jun Hiroi, Takao
Kobayashi, and Tadashi Kitamura, ``A very low bit rate speech
coder using HMM-based speech recognition/synthesis
techniques,'' Proceedings of IEEE International Conference on
Acoustics, Speech, and Signal Processing, vol.2, pp.609-612,
May 1998.
- Kazuhito Koishida, Goh Hirabayashi, Keiichi Tokuda, and
Takao Kobayashi, ``A wideband CELP speech coder at 16 kbit/s
based on mel-generalized cepstral analysis,'' Proceedings of
IEEE International Conference on Acoustics, Speech, and Signal
Processing, vol.1, pp.161-164, Seattle, USA, May 1998.
- Takashi Masuko, Takao Kobayashi, Masatsune Tamura, Jun
Masubuchi, and Keiichi Tokuda, ``Text-to-visual speech
synthesis based on parameter generation from HMM,''
Proceedings of IEEE International Conference on Acoustics,
Speech, and Signal Processing, vol.6, pp.3745-3748, Seattle,
USA, May 1998.
- Junibakti Sanubari and Keiichi Tokuda, ``Non stationary
spectral estimation based on robust time varying AR model
excited by a t-distribution process,'' IEEE Region Ten
Annual Conference on Speech and Image Technologies for
Computing and Telecommunications, Brisbane, Australia,
pp.51-54, Dec. 1997.
- Takao Kobayashi, Takashi Masuko and Keiichi Tokuda, ``HMM
compensation for noisy speech recognition based on cepstral
parameter generation,'' Proceedings of European Conference on
Speech Communication and Technology, vol.3, pp.1583-1586,
Rhodes, Greece, Sep. 1997.
- Takayoshi Yoshimura, Takashi Masuko, Keiichi Tokuda, Takao
Kobayashi and Tadashi Kitamura, ``Speaker interpolation in
HMM-based speech synthesis system,'' Proceedings of European
Conference on Speech Communication and Technology, vol.5,
pp.2523-2526, Rhodes, Greece, Sep. 1997.
- Kazuhito Koishida, Keiichi Tokuda, Takashi Masuko and
Takao Kobayashi, ``Spectral quantization using statistics of
static and dynamic features,'' Proc. of 1997 IEEE Workshop on
Speech Coding for Telecommunications, pp.19-20, Pocono Manor,
USA, Sep. 1997.
- Chiyomi Miyajima, Yoshitsuna Sugiura, Keiichi Tokuda and
Tadashi Kitamura, ``Descrete or tied-mixture HMM based of
sefl-organizing feature map for robust probability
estimation,'' Proceedings of International Conference on
Speech Processing, vol.2, pp.529-532, Seoul, Korea,
Aug. 1997.
- Kazuhito Koishida, Keiichi Tokuda, Takashi Masuko and
Takao Kobayashi, ``Vector quantization of speech spectral
parameters using statistics of dynamic features,'' Proceedings
of International Conference on Speech Processing, vol.1,
pp.247-252, Seoul, Korea, Aug. 1997.
- Junibakti Sanubari, Keiichi Tokuda, ``Robust spectral
estimation based on an AR model excited by a t-distribution
process by using QR decomposition algorithm,'' Proceedings of
IEEE International Conference on Circuits and Systems,
pp.2497-2500, June 1997.
- Fernando G. Resende, Keiichi Tokuda and Mineo Kaneko,
``Multi-band decomposition of the linear prediction error
applied to the least-mean squares method with fixed and
variable step-sizes,'' Proceedings of IEEE International
Conference on Circuits and Systems, pp.2176-2179, June 1997.
- Takashi Masuko, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``Voice characteristics conversion for HMM-based
speech synthesis system,'' Proceedings of IEEE International
Conference on Acoustics, Speech, and Signal Processing, vol.3,
pp.1611-1614, Munichi, Germany, Apr. 1997.
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``Efficient encoding of mel-generalized cepstrum
for CELP coders,'' Proceedings of IEEE International
Conference on Acoustics, Speech, and Signal Processing, vol.2,
pp.1355-1358, Munichi, Germany, Apr. 1997.
- Yasuichi Hamano, Keiichi Tokuda and Mineo Kaneko, ``Image
restoration based on estimation of fractal structure,'' IEEE
Region Ten Conference (Digital Signal Processing
Applications), pp.311-316, Nov. 1996.
- Fernando G. Resende, Keiichi Tokuda and Mineo Kaneko,
``RLS algorithms for adaptive AR spectrum analysis based on
multi-band decomposition of the linear prediction error,''
IEEE Region Ten Conference (Digital Signal Processing
Applications), pp.541-546, Nov. 1996.
- Keiichi Tokuda, Takao Kobayashi, Takashi Masuko and
Satoshi Imai, ``Quantization of vector sequence using
statistics of neighboring input vectors,'' Proc. of ASA and
ASJ 3rd Joint Meeting, Honolulu, USA, pp.1067-1072, Dec. 1996.
- Takao Kobayashi, Takashi Masuko, Keiichi Tokuda and
Satoshi Imai, ``Noisy speech recognition using HMM-based
cepstral parameter generation and compensation,'' Proc. of
ASA and ASJ 3rd Joint Meeting, Honolulu, USA, pp.1117-1122,
Dec. 1996.
- Takashi Masuko, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``HMM-based speech synthesis with various voice
characteristics,'' Proc. of ASA and ASJ 3rd Joint Meeting,
Honolulu, USA, pp.1043-1046, Dec. 1996.
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``Spectral representation of speech using
mel-generalized cepstral coefficients,'' Proc. of ASA and ASJ
3rd Joint Meeting, Honolulu, USA, pp.963-968, Dec. 1996.
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``CELP coding system based on mel-generalized
cepstral analysis,'' Proceedings of International Conference
on Spoken Language Processing, Philadelphia, USA, vol.1,
pp.314-317, Oct. 1996.
- Junibakti Sanubari, Keiichi Tokuda and Mahoki Onoda, ``The
application of t-distribution assumption for robust
spectral estimation,'' Proc. of the IASTED International
Conferences on Signal, Image Processing and Application,
SIPA-96, Annecy, France, pp.217-220, June 1996.
- Junibakti Sanubari, Keiichi Tokuda, Mahoki Onoda, ``Robust
two-dimensional spectral estimation based on an AR model
excited by a t-distribution process,'' Proceedings of IEEE
International Conference on Acoustics, Speech, and Signal
Processing, vol.5, pp.2998-3001, Atlanta, USA, May 1996.
- Takashi Masuko, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``Speech synthesis from HMMs using dynamic
features,'' Proceedings of IEEE International Conference on
Acoustics, Speech, and Signal Processing, vol.1, pp.389-392,
May 1996.
- Fernando G. Resende, Keiichi Tokuda and Mineo Kaneko, ``A
fast algorithm for adaptive AR spectral estimation based on
multi-scale decomposition of linear prediction error,''
Proceedings of Midwest Symposium on Circuits and Systems,
pp.119-122, Aug. 1995.
- Keiichi Tokuda, Takashi Masuko, Tetsuya Yamada, Takao
Kobayashi and Satoshi Imai, ``An algorithm for speech
parameter generation from continuous mixture HMMs with dynamic
features,'' Proceedings of European Conference on Speech
Communication and Technology, vol.1, pp.757-760, Sep. 1995.
- Keiichi Tokuda, Takao Kobayashi and Satoshi Imai, ``Speech
parameter generation from HMM using dynamic features,''
Proceedings of IEEE International Conference on Acoustics,
Speech, and Signal Processing, vol.1, pp.660-663, May 1995.
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``CELP coding based on mel-cepstral analysis,''
Proceedings of IEEE International Conference on Acoustics,
Speech, and Signal Processing, vol.1, pp.33-36, May 1995.
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``Speech coding based on adaptive mel-cepstral
analysis for noisy channels,'' Proceedings of International
Conference on Spoken Language Processing, vol.4,
pp.2087-2090, Sep. 1994.
- Keiichi Tokuda, Takao Kobayashi, Takashi Masuko and
Satoshi Imai, ``Mel-generalized cepstral analysis --a
unified approach to speech spectral estimation,'' Proceedings
of International Conference on Spoken Language Processing,
vol.3, pp.1043-1046, Sep. 1994.
- Fernando G. Resende, Keiichi Tokuda and Mineo Kaneko, ``AR
spectrum estimation based on wavelet representation,''
Proceedings of IEEE International Conference on Circuits and
Systems, vol.2, pp.625-628, June 1994.
- Junibakti Sanubari, Keiichi Tokuda, Mahoki Onoda, ``Robust
recursive spectral estimation based on AR model excited by a
t-distribution process,'' Proceedings of IEEE International
Conference on Acoustics, Speech, and Signal Processing, vol.3,
pp.497-500, Apr. 1994.
- Keiichi Tokuda, Hidetoshi Matsumura, Takao Kobayashi and
Satoshi Imai, ``Speech coding based on adaptive mel-cepstral
analysis,'' Proceedings of IEEE International Conference on
Acoustics, Speech, and Signal Processing, vol.1, pp.197-200,
Apr. 1994.
- Junibakti Sanubari, Keiichi Tokuda and Mahoki Onoda,
``Spectral estimation based on AR model excited by
t-distribution process,'' Proceedings of IEEE International
Conference on Acoustics, Speech, and Signal Processing, vol.5,
pp.521-524, Mar. 1992.
- Takao Kobayashi, Kazuyoshi Fukushi, Keiichi Tokuda and
Satoshi Imai, ``Design of stable two-dimensional IIR digital
filters with arbitrary magnitude function,'' Proceedings of
IEEE International Conference on Acoustics, Speech, and Signal
Processing, vol.5, pp.93-96, Mar. 1992.
- Toshiaki Fukada, Keiichi Tokuda, Takao Kobayashi and
Satoshi Imai, ``An adaptive algorithm for mel-cepstral
analysis of speech,'' Proceedings of IEEE International
Conference on Acoustics, Speech, and Signal Processing, vol.1,
pp.137-140, Mar. 1992.
- Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``Generalized cepstral analysis of speech --unified approach
to LPC and cepstral method,'' Proceedings of International
Conference on Spoken Language Processing, pp.37-40, Nov. 1990.
- Keiichi Tokuda, Takao Kobayashi, Shoji Shiomoto and
Satoshi Imai, ``Adaptive filtering based on cepstral
representation --adaptive cepstral analysis of speech,''
Proceedings of IEEE International Conference on Acoustics,
Speech, and Signal Processing, pp.377-380, Apr. 1990.
Keiichi Tokuda
2013-02-11