TOKUDA, NANKAKU and HASHIMOTO LABORATORY - HOME/SOFTWARE の履歴(No.3)

Software

In Tokuda & Lee laboratory, softwares for promotion of speech and image research are developed and opened to the public. These are used to research in various organizations and companies.

Software

HMM Speech Sysnthesis System toolkit: HTS

HTS is a basic software for speech synthesis that a lot of research laboratories (Microsoft, IBM, etc) adopt.

Open-source large vocabulary continuous speech recognition engine: Julius

Julius は，音声認識システムの開発・研究のためのオープンソースの高性能な汎用大語彙連続音声認識エンジンです．
数万語彙の連続音声認識を一般のPC上で実時間で実行できます．
高い汎用性を持ち，発音辞書や言語モデル・音響モデルなどのモジュールを組み替えることで，様々な幅広い用途に応用できます．
機能はライブラリで提供されており，アプリケーションへの組み込みも可能です．

Speech signal processing toolkit: SPTK

This is a software that does signal processing and data processing for the acoustical analysis.

音声合成エンジン: hts_engine

HTSで学習したモデルを用いて音声を合成するソフトウェアです．
BSDライセンスで公開しています．

Anthropomorphic spoken dialogue agent: Galatea

This is an open-source, license-free software toolkit for building anthropomorphic spoken dialogue agents. This is a product of project which speech, language, and image researchers from ten or more university in Japan participate to build anthropomorphic spoken dialogue agents. HTS and Julius, those are developed in this laboratory, are used on speech wave form generation module and speech recognition module respectively in this software.

端末

名工大音声対話端末めいちゃん (Japanese)

名古屋工業大学2号館の1階に音声情報案内端末を設置しました．
名工大にお越しの際はぜひ喋りかけてみてください．

Database

Multimodal speech data base for research: M2TINIT (Japanese)

M2TINIT is a multi-modal data base which japanese speech and lip dynamic scene are recorded concurrently. It is developed and opened to the public by Takao Kobayashi laboratory (Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology) and Kitamura & Tokuda laboratory (Department of Computer Science, Nagoya Institute of Technology. Currently, Tokuda & Lee laboratory) for promotion of multi-modal speech research. It has been used to researches that are generation of speech and lip dynamic scene and bimodal speech recognition.

Software

Software

HMM Speech Sysnthesis System toolkit: HTS

Open-source large vocabulary continuous speech recognition engine: Julius

Speech signal processing toolkit: SPTK

音声合成エンジン: hts_engine

Anthropomorphic spoken dialogue agent: Galatea

端末

名工大 音声対話端末 めいちゃん (Japanese)

Database

Multimodal speech data base for research: M2TINIT (Japanese)

名工大音声対話端末めいちゃん (Japanese)