EXHIBITION

In Tokuda & Lee laboratory, softwares for promotion of speech and image research are developed and opened to the public. These are used to research in various organizations and companies.

Software

HMM Speech Sysnthesis System toolkit: HTS

#ref(): File not found: "01.jpg" at page "HOME/EXHIBITION"

HTS is a basic software for speech synthesis that a lot of research laboratories (Microsoft, IBM, etc) adopt.

Here

Open-source large vocabulary continuous speech recognition engine: Julius

#ref(): File not found: "02.jpg" at page "HOME/EXHIBITION"

This is a speech recognition software that is adopted by various research laboratories, and maintains Google Rank of the top of Japan as free software.

Here

Speech signal processing toolkit: SPTK

#ref(): File not found: "03.jpg" at page "HOME/EXHIBITION"

This is a software that does signal processing and data processing for the acoustical analysis.

Here

Anthropomorphic spoken dialogue agent: Galatea

#ref(): File not found: "04.jpg" at page "HOME/EXHIBITION"

This is an open-source, license-free software toolkit for building anthropomorphic spoken dialogue agents. This is a product of project which speech, language, and image researchers from ten or more university in Japan participate to build anthropomorphic spoken dialogue agents. HTS and Julius, those are developed in this laboratory, are used on speech wave form generation module and speech recognition module respectively in this software.

Here

Data base

Multimodal speech data base for research: M2TINIT

#ref(): File not found: "05.jpg" at page "HOME/EXHIBITION"

M2TINIT is a multi-modal data base which japanese speech and lip dynamic scene are recorded concurrently. It is developed and opened to the public by Takao Kobayashi laboratory (Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology) and Kitamura & Tokuda laboratory (Department of Computer Science, Nagoya Institute of Technology. Currently, Tokuda & Lee laboratory) for promotion of multi-modal speech research. It has been used to researches that are generation of speech and lip dynamic scene and bimodal speech recognition.

Here (Japanese page)





トップ   新規 一覧 検索 最終更新   ヘルプ   最終更新のRSS