* DEMONSTRATION - Speaker adaptation [#oab71fa7]

CENTER:&ref(adaptation.png,nolink);

> In the speech synthesis system, a large amount of trainin data is required.~
However, the cost for recoding speech is unaffordable.~
Speaker adaptation is a technique for control style with a small amount of adaptation data.

- Speech of the target speaker~
"nazejibuNbakarikoNnameniaunodarou"~
「なぜ自分ばかりこんな目にあうのだろう.」
-- &ref(orig.wav,,target speech);

- Synthesis speech with speaker adaptation~
"nazejibuNbakarikoNnameniaunodarou"~
「なぜ自分ばかりこんな目にあうのだろう.」
-- without adaptation~
&ref(adapt0.wav,,synthesis speech);
-- with adaptation using one sentence~
&ref(adapt1.wav,,synthesis speech);
-- with adaptation using three sentences~
&ref(adapt3.wav,,synthesis speech);
-- with adaptation using five sentences~
&ref(adapt5.wav,,synthesis speech);
-- with adaptation using seven sentences~
&ref(adapt7.wav,,synthesis speech);

> We could perform controling style with only a few adaptation data.~
By using this technique, we will be able to construct the speech synthesis sytem which have higer quality with a small amount of training data.




トップ   編集 差分 履歴 添付 複製 名前変更 リロード   新規 一覧 検索 最終更新   ヘルプ   最終更新のRSS