Department of Computer Science,
Nagoya Institute of Technology, Nagoya, Japan
Low-dimensional Style Token Control for Hyperarticulated Speech Synthesis
Miku Nishihara, Dan Wells, Korin Richmond, Aidan Pine
Interspeech 2024, Kos, Greece, September, 2024.
Singing voice synthesis based on a frame-driven attention mechanism considering vocal timing deviation
Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda
Technical Report of IEICE, vol. 122, no. 339, pp. 19-24, Okinawa, Japan, February, 2023.
A neural audio codec training method for singing voice synthesis
Masato Takagi, Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda
Acoustical Society of Japan 2025 Spring Meeting, pp. 947-950, Saitama, Japan, March, 2025.
A study on vocal timing modeling for sequence-to-sequence singing voice synthesis
Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda
Acoustical Society of Japan 2022 Autumn Meeting, pp. 1359-1362, Hokkaido, Japan, September, 2022.
Singing voice synthesis based on time-lag modeling and frame-driven attention mechanism
Miku Nishihara
Master's Thesis, Nagoya Institute of Technology, February, 2024.
Vocal timing modeling method in sequence-to-sequence singing voice synthesis
Miku Nishihara
Graduation Thesis, Nagoya Institute of Technology, February, 2022.
Singing voice synthesis based on frame-level sequence-to-sequence models considering vocal timing deviation
Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda
arXiv preprint arXiv:2301.02262, January, 2023.