論文リスト
各研究チームの論文リストを公開しております。
論文リスト
Kentaro Onda, Satoru Fukayama, Daisuke Saito, Nobuaki Minematsu, "Benchmarking Prosody Encoding in Discrete Speech Tokens," in Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec. 2025.
Go Nishikawa*, Wataru Nakata*, Yuki Saito, Kanami Imamura, Hiroshi Saruwatari, and Tomohiko Nakamura, "Multi-sampling-frequency naturalness MOS prediction using self-supervised learning model with sampling-frequency-independent layer," in Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec. 2025. (*: equal contribution)
Rinka Nobukawa, Makito Kitamura, Tomohiko Nakamura, Shinnosuke Takamichi, and Hiroshi Saruwatari, "Drum-to-vocal percussion sound conversion and its evaluation methodology," in Proceedings of Asia Pacific Signal and Information Processing Association Annual Summit and Conference, Oct. 2025.
Ryan Niu, Shoichi Koyama, and Tomohiko Nakamura, "Head-related transfer function individualization using anthropometric features and spatially independent latent representations," in Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct. 2025.
Hitoshi Suda, Junya Koguchi, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, and Jun Ogata, "IdolSongsJp corpus: A multi-singer song corpus in the style of Japanese idol groups," in Proceedings of the 26th International Society for Music Information Retrieval (ISMIR) Conference, Sep. 2025.
Kanami Imamura, Tomohiko Nakamura, Norihiro Takamune, Kohei Yatabe, and Hiroshi Saruwatari, "Local equivariance error-based metrics for evaluating sampling-frequency-independent property of neural network," in Proceedings of European Signal Processing Conference, Sep. 2025.
Aogu Wada, Tomohiko Nakamura, and Saruwatari Hiroshi, "Hyperbolic embeddings for order-aware classification of audio effect chains," in Proceedings of International Conference on Digital Audio Effects, Sep. 2025.
Yuki Ito, Tomohiko Nakamura, Shoichi Koyama, Shuichi Sakamoto, and Hiroshi Saruwatari, "Spatial upsampling of head-related transfer function using neural network conditioned on source position and frequency," IEEE Open Journal of Signal Processing, Sep. 2025.
Kentaro Onda, Keisuke Imoto, Satoru Fukayama, Daisuke Saito, Nobuaki Minematsu, "Prosodically Enhanced Foreign Accent Simulation by Discrete Token-based Resynthesis Only with Native Speech Corpora," in Proceedings of Interspeech 2025, Aug. 2025.
Kentaro Onda, Keisuke Imoto, Satoru Fukayama, Daisuke Saito, Nobuaki Minematsu, "Discrete Tokens Exhibit Interlanguage Speech Intelligibility Benefit: an Analytical Study Towards Accent-robust ASR Only with Native Speech Data," in Proceedings of Interspeech 2025, Aug. 2025.