論文リスト
各研究チームの論文リストを公開しております。
論文リスト
Kentaro Onda, Satoru Fukayama, Daisuke Saito, Nobuaki Minematsu, "Benchmarking Prosody Encoding in Discrete Speech Tokens," in Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec. 2025.
Go Nishikawa*, Wataru Nakata*, Yuki Saito, Kanami Imamura, Hiroshi Saruwatari, and Tomohiko Nakamura, "Multi-sampling-frequency naturalness MOS prediction using self-supervised learning model with sampling-frequency-independent layer," in Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec. 2025. (*: equal contribution)
Nobutaka Ito, "Introducing fast multichannel nonnegative matrix factorization for underdetermined blind audio source separation," Joint Meeting of the Acoustical Society of America and the Acoustical Society of Japan, Dec. 2025.
Kanami Imamura, Tomohiko Nakamura, Kohei Yatabe, and Hiroshi Saruwatari, "Continuous function approximation of convolutional kernels for sampling frequency adaptation of pre-trained source separation networks," Joint Meeting of the Acoustical Society of America and the Acoustical Society of Japan, Dec. 2025.
Yuta Amezawa, Tomohiko Nakamura, Takahiro Shiina, Satoru Fukayama, Jun Ogata, Hiroki Kuroda, and Takahiko Uchide, "Automatic detection and extraction of later phase in S coda using machine learning for crustal heterogeneity exploration," ACES (APEC Cooperation for Earthquake Science) International Workshop, Nov. 2025.
Yu Hayashizaki, Takashi Nose, Sumiharu Kobayashi, Satoru Fukayama, Akinori Ito, "PUNSER: Large-Scale Pre-trained and Unified Model for Practical Speech Emotion Recognition," in Proceedings of the 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, Oct. 2025.
Rinka Nobukawa, Makito Kitamura, Tomohiko Nakamura, Shinnosuke Takamichi, and Hiroshi Saruwatari, "Drum-to-vocal percussion sound conversion and its evaluation methodology," in Proceedings of Asia Pacific Signal and Information Processing Association Annual Summit and Conference, Oct. 2025.
Shikhar Bharadwaj, Samuele Cornell, Kwanghee Choi, Satoru Fukayama, Hye-jin Shim, Soham Deshmukh, Shinji Watanabe, "OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder," in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2025, Oct. 2025.
Ryan Niu, Shoichi Koyama, and Tomohiko Nakamura, "Head-related transfer function individualization using anthropometric features and spatially independent latent representations," in Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct. 2025.
Hitoshi Suda, Junya Koguchi, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, and Jun Ogata, "IdolSongsJp corpus: A multi-singer song corpus in the style of Japanese idol groups," in Proceedings of the 26th International Society for Music Information Retrieval (ISMIR) Conference, Sep. 2025.