Publications

2026.05.04

Kentaro Onda, Satoru Fukayama, Daisuke Saito, and Nobuaki Minematsu, "Advanced Modeling of Interlanguage Speech Intelligibility Benefit with L1-L2 Multi-Task Learning Using Differentiable K-Means for Accent-Robust Discrete Token-Based ASR," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2026.

2026.05.04

Kohei Asai, Wataru Nakata, Yuki Saito, and Hiroshi Saruwatari, "Geneses: Unified generative speech enhancement and separation," in Proceedings of The Joint Workshop on HSCMA and CHiME 2026 (IEEE ICASSP2026 Satellite Workshop), May 2026.

2026.05.04

Kanami Imamura, Tomohiko Nakamura, Kohei Yatabe, and Hiroshi Saruwatari, "Dissecting performance degradation in audio source separation under sampling frequency mismatch," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2026.

2026.05.04

Karl Schrader, Shoichi Koyama, Tomohiko Nakamura, and Mirco Pezzoli, "Phase-retrieval-based physics-informed neural networks for acoustic magnitude field reconstruction," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2026.

2026.05.04

Shinnosuke Takamichi, Tomohiko Nakamura, Hitoshi Suda, Satoru Fukayama, and Jun Ogata, "MangaVox: Dataset of acted voices aligned with manga images towards computer understanding of audio comics," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2026.

2025.11.28

Kanami Imamura, Tomohiko Nakamura, Norihiro Takamune, Kohei Yatabe, and Hiroshi Saruwatari, "Stride conversion algorithms for convolutional layers and its application to sampling-frequency-independent deep neural networks," Signal Processing, Dec. 2025.

2025.09.22

Yuki Ito, Tomohiko Nakamura, Shoichi Koyama, Shuichi Sakamoto, and Hiroshi Saruwatari, "Spatial upsampling of head-related transfer function using neural network conditioned on source position and frequency," IEEE Open Journal of Signal Processing, vol. 6, pp. 1109-1123, Sept. 2025.

2025.12.01

Nobutaka Ito, "Introducing fast multichannel nonnegative matrix factorization for underdetermined blind audio source separation," Joint Meeting of the Acoustical Society of America and the Acoustical Society of Japan, Dec. 2025.

2025.11.04

Yuta Amezawa, Tomohiko Nakamura, Takahiro Shiina, Satoru Fukayama, Jun Ogata, Hiroki Kuroda, and Takahiko Uchide, "Automatic detection and extraction of later phase in S coda using machine learning for crustal heterogeneity exploration," ACES (APEC Cooperation for Earthquake Science) International Workshop, Nov. 2025.

2025.12.01

Kanami Imamura, Tomohiko Nakamura, Kohei Yatabe, and Hiroshi Saruwatari, "Continuous function approximation of convolutional kernels for sampling frequency adaptation of pre-trained source separation networks," Joint Meeting of the Acoustical Society of America and the Acoustical Society of Japan, Dec. 2025.

1 2 3 4 5 >

PageTop