Intelligent Media Processing Research Team
Team Outline

Satoru Fukayama
Team Leader
Information
We will present a demo, "What does your voice sound like?", at the AIST Open Day 2024 at AIST Tokyo Waterfront.
Saturday, October 26, 2024, 10 am to 4 pm (last admission 3:30 pm)
Yoshihiro Sato, Satoru Fukayama, and Jun Ogata will be presenting at the Seismological Society of Japan, Fall Meeting 2022.
Fault Plane Estimation from 3D Hypocenter Distribution by Two-step Clustering Considering Local Shapes
Hiroki Karatsu and Satoru Fukayama will be presenting at the 135th IPSJ SIGMUS meeting.
Evaluation of Data Augmentation for DeepBach-based Automatic Four-part Harmonisation
List of Publications
Kentaro Onda, Satoru Fukayama, Daisuke Saito, and Nobuaki Minematsu, "Advanced Modeling of Interlanguage Speech Intelligibility Benefit with L1-L2 Multi-Task Learning Using Differentiable K-Means for Accent-Robust Discrete Token-Based ASR," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2026.
Kohei Asai, Wataru Nakata, Yuki Saito, and Hiroshi Saruwatari, "Geneses: Unified generative speech enhancement and separation," in Proceedings of The Joint Workshop on HSCMA and CHiME 2026 (IEEE ICASSP2026 Satellite Workshop), May 2026.
Kanami Imamura, Tomohiko Nakamura, Kohei Yatabe, and Hiroshi Saruwatari, "Dissecting performance degradation in audio source separation under sampling frequency mismatch," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2026.
Researcher Profile
| Name and role | Field of Expertise |
|---|---|
| Team Leader Satoru Fukayama | Media Informatics, Acoustic Signal Processing, Music Informatics |
| Senior Researcher Tomohiko Nakamura | Signal-processing-inspired deep learning, Audio signal processing, Music information processing |
| Senior Researcher Nobutaka Ito | Acoustic Signal Processing, Source Separation, Array Signal Processing |
| Researcher Hitoshi Suda | Spoken language processing, Singing information processing |
| Cross-appointment fellow Yuki Saito | Speech synthesis, Speech quality assessment |
| AI Engineer Daigo Takizawa | AI System |
| Post-Doctoral Researcher Kai Hiraiwa | Acoustic Signal Processing, Music Information Processing |
| Research Assistant Hiroki Karatsu | Music Information Processing |
| Research Assistant Kanami Imamura | Audio signal processing |
| Research Assistant Naoya Matsuyama | Audio signal processing |
| Research Assistant Kentaro Onda | Audio signal processing, Speech synthesis |
| Research Assistant Shun Takahashi | Spoken Language Processing |
| Research Assistant Riki Takizawa | Sound synthesis |
| Research Assistant Natsuki Toda | Speech production |
| Research Assistant Ryoko Arita | Speech synthesis, Singing voice synthesis |
| Invited Researcher (temporarily transferred to METI) Jun Ogata | Spoken Language Processing, Time Series Processing |
| Invited Researcher | Speech and Audio Processing, Anomaly Detection |
| Invited Researcher (Assoc. Prof. of Keio University) Shinnosuke Takamichi | |
| Invited Researcher (Assoc. Prof. of Tokyo Metropolitan University) Sayaka Shiota | Speech Signal Processing |
| Visiting Researcher (Lecturer of Tokyo City University) Yoshihiro Sato | Image/Signal Processing, 3-D structure analysis |
| Visiting Researcher (National Institute of Informatics) Yusuke Yasuda | Speech information processing |




















