Intelligent Media Processing Research Team
Team Outline

Satoru Fukayama
Team Leader
Information
We will present the demo "What does your voice sound like?" at the AIST Open Day 2024 at AIST Tokyo Waterfront.
Saturday, October 26, 2024, 10:00 am to 4:00 pm (last admission 3:30 pm)
Yoshihiro Sato, Satoru Fukayama, and Jun Ogata will be presenting at the 2022 Fall Meeting of the Seismological Society of Japan.
Fault Plane Estimation from 3D Hypocenter Distribution by Two-step Clustering Considering Local Shapes
Hiroki Karatsu and Satoru Fukayama will be presenting at the 135th IPSJ SIGMUS meeting.
Evaluation of Data Augmentation for DeepBach-based Automatic Four-part Harmonisation
List of Publications
William Chen, Shinnosuke Takamichi, Sayaka Shiota, Satoru Fukayama, Samuele Cornell, and Shinji Watanabe, "YODAS v3: Over 1 Million Hours of High-Bandwidth, Stereophonic, Multilingual Speech," in Proceedings of INTERSPEECH, Sept. 2026.
Kentaro Onda, Satoru Fukayama, Daisuke Saito, and Nobuaki Minematsu, "Leveraging Soft Distributions of SSL-Derived Discrete Speech Tokens for Downstream Inference," in Proceedings of INTERSPEECH, Sept. 2026.
Sota Koshino, Shotaro Ueji, Shinnosuke Takamichi, and Tomohiko Nakamura, "Automatic generation of audio comic from manga images," in Proceedings of INTERSPEECH, Show&Tell Session, Sept. 2026.
Researcher Profile
| Name and role | Field of Expertise |
|---|---|
| Team Leader Satoru Fukayama | Media Informatics, Acoustic Signal Processing, Music Informatics |
| Senior Researcher Tomohiko Nakamura | Signal-processing-inspired deep learning, Audio signal processing, Music information processing |
| Senior Researcher Nobutaka Ito | Acoustic Signal Processing, Source Separation, Array Signal Processing |
| Researcher Hitoshi Suda | Spoken language processing, Singing information processing |
| Cross-appointment fellow Yuki Saito | Speech synthesis, Speech quality assessment |
| AI Engineer Daigo Takizawa | AI System |
| Post-Doctoral Researcher Kai Hiraiwa | Acoustic Signal Processing, Music Information Processing |
| Post-Doctoral Researcher Yu-Hua Chen | Music information processing, Audio effect modeling, Guitar music information retrieval |
| Research Assistant Hiroki Karatsu | Music Information Processing |
| Research Assistant Kanami Imamura | Audio signal processing |
| Research Assistant Kentaro Onda | Audio signal processing, Speech synthesis |
| Research Assistant Shun Takahashi | Spoken Language Processing |
| Research Assistant Riki Takizawa | Sound synthesis |
| Research Assistant Natsuki Toda | Speech production |
| Research Assistant Ryoko Arita | Speech synthesis, Singing voice synthesis |
| Research Assistant Wataru Nakata | Speech synthesis |
| Invited Researcher (transferred temporarily to METI) Jun Ogata | Spoken Language Processing, Time Series Processing |
| Invited Researcher | Speech and Audio Processing, Anomaly Detection |
| Invited Researcher (Assoc. Prof., Keio University) Shinnosuke Takamichi | |
| Invited Researcher (Assoc. Prof., Tokyo Metropolitan University) Sayaka Shiota | Speech Signal Processing |
| Invited Researcher | Lightning Protection, Anomaly detection of wind turbines |
| Visiting Researcher (National Institute of Informatics) Yusuke Yasuda | Speech information processing |