Intelligent Media Processing Research Team
Team Outline

Satoru Fukayama
Team Leader
Information
We will present a demo "What does your voice sound like?" on the AIST Open Day 2024 at AIST Tokyo Waterfront.
Saturday, October 26th, 2024 10am~4pm Last admission 3:30pm
Yoshihiro Sato, Satoru Fukayama, and Jun Ogata will be presenting at the Seismological Society of Japan, Fall Meeting 2022.
Fault Plane Estimation from 3D Hypocenter Distribution by Two-step Clustering Considering Local Shapes
Hiroki Karatsu, Satoru Fukayama will be presenting at IPSJ SIGMUS 135th meeting.
Evaluation of Data Augmentation for DeepBach-based Automatic Four-part Harmonisation
List of Publications
Tomohiko Nakamura, Kwanghee Choi, Keigo Hojo, Yoshiaki Bando, Satoru Fukayama, and Shinji Watanabe, "Discrete speech unit extraction via independent component analysis," in SALMA: Speech and Audio Language Models - Architectures, Data Sources, and Training Paradigms, IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, Apr. 2025.
Yuto Ishikawa, Osamu Take, Tomohiko Nakamura, Norihiro Takamune, Yuki Saito, Shinnosuke Takamichi, and Hiroshi Saruwatari, "Real-time noise estimation for lombard-effect speech synthesis in human-avatar dialogue systems," in Proceedings of Asia Pacific Signal and Information Processing Association Annual Summit and Conference, Dec. 2024.
Kanami Imamura, Tomohiko Nakamura, Kohei Yatabe, and Hiroshi Saruwatari, "Neural analog filter for sampling-frequency-independent convolutional layer," APSIPA Transactions on Signal and Information Processing, Dec. 2024.
Researcher Profile
Photo | Name and role | Field of Expertise | E-mail address HP |
---|---|---|---|
![]() |
Team Leader Satoru Fukayama |
Media Informatics, Acoustic Signal Processing, Music Informatics | |
![]() |
Senior Researcher Tomohiko Nakamura |
Signal-processing-inspired deep Learning, Audio signal processing, Music information processing | |
![]() |
Senior Researcher Nobutaka Ito |
Acoustic Signal Processing, Source Separation, Array Signal Processing | |
![]() |
Researcher Hitoshi Suda |
Spoken language processing, Singing information processing | |
![]() |
AI Engineer Daigo Takizawa |
AI System | |
![]() |
Post-Doctoral Researcher Kai Hiraiwa |
Acoustic Signal Processing, Music Information Processing | |
Research Assistant Hiroki Karatsu |
Music Information Processing | ||
![]() |
Research Assistant Kanami Imamura |
Audio signal processing | |
![]() |
Research Assistant Naoya Matsuyama |
Audio signal processing | |
![]() |
Research Assistant Kentaro Onda |
Audio signal processing, Speech synthesis | |
![]() |
Research Assistant Shun Takahashi |
Spoken Language Processing | |
![]() |
Research Assistant Riki Takizawa |
Sound synthesis | |
Research Assistant Natsuki Toda |
Speech production | ||
![]() |
Invited Researcher Transferred temporarily to METI Jun Ogata |
Spoken Language Processing, Time Series Processing | |
![]() |
Invited Researcher |
Speech and Audio Processing, Anomaly Detection | |
![]() |
Invited Researcher (Assoc. prof. of Keio University) Shinnosuke Takamichi |
||
![]() |
Invited Researcher (Assoc. prof. of Tokyo Metropolitan University) Sayaka Shiota |
Speech Signal Processing | |
![]() |
Visiting Researcher (Lecturer of Tokyo City University) Yoshihiro Sato |
Image/Signal Processing, 3-D structure analysis |