List of Publications
A list of publications by each team is publicly available.
List of Publications
Aogu Wada, Tomohiko Nakamura, and Saruwatari Hiroshi, "Hyperbolic embeddings for order-aware classification of audio effect chains," in Proceedings of International Conference on Digital Audio Effects, Sep. 2025.
Hitoshi Suda, Shinnosuke Takamichi, Satoru Fukayama, "Voice Conversion for Likability Control via Automated Rating of Speech Synthesis Corpora," in the Proceedings of Interspeech 2025, Aug. 2025.
Tomohiko Nakamura, Kwanghee Choi, Keigo Hojo, Yoshiaki Bando, Satoru Fukayama, and Shinji Watanabe, "Discrete speech unit extraction via independent component analysis," in SALMA: Speech and Audio Language Models - Architectures, Data Sources, and Training Paradigms, IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, Apr. 2025.
Yuto Ishikawa, Osamu Take, Tomohiko Nakamura, Norihiro Takamune, Yuki Saito, Shinnosuke Takamichi, and Hiroshi Saruwatari, "Real-time noise estimation for lombard-effect speech synthesis in human-avatar dialogue systems," in Proceedings of Asia Pacific Signal and Information Processing Association Annual Summit and Conference, Dec. 2024.
Kanami Imamura, Tomohiko Nakamura, Kohei Yatabe, and Hiroshi Saruwatari, "Neural analog filter for sampling-frequency-independent convolutional layer," APSIPA Transactions on Signal and Information Processing, Dec. 2024.
Hiroaki Hyodo, Shinnosuke Takamichi, Tomohiko Nakamura, Junya Koguchi, and Hiroshi Saruwatari, "DNN-based ensemble singing voice synthesis with interactions between singers," in Proceedings of IEEE Spoken Language Technology Workshop, Dec. 2024.
Hitoshi Suda, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, and Jun Ogata. FruitsMusic: A Real-World Corpus of Japanese Idol-Group Songs. in Proceedings of the 25th International Society for Music Information Retrieval (ISMIR) Conference, 2024.
Mitsuji, Fumiya, Sudesna Chakraborty, Takeshi Morita, Shusaku Egami, Takanori Ugai, and Ken Fukuda. "Entity Linking for Wikidata Using Large Language Models and Wikipedia Links." 2024 Twelfth International Symposium on Computing and Networking Workshops (CANDARW),pp144-49,(2024)
Egami, Shusaku, Takanori Ugai, and Ken Fukuda. :Compressing Multi-Modal Temporal Knowledge Graphs of Videos.: Edited by Lorena Etcheverry, Vanessa Lopez Garcia, Francesco Osborne, and Romana Pernisch. Proceedings of the ISWC 2024 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice, CEUR Workshop Proceedings,pp3828,(2024)
Anaguchi, Fumikatsu, Sudesna Chakraborty, Takeshi Morita, Shusaku Egami, Takanori Ugai, and Ken Fukuda. :Reasoning and Justification System for Domestic Hazardous Behaviors Based on Knowledge Graph of Daily Activities and Retrieval-Augmented Generation.: 2024 Twelfth International Symposium on Computing and Networking (CANDAR),pp11-20,(2024)