NeurIPS 2025 Past Speech & audio

AI for Music Workshop

AI4Music

Submission deadline
Aug 30, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (73)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Loopy Framework and Tool for Real-time Human-AI Music Collaboration

    Sageev Oore, Finlay Miller, Chandramouli Shama Sastry, Sri Harsha Dumpala, Marvin F. da Silva, Daniel Oore, Scott C. Lowe · PDF
  2. ACappellaSet: A Multilingual A Cappella Dataset for Source Separation and AI-assisted Rehearsal Tools

    Ting-Yu Pan, Kexin Phyllis Ju, Hao-Wen Dong · PDF
  3. Adapting Speech Language Model to Singing Voice Synthesis

    Yiwen Zhao, Jiatong Shi, Jinchuan Tian, Yuxun Tang, Jiarui Hai, Jionghao Han, Shinji Watanabe · PDF
  4. Advancing Multi-Instrument Music Transcription: Results from the 2025 AMT Challenge

    Ojas Chaturvedi, Kayshav Bhardwaj, Tanay Gondil, Benjamin Shiue-Hal Chou, Yujia Yan, Kristen Yeon-Ji Yun, Yung-Hsiang Lu, Sungkyun Chang · PDF
  5. AI Harmonica: A Smart Electronic Harmonica for Music Learning and Co-Creativity

    Sherry Ruan · PDF
  6. AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion

    Junyoung Koh, Soo Yong Kim, GYU HYEONG CHOI, Yongwon choi · PDF
  7. AMBISONIC-DML: Higher-Order Ambisonic Music Dataset for Spatial AI Generation

    Seungryeol Paik, Kyogu Lee · PDF
  8. Asura's Harp: Direct Latent Control of Neural Sound

    Kaj Bostrom · PDF
  9. Audio-to-Audio Schrodinger Bridges

    Kevin J. Shih, Zhifeng Kong, Weili Nie, Arash Vahdat, Sang-gil Lee, Joao Felipe Santos, Ante Jukić, Rafael Valle, Bryan Catanzaro · PDF
  10. Beyond Collaborative Filtering: Using Decoders for Personalized Music Recommendation

    Timothy Greer, Nicholas Capel, Emanuele Coviello, Amina Shabbeer · PDF
  11. Bias beyond Borders: Global Inequalities in AI-Generated Music

    Ahmet Solak, Florian Grötschla, Luca A Lanzendörfer, Roger Wattenhofer · PDF
  12. BNMusic: Blending Environmental Noises into Personalized Music

    Chi Zuo, Martin B. Møller, Pablo Martínez-Nuevo, Huayang Huang, Yu Wu, Ye Zhu · PDF
  13. BOSSA: Learning Music Style Through Cross-Modal Bootstrapping

    Jingwei Zhao, Ziyu Wang, Gus Xia, Ye Wang · PDF
  14. Chord-conditioned Melody and Bass Generation

    Alexandra C Salem, Mohammad Shokri, Johanna Devaney · PDF
  15. CLAM: Safeguarding Authenticity and Addressing Implications for the Music Industry

    Arnesh Batra, Krish Thukral, Naman Batra, Dev Sharma, Ruhani Bhatia, Aditya Gautam · PDF
  16. Composer Vector: Style-steering Symbolic Music Generation in a Latent Space

    Xunyi Jiang, Xin Xu · PDF
  17. DAWZY: A New Addtion to AI powered "Human in the Loop" Music Co-creation

    Aaron C Elkins, Sanchit Singh, Adrian Kieback, Sawyer Blankenship, Uyiosa Philip Amadasun, Aman Chadha · PDF
  18. DAWZY: Human-in-the-Loop Natural-Language Control of REAPER

    Aaron C Elkins, Sanchit Singh, Sawyer Blankenship, Adrian Kieback, Uyiosa Philip Amadasun, Aman Chadha · PDF
  19. Demonstrating Singing accompaniment capabilities for MuseControlLite

    Fang-Duo Tsai, Yi-Hsuan Yang · PDF
  20. Discovering and Steering Interpretable Concepts in Large Generative Music Models

    Nikhil Singh, Manuel Cherep, Patricia Maes · PDF
  21. Do Joint Language-Audio Embeddings Encode Perceptual Timbre Semantics?

    Qixin Deng, Bryan Pardo, Thrasyvoulos N Pappas · PDF
  22. E-Motion Baton: Human-in-the-Loop Music Generation via Expression and Gesture

    Mingchen Ma, Stephen Ni-Hahn, Simon Mak, Yue Jiang, Cynthia Rudin · PDF
  23. Effortless: AI-Augmented Music Composition and Live Performance in Virtual and Mixed Reality

    Strong Bear · PDF
  24. Embedding Alignment in Code Generation for Audio

    Sam Kouteili, Hiren Madhu, George Typaldos, Mark Paul Santolucito · PDF
  25. ENHANCING TEXT-TO-MUSIC GENERATION THROUGH RETRIEVAL-AUGMENTED PROMPT REWRITE

    Meiying Ding, Brian McFee, Chenkai Hu, Sunny Yang, Juhua Huang · PDF
  26. Enhancing Text-to-Music Generation through Retrieval-Augmented Prompt Rewrite Demo

    Meiying Ding, Brian McFee, Sunny Yang, Chenkai Hu, Juhua Huang · PDF
  27. Ethics Statements in AI Music Papers: The Effective and the Ineffective

    Julia Barnett, Patrick O'Reilly, Jason Smith, Annie Chu, Bryan Pardo · PDF
  28. Evaluating Multimodal Large Language Models on Core Music Perception Tasks

    Brandon James Carone, Iran R Roman, Pablo Ripollés · PDF
  29. EVxRAVE: Incorporating Neural Synthesis in an Augmented String Instrument Platform

    Brian Lindgren · PDF
  30. FlashFoley: Fast Interactive Sketch2Audio Generation

    Zachary Novack, Koichi Saito, Zhi Zhong, Takashi Shibuya, Shuyang Cui, Julian McAuley, Taylor Berg-Kirkpatrick, christian simon, Shusuke Takahashi, Yuki Mitsufuji · PDF
  31. From Generation to Attribution: Music AI Agent Architectures for the Post-Streaming Era

    Wonil Kim, Hyeongseok Wi, Seungsoon Park, Taejun Kim, Sangeun Keum, Keunhyoung Kim, Taewan Kim, Jongmin Jung, Taehyoung Kim, Gaetan Guerrero, Mael Le Goff, Julie Po, Dongjoo Moon, Juhan Nam, Jongpil Lee · PDF
  32. Generating Piano Music with Transformers: A Comparative Study of Scale, Data, and Metrics

    Jonathan Lehmkuhl, Ábel Ilyés-Kun, Nico Bremes, Cemhan Kaan Özaltan, Frederik Muthers, Jiayi Yuan · PDF
  33. Generative Multi-modal Feedback for Singing Voice Synthesis Evaluation

    Xueyan Li, Yuxin Wang, Mengjie Jiang, Qingzi Zhu, Jing Zhang, Zoey Kim, Yazhe Niu · PDF
  34. HARP 3.0: Generalizing I/O and API Support for Machine Learning in Digital Audio Workstations

    Frank Cwitkowitz, Christodoulos Benetatos, Qixin Deng, Huiran Yu, Nathan Pruyne, Patrick O'Reilly, Hugo Flores García, Zhiyao Duan, Bryan Pardo · PDF
  35. LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR

    Guang Yang, Victoria Ebert, Nazif Can Tamer, Luiza Amador Pozzobon, Noah A. Smith · PDF
  36. Leveraging Diffusion Models For Predominant Instrument Recognition

    Charis Cochran, Yeongheon Lee, Youngmoo Kim · PDF
  37. Linear RNNs for autoregressive generation of long music samples

    Konrad Szewczyk, Daniel Gallo Fernández, James Townsend · PDF
  38. LyricLens: An Interactive System for Multi-Label Music Content Rating

    Kai-Yu Lu, Malhar Sham Ghogare, Zihan Su, Shanu Sushmita · PDF
  39. Memership and Dataset Inference Attacks on Large Audio Generative Models

    Jakub Proboszcz, Paweł Kochański, Karol Korszun, Katarzyna Stankiewicz, Donato Crisostomi, Giorgio Strano, Emanuele Rodolà, Kamil Deja, Jan Dubiński · PDF
  40. MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction

    Yunkee Chae, Kyogu Lee · PDF
  41. MIDI-LLM: Adapting Large Language Models for Text-to-MIDI Music Generation

    Shih-Lun Wu, Yoon Kim, Cheng-Zhi Anna Huang · PDF
  42. Mozart AI: Browser-Based AI Music Co-Production

    Pascual Merita, Petr Ivan, Immanuel Rajadurai, Sundar Arvind, Arjun Khanna · PDF
  43. MuCPT: Music-related Natural Language Model Continued Pretraining

    Kai Tian, Yirong Mao, Wendong Bi, Hanjie Wang, Que Wenhui · PDF
  44. Multi-bit Audio Watermarking for Music

    Luca A Lanzendörfer, Kyle Fearne, Florian Grötschla, Roger Wattenhofer · PDF
  45. Multimodal Music Tokenization with Residual Quantization for Generative Retrieval

    Wo Jae Lee, Emanuele Coviello, Rifat Joyee, Sudev Mukherjee · PDF
  46. Music to Video Matching Based on Beats and Tempo

    Aleksandr Mikheev, Ilya Makarov · PDF
  47. MusicSem: A Semantically Rich Language-Audio Dataset of Organic Musical Discourse

    Rebecca Salganik, Teng Tu, Fei-Yueh Chen, Xiaohao Liu, Kaifeng Lu, Ethan Luvisia, Zhiyao Duan, Guillaume Salha-Galvan, Anson Kahng, Yunshan Ma, Jian Kang · PDF
  48. MusPyExpress: Extending MusPy with Enhanced Expression Text Support

    Phillip Long, Hao-Wen Dong, Julian McAuley, Zachary Novack · PDF
  49. My Music My Choice: Adversarial Protection Against Vocal Cloning in Songs

    Ilke Demir, Gerald Pena Vargas, Alicia Unterreiner, David Ponce, Umur A. Ciftci · PDF
  50. No Encore: Unlearning as Opt-Out in Music Generation

    Jinju Kim, Taehan Kim, Abdul Waheed, Jong Hwan Ko, Rita Singh · PDF
  51. PANDORA: Diffusion Policy Learning for Dexterous Robotics Piano Playing with a Train-only LLM Expressiveness Reward

    Yanjia Huang, Renjie Li, Zhengzhong Tu · PDF
  52. Perceptually Aligning Representations of Music via Noise-Augmented Autoencoders

    Mathias Rose Bjare, Giorgia Cantisani, Marco Pasini, Stefan Lattner, Gerhard Widmer · PDF
  53. Persian Musical Instruments Classification Using Polyphonic Data Augmentation

    Diba Hadi Esfangereh, Mohammad Hossein Sameti, Sepehr Harfi Moridani, Leili Javidpour, Mahdieh Soleymani Baghshah · PDF
  54. Prompt-Based Music Discovery: A Prototype Using Source Separation And LLMs

    Vansh Chugh · PDF
  55. Rhythmic Stability and Synchronization in Multi-Track Music Generation

    Hongrui Wang, Fan Zhang, Zhiyuan Yu, Ziya Zhou, Xi Chen, Yang Wang, Can Yang · PDF
  56. Robust Neural Audio Fingerprinting using Music Foundation Models

    Shubhr Singh, Kiran Bhat, Benjamin Resnick, Xavier Riley, John Thickstun, Walter De Brouwer · PDF
  57. Robust Personalized Human-AI Collaboration with SmartLooper

    Sageev Oore, Finlay Miller, Chandramouli Shama Sastry, Sri Harsha Dumpala, Marvin F. da Silva, Daniel Oore, Scott C. Lowe · PDF
  58. Segment-Factorized Full-Song Generation on Symbolic Piano Music

    Ping-Yi Chen, Chih-Pin Tan, Yi-Hsuan Yang · PDF
  59. Semitone-Aware Fourier Encoding: A Music-Structured Approach to Audio-Text Alignment

    Chengze Du, JinYang Zhang, Wenxin Zhang · PDF
  60. SepACap: Source Separation for A Cappella Music

    Luca A Lanzendörfer, Constantin Pinkl, Florian Grötschla, Roger Wattenhofer · PDF
  61. Slimmable NAM: Neural Amp Models with adjustable runtime computational cost

    Steven Atkinson · PDF
  62. Soundtrack Retrieval for Film Production

    Bill Wang, Haven Kim, Leduo Chen, Minje Kim, Julian McAuley · PDF
  63. StylePitcher: Generating Style-Following and Expressive Pitch Curves for Versatile Singing Tasks

    Jingyue Huang, Qihui Yang, Fei-Yueh Chen, Julian McAuley, Randal Leistikow, Yongyi Zang · PDF
  64. TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling

    SeungHeon Doh, Keunwoo Choi, Juhan Nam · PDF
  65. The Ghost in the Keys: A Disklavier Demo for Human-AI Musical Co-Creativity

    Louis Bradshaw, Alexander Spangher, Stella Biderman, Simon Colton · PDF
  66. The Name-Free Gap: Policy-Aware Stylistic Control in Music Generation

    Ashwin Nagarajan, Hao-Wen Dong · PDF
  67. Towards AI Rapper: Creating an Interactive Rap Battle Experience with Generative AI

    Nikita Kozodoi, Elizaveta Zinovyeva, Zainab Afolabi, Egor Krashenninikov · PDF
  68. Using a Joint-Embedding Predictive Architecture for Symbolic Music Understanding

    Rafik Hachana, Bader Rasheed · PDF
  69. Video-to-Music Generation for Film Production: A Dataset and Framework

    Haven Kim, Leduo Chen, Bill Wang, Hao-Wen Dong, Julian McAuley · PDF
  70. When Creative Machines Learn from Each Other

    Haven Kim, Yusong Wu, Taylor Berg-Kirkpatrick, Julian McAuley · PDF
  71. Who Gets Heard? Rethinking Fairness in AI for Music Systems

    Atharva Mehta, Shivam Chauhan, Megha Sharma, Gus Xia, Kaustuv Kanti Ganguli, Nishanth Chandran, Zeerak Talat, Monojit Choudhury · PDF
  72. Why Do Music Models Plagiarize? A Motif-Centric Perspective

    Tatsuro Inaba, Kentaro Inui · PDF
  73. Zero-shot Geometry-Aware Diffusion Guidance for Music Restoration

    Jia-Wei Liao, Pin-Chi Pan, Li-Xuan Peng, Sheng-Ping Yang, Yen-Tung Yeh, Cheng-Fu Chou, Yi-Hsuan Yang · PDF