NeurIPS 2025 Past Speech & audio
AI for Music Workshop
AI4Music
- Submission deadline
- Aug 30, 2025, 11:59 UTC imported from OpenReview — check the website for extensions
- Submission portal
- OpenReview
- Notes
- Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).
Accepted papers (73)
Fetched from OpenReview (v2) on 2026-06-10.
-
A Loopy Framework and Tool for Real-time Human-AI Music Collaboration
-
ACappellaSet: A Multilingual A Cappella Dataset for Source Separation and AI-assisted Rehearsal Tools
-
Adapting Speech Language Model to Singing Voice Synthesis
-
Advancing Multi-Instrument Music Transcription: Results from the 2025 AMT Challenge
-
AI Harmonica: A Smart Electronic Harmonica for Music Learning and Co-Creativity
-
AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion
-
AMBISONIC-DML: Higher-Order Ambisonic Music Dataset for Spatial AI Generation
-
Asura's Harp: Direct Latent Control of Neural Sound
-
Audio-to-Audio Schrodinger Bridges
-
Beyond Collaborative Filtering: Using Decoders for Personalized Music Recommendation
-
Bias beyond Borders: Global Inequalities in AI-Generated Music
-
BNMusic: Blending Environmental Noises into Personalized Music
-
BOSSA: Learning Music Style Through Cross-Modal Bootstrapping
-
Chord-conditioned Melody and Bass Generation
-
CLAM: Safeguarding Authenticity and Addressing Implications for the Music Industry
-
Composer Vector: Style-steering Symbolic Music Generation in a Latent Space
-
DAWZY: A New Addtion to AI powered "Human in the Loop" Music Co-creation
-
DAWZY: Human-in-the-Loop Natural-Language Control of REAPER
-
Demonstrating Singing accompaniment capabilities for MuseControlLite
-
Discovering and Steering Interpretable Concepts in Large Generative Music Models
-
Do Joint Language-Audio Embeddings Encode Perceptual Timbre Semantics?
-
E-Motion Baton: Human-in-the-Loop Music Generation via Expression and Gesture
-
Effortless: AI-Augmented Music Composition and Live Performance in Virtual and Mixed Reality
-
Embedding Alignment in Code Generation for Audio
-
ENHANCING TEXT-TO-MUSIC GENERATION THROUGH RETRIEVAL-AUGMENTED PROMPT REWRITE
-
Enhancing Text-to-Music Generation through Retrieval-Augmented Prompt Rewrite Demo
-
Ethics Statements in AI Music Papers: The Effective and the Ineffective
-
Evaluating Multimodal Large Language Models on Core Music Perception Tasks
-
EVxRAVE: Incorporating Neural Synthesis in an Augmented String Instrument Platform
-
FlashFoley: Fast Interactive Sketch2Audio Generation
-
From Generation to Attribution: Music AI Agent Architectures for the Post-Streaming Era
-
Generating Piano Music with Transformers: A Comparative Study of Scale, Data, and Metrics
-
Generative Multi-modal Feedback for Singing Voice Synthesis Evaluation
-
HARP 3.0: Generalizing I/O and API Support for Machine Learning in Digital Audio Workstations
-
LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR
-
Leveraging Diffusion Models For Predominant Instrument Recognition
-
Linear RNNs for autoregressive generation of long music samples
-
LyricLens: An Interactive System for Multi-Label Music Content Rating
-
Memership and Dataset Inference Attacks on Large Audio Generative Models
-
MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction
-
MIDI-LLM: Adapting Large Language Models for Text-to-MIDI Music Generation
-
Mozart AI: Browser-Based AI Music Co-Production
-
MuCPT: Music-related Natural Language Model Continued Pretraining
-
Multi-bit Audio Watermarking for Music
-
Multimodal Music Tokenization with Residual Quantization for Generative Retrieval
-
Music to Video Matching Based on Beats and Tempo
-
MusicSem: A Semantically Rich Language-Audio Dataset of Organic Musical Discourse
-
MusPyExpress: Extending MusPy with Enhanced Expression Text Support
-
My Music My Choice: Adversarial Protection Against Vocal Cloning in Songs
-
No Encore: Unlearning as Opt-Out in Music Generation
-
PANDORA: Diffusion Policy Learning for Dexterous Robotics Piano Playing with a Train-only LLM Expressiveness Reward
-
Perceptually Aligning Representations of Music via Noise-Augmented Autoencoders
-
Persian Musical Instruments Classification Using Polyphonic Data Augmentation
-
Prompt-Based Music Discovery: A Prototype Using Source Separation And LLMs
-
Rhythmic Stability and Synchronization in Multi-Track Music Generation
-
Robust Neural Audio Fingerprinting using Music Foundation Models
-
Robust Personalized Human-AI Collaboration with SmartLooper
-
Segment-Factorized Full-Song Generation on Symbolic Piano Music
-
Semitone-Aware Fourier Encoding: A Music-Structured Approach to Audio-Text Alignment
-
SepACap: Source Separation for A Cappella Music
-
Slimmable NAM: Neural Amp Models with adjustable runtime computational cost
-
Soundtrack Retrieval for Film Production
-
StylePitcher: Generating Style-Following and Expressive Pitch Curves for Versatile Singing Tasks
-
TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling
-
The Ghost in the Keys: A Disklavier Demo for Human-AI Musical Co-Creativity
-
The Name-Free Gap: Policy-Aware Stylistic Control in Music Generation
-
Towards AI Rapper: Creating an Interactive Rap Battle Experience with Generative AI
-
Using a Joint-Embedding Predictive Architecture for Symbolic Music Understanding
-
Video-to-Music Generation for Film Production: A Dataset and Framework
-
When Creative Machines Learn from Each Other
-
Who Gets Heard? Rethinking Fairness in AI for Music Systems
-
Why Do Music Models Plagiarize? A Motif-Centric Perspective
-
Zero-shot Geometry-Aware Diffusion Guidance for Music Restoration