Audio ML Papers

Last 7 Days (June 02 - June 09, 2026)

Subcategories: All (3) | Speech Synthesis (1) | Music Synthesis (0) | Ambient Synthesis (0) | Quality Evaluation (0) | Enhancement (0) | ASR (0) | LLM Audio (0) | MIDI Generation (0) | Generative Conditioning (0) | Other (1)

🏆 Top Papers This Week

#1 TOP PAPER (Score: 93)
Ziyang Ma, Ruiqi Yan, Ruiyang Xu ... · arXiv
We introduce MMAE, a Massive Multitask Audio Editing benchmark, serving as the first comprehensive evaluation testbed designed for general-purpose instruction-based audio editing. Spurred by the shift toward intelligent creation, interactive editing has rapidly expanded from visu...
#2 TOP PAPER (Score: 86)
Marco Pasini, Javier Nistal, Mathias Rose Bjare ... · arXiv
We present LiveBand, a real-time system that generates high-fidelity music accompaniments to live audio input, respecting strict causal constraints. Our method trains a causal transformer generator in the continuous latent space of a pre-trained causal audio autoencoder, using ad...
#3 TOP PAPER (Score: 85)
Nikita Koriagin, Georgii Aparin, Nikita Balagansky ... · arXiv
Language models increasingly serve as the backbone of text-to-speech (TTS) systems, yet we understand little about the representations they build when text and generated speech tokens share a single residual stream. We train BatchTopK sparse autoencoders on the LM backbone of Cos...