NeurIPS 2025 Past Large language models

AI That Keeps Up: NeurIPS 2025 Workshop on Continual and Compatible Foundation Model Updates

CCFM

Submission deadline
Sep 3, 2025, 16:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (34)

Fetched from OpenReview (v2) on 2026-06-10.

  1. Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities

    Urs Spiegelhalter, Jörg K.H. Franke, Frank Hutter · PDF
  2. Continual Learning of Domain Knowledge from Human Feedback in Text-to-SQL

    Thomas Cook, Kelly Patel, Sivapriya Vellaichamy, Saba Rahimi, Zhen Zeng, Sumitra Ganesh · PDF
  3. Continual Pre-training of MoEs: How robust is your router?

    Benjamin Thérien, Charles-Étienne Joseph, Zain Sarwar, Ashwinee Panda, Anirban Das, Shi-Xiong Zhang, Stephen Rawls, Sambit Sahu, Eugene Belilovsky, Irina Rish · PDF
  4. Continuous Self-Improvement of Large Language Models by Test-time Training with Verifier-Driven Sample Selection

    Mohammad Mahdi Moradi, Hossam Amer, Sudhir Mudur, Weiwei Zhang, Yang Liu, Walid Ahmed · PDF
  5. CurLL: Curriculum Learning of Language Models

    Pavan Kalyan Tankala, Shubhra Mishra, Satya Lokam, Navin Goyal · PDF
  6. Curriculum Learning as Transport: Training Along Wasserstein Geodesics

    Changho Shin, David Alvarez-Melis · PDF
  7. Do Language Models Robustly Acquire New Knowledge?

    Harshay Shah, Badih Ghazi, Yangsibo Huang, Ravi Kumar, Da Yu, Chiyuan Zhang · PDF
  8. ELLA: Efficient Lifelong Learning for Adapters in Large Language Models

    Shristi Das Biswas, Yue Zhang, Anwesan Pal, Radhika Bhargava, Kaushik Roy · PDF
  9. Embedding‑to‑Prefix: Continual Personalization with Large Language Models

    Bernd Huber, Ghazal Fazelnia, Andreas Damianou, Sebastian Peleato, Maksym Lefarov, Praveen Chandar, Marco De Nadai, Mounia Lalmas, Paul N. Bennett · PDF
  10. EWC-Guided Diffusion Replay for Exemplar-Free Continual Learning in Medical Imaging

    Anoushka Harit, William Prew, Zhongtian Sun, Florian Markowetz · PDF
  11. Exploring Continual Distillation of Teachers from Different Domains

    Nicolas Michel, Maorong Wang, Jiangpeng He, Toshihiko Yamasaki · PDF
  12. Exploring The Effectiveness of Test Time Learning In LLMs for Long Contexts

    Nizar Islah, Irina Rish, Eilif B. Muller · PDF
  13. Harnessing Quantum Principles for Parameter-Efficient Continual Learning

    Xiaobing Yu, Weiwei Ma, Jin Yang, Peijie Qiu, Xiao Wu, Pan Xiao, Xiaofeng Liu · PDF
  14. HyperAdapt: Simple High-Rank Adaptation

    Abel Gurung, Joseph Campbell · PDF
  15. Information-Geometric Perspectives on Merging Variational Foundation Models

    Nour Jamoussi, Giuseppe Serra, Photios A. Stavrou, Marios Kountouris · PDF
  16. IPA: An Information-Preserving Input Projection Framework for Model Adaptation

    Yuan Yin, Shashanka Venkataramanan, Tuan-Hung Vu, Andrei Bursuc, Matthieu Cord · PDF
  17. Mapping Post-Training Forgetting in Language Models at Scale

    Jackson Harmon, Andreas Hochlehnert, Matthias Bethge, Ameya Prabhu · PDF
  18. Per-Axis Weight Deltas for Frequent Model Updates

    Stefan Kuyumdzhiev, Radostin Cholakov · PDF
  19. Pre-training Limited Memory Language Models with Internal and External Knowledge

    Linxi Zhao, Sofian Zalouk, Christian Belardi, Justin Lovelace, Jin Peng Zhou, Kilian Q Weinberger, Yoav Artzi, Jennifer J. Sun · PDF
  20. Probe-Rewrite-Evaluate: A Workflow for Reliable Benchmarks and Quantifying Evaluation Awareness

    Lang Xiong, Nishant Bhargava, Jeremy Chang, Jianhang Hong, Haihao Liu, Vasu Sharma, Kevin Zhu · PDF
  21. PTPP-Aware Adaptation Scaling Laws: Predicting Domain-Adaptation Performance at Unseen Pre-Training Budgets

    Etienne Goffinet, Shane Bergsma, Avraham Sheinin, Natalia Vassilieva, Preslav Nakov, Gurpreet Gosal · PDF
  22. Retrieval Capabilities of Large Language Models Scale with Pretraining FLOPs

    Jacob Portes, Connor Jennings, Erica Ji Yuen, Sasha Doubov, Michael Carbin · PDF
  23. Revisiting Warm-Start Training: No Generalization Loss under Standard Training Schemes

    Hongjoon Ahn, Jinu Hyeon, Hyeonseop Shin, Taesup Moon · PDF
  24. RL's Razor: Why On-Policy Reinforcement Learning Forgets Less

    Idan Shenfeld, Jyothish Pari, Pulkit Agrawal · PDF
  25. Robust LLM Unlearning with MUDMAN: Meta-Unlearning with Disruption Masking And Normalization

    Filip Sondej, Yushi Yang, Mikolaj Kniejski, Marcel Windys · PDF
  26. Sample-Efficient Parametric Learning from Natural Language

    Parth Asawa, Alex Dimakis, Matei Zaharia · PDF
  27. Sculpting [CLS] Features for Foundation Model-Based Class-Incremental Learning

    Murat Onur Yildirim, Elif Ceren Gok Yildirim, Joaquin Vanschoren · PDF
  28. Slim Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models

    Elif Ceren Gok Yildirim, Murat Onur Yildirim, Joaquin Vanschoren · PDF
  29. Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models

    Jonas Hübotter, Patrik Wolf, Aleksandr Shevchenko, Dennis Jüni, Andreas Krause, Gil Kur · PDF
  30. TEMPiRL: Foundational Compounding Temporal Drift Theory for Temporal-Graph Adaptation in Large Language Models

    Arnav Sharma, Karthik Srikumar · PDF
  31. Unlearning That Lasts: Utility-Preserving, Robust, and almost Irreversible Forgetting in LLMs

    Naman Deep Singh, Maximilian Müller, Francesco Croce, Matthias Hein · PDF
  32. Vocabulary Customization for Efficient Domain‑Specific LLM Deployment

    Christian Herold, Michael Kozielski, Nicholas Santavas, Yannick Versley, Shahram Khadivi · PDF
  33. When Data Falls Short: Grokking Below the Critical Threshold

    Vaibhav Singh, Eugene Belilovsky, Rahaf Aljundi · PDF
  34. When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models

    Michael Shihong Zhang, Rishi Adi Ruia, Arnav Kewalram, Saathvik Dharmapuram, Utkarsh Sharma, Kevin Zhu · PDF