ICML 2026 Past Large language modelsSafety & alignment

The Second Workshop on the Impact of Memorization on Trustworthy Foundation Models at ICML

ICML MemFM 2026 Workshop

Submission deadline
May 9, 2026, 12:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (45)

Fetched from OpenReview (v2) on 2026-06-10.

  1. \textsc{ContinuousBench}: Can Differentially Private Synthetic Text Improve Capabilities?

    Peihan Liu, Lucas Rosenblatt, Weiwei Kong, Natalia Ponomareva, Gautam Kamath, Rachel Cummings, Roxana Geambasu, Yu Gan, Lillian Tsai, Alex Bie · PDF
  2. Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models

    Xinyue Liu, Niloofar Mireshghallah, Jane C. Ginsburg, Tuhin Chakrabarty · PDF
  3. Alignment-aware Data Selection for Unlearning in Contrastive Vision-Language Models

    Dongjun Hwang, Yejin Kim, Beomyun Kwon, Junsuk Choe · PDF
  4. Amplifying Membership Signal Through Iterative Regeneration

    Stanisław Pawlak, Wojciech Łapacz · PDF
  5. An Explicit Memory-Driven Agentic Framework for Power System Simulation

    Qinjuan Wang, Yongli Zhu · PDF
  6. Auditing Reasoning-Trace Memorization Claims after Unlearning with Head-Conditioned Canaries

    Yanhang Li, Zhichao Fan, Zexin Zhuang · PDF
  7. Bayes-Optimal Coexistence via Fact Localizability in Trainable-Feature Decoder-Only Transformers

    Manoj Saravanan · PDF
  8. Break the Output Geometry for Large Language Model Unlearning

    Yejin Kim, William F. Shen, Seokwon Jung, Seong Joon Oh · PDF
  9. Cheap Forgetting: Linear Adapter Interpolation as a Post-Hoc Memorization Mitigation

    Anmol Pandey · PDF
  10. Deployment-Time Memorization in Foundation-Model Agents

    Rachel Chen, Guilin Zhang, Kai Zhao, Xu Chu, Amine Anoun, Jerry Ting · PDF
  11. Detecting Functional Memorization in Code Language Models

    Matthieu Meeus, Anil Ramakrishna, Matthew Grange, Zheng Xu, Luca Melis · PDF
  12. Do Text Anonymizers Generalize Across Contexts? Extending RAT-Bench to Malaysian Microdata and PII

    David Hong Liang Chew, Zexi Yao, Nataša Krčo, Matthieu Meeus, Waqas Khalid Obeidy, Yves-Alexandre de Montjoye · PDF
  13. Estimating Model-Level Membership Inference Vulnerability Without Reference Models

    Euodia Dodd, Nataša Krčo, Igor Shilov, Matthew Robert Wicker, Yves-Alexandre de Montjoye · PDF
  14. Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

    A. Feder Cooper, Mark Lemley, Christopher De Sa, Lea Duesterwald, Allison Casasola, Jamie Hayes, Katherine Lee, Daniel E. Ho, Percy Liang · PDF
  15. Evidence-bearing Insights under Differential Privacy: Beyond the Limits of Private Text Generation

    Tsubasa Takahashi, Takumi Hiraoka · PDF
  16. Internal Data Repetition Destroys Language Models

    Jessica Chudnovsky, Joshua Kazdan, Noam Itzhak Levi, Rylan Schaeffer, Yegor Denisov-Blanch, Sanmi Koyejo, David L. Donoho · PDF
  17. KVEraser: Learning to Steer KV Cache for Efficient Localized Context Erasing

    Mufei Li, Shikun Liu, Dongqi Fu, Haoyu Peter Wang, Yinglong Xia, Hong Li, Hong Yan, Pan Li · PDF
  18. Local Coverage Governs Memorization in Diffusion Models

    Claudia Merger, Sebastian Goldt · PDF
  19. Machine Text Detectors are Membership Inference Attacks

    Ryuto Koike, Liam Dugan, Masahiro Kaneko, Chris Callison-Burch, Naoaki Okazaki · PDF
  20. MemBoost: A Memory-Boosted Framework for Cost-Aware LLM Inference

    Joris Köster, Zixuan Liu, Zizhan Zheng, Siavash H. Khajavi · PDF
  21. Memorization Dynamics of Fill-in-the-Middle Pretraining

    Tobias von Arx, Tanguy Dieudonné · PDF
  22. Memorization Removal as a Two-Player Game: The Adversarial Work Criterion as a Test for Foundation-Model Defenses

    Fryderyk Kuzma · PDF
  23. Memory Adapters Enable Fast, Flexible Knowledge Unlearning in LLMs

    Keltin Grimes, Kevin Kuo, Steven Wu, Virginia Smith, Marissa Catherine Connor · PDF
  24. Mitigating Unintended Memory Use in LLMs via Structured Memory

    Hakeem Hannoon, Andrew Zhao, Mihir Narayan, Sharvin Goyal, Ivaxi Sheth · PDF
  25. NumLeak: Public Numeric Benchmarks as Latent Label in Foundation Models

    Anany Kotawala · PDF
  26. On Optimization Complexity of Second-Order Certified Unlearning

    Nikita Doikov, Anastasia Koloskova · PDF
  27. On the Geometry of Memorization: Interpolation and Second-Order Representation Irregularity

    Satwik Bathula · PDF
  28. On the Learning Dynamics of Label-Noise Memorization in ReLU MLPs

    Yannis Kaltampanidis, Mykola Pechenizkiy, Hannah Pinson · PDF
  29. Position: The Term “Machine Unlearning” Is Overused in LLMs

    Sangyeon Yoon, Yeachan Jun, Albert No · PDF
  30. Probing Memorization of Tabular In-Context Learning

    Francesco Capano, Jonas Böhler · PDF
  31. Probing Policy-Level Memorization in Reasoning LLMs via Atomic Chess

    Ryan Co, Karthik Reddy Konuganti · PDF
  32. Prune to Protect: Faster Training and Enhanced Privacy by Dynamic Data Pruning

    Chinmay Joshi, Advait Gadhikar, Celia Rubio-Madrigal, Aneet Kumar Dutta, Mridula Singh, Rebekka Burkholz · PDF
  33. Rare, Distinctive, Memorized: Auditing Memorization in Fine-Tuned Medical Foundation Models

    Santhosh Parampottupadam, Sinem Sav, Dimitrios Bounias, Saikat Roy, Klaus Maier-Hein, Adam Dziedzic, Franziska Boenisch, Ralf Floca · PDF
  34. Reconstructing Training Images from Foundation Model Parameters in the Healthcare Domain: Privacy Risks and Defences

    Athanasios Panagiotis Glykos, Yannis Kaltampanidis, Mykola Pechenizkiy, Hannah Pinson · PDF
  35. Scale Dependent Data Duplication

    Joshua Kazdan, Noam Itzhak Levi, Rylan Schaeffer, Jessica Chudnovsky, Abhay Puri, Bo He, Mehmet Donmez, Sanmi Koyejo, David L. Donoho · PDF
  36. Semantic Gravity: When Parametric Memory Overpowers Visual Thermodynamics in Video-LLMs

    Vidya Ganesh, Sethuraman T V, Aylmer Britto Rex Harison, Sibi Anitha Ragunathan · PDF
  37. Structural Memorization in AlphaFold: Adversarial Mutations Reveal Template Reliance, Confidence Failures, and Implications for Protein Design

    Jonathan Feldman, Maximilian Brogi, Jeffrey Skolnick · PDF
  38. Suppression is not Deletion: Adversarial Probes Recover Unlearned Knowledge in Code LLMs

    Dhairyasheel Patil, Gustavo Sandoval · PDF
  39. SYMBOLICDRIFT: Measuring Reasoning Drift on Unverifiable Questions

    Weijie Xu, Xi Fang, Yingqiang Ge, Yuhui Xu, Scott Nickleach, Stephanie Eckman, Chandan K. Reddy · PDF
  40. Synthetic Data and the Rise of Spiky Intelligence

    Abitha Thankaraj, Amro Abbas, Dongyang Fan, Vineeth Dorna, Luke Merrick, David J. Schwab, Anshuman Suri, Aldo Gael Carranza, Alex Fang, Alvin Deng, Brett W. Larsen, Darren Teh, Diego Kiner, Fan Pan, Haakon Mongstad, Haoli Yin, Jack Urbanek, Jason Chan Lee, Jason Telanoff, Josh Wills, Katherine L. Mentzer, Maximilian Böther, Parth Doshi, Paul Burstein, Rishabh Adiga, Siddharth Joshi, Tony Jiang, Vidhi Jain, Zhengping Wang, Yonatan Bisk, Bogdan Gaza, Ari S. Morcos, Matthew L Leavitt, Pratyush Maini · PDF
  41. The Distillation Game: Adaptive Attacks & Efficient Defenses

    Youssef Allouah, Mahdi Haghifam, Sanmi Koyejo, Reza Shokri · PDF
  42. The Source of Competence Shapes Metacognition in Language Models

    Roi Cohen, Gerard de Melo · PDF
  43. Watermarking for Proprietary Dataset Protection

    John Kirchenbauer, Brian R. Bartoldson, Bhavya Kailkhura, Tom Goldstein · PDF
  44. What to Forget in Unlearning? Forget Set Curation for Language Models

    Animesh Jha, Arpandeep Khatua, Youssef Allouah, Sanmi Koyejo · PDF
  45. Why Forget-Only Unlearning Needs Memorization

    Luka Radić, Vikrant Singhal, Amartya Sanyal · PDF