ICML 2025 Past Fairness & ethics

ICML Workshop on Technical AI Governance (TAIG)

ICML 2025 Workshop TAIG

Submission deadline
May 13, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (45)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Blueprint for a Secure EU AI Audit Ecosystem

    Alejandro Tlaie · PDF
  2. A Conceptual Framework for AI Capability Evaluations

    María Victoria Carro, Denise Alejandra Mester, Francisca Gauna Selasco, Luca Nicolás Forziati Gangi, Matheo Sandleris Musa, Lola Ramos Pereyra, Mario Leiva, Juan Gustavo Corvalan, Maria Vanina Martinez, Gerardo Simari · PDF
  3. A Taxonomy for Design and Evaluation of Prompt-Based Natural Language Explanations

    Isar Nejadgholi, Mona Omidyeganeh, Marc-Antoine Drouin, Jonathan Boisvert · PDF
  4. Acceleration potential in the GPU design-to-manufacturing pipeline

    Maximilian Negele · PDF
  5. Access Controls Will Solve the Dual-Use Dilemma

    Evžen Wybitul · PDF
  6. AI Benchmarks: Interdisciplinary Issues and Policy Considerations

    Maria Eriksson, Erasmo Purificato, Arman Noroozian, João Vinagre, Guillaume Chaslot, Emilia Gomez, David Fernández-Llorca · PDF
  7. Attestable Audits: Verifiable AI Safety Benchmarks Using Trusted Execution Environments

    Christoph Schnabl, Daniel Hugenroth, Bill Marino, Alastair R. Beresford · PDF
  8. CALMA: Context‑Aligned Axes for Language Model Alignment

    Prajna Soni, Deepika Raman, Dylan Hadfield-Menell · PDF
  9. Compute Requirements for Algorithmic Innovation in Frontier AI Models

    Peter Barnett · PDF
  10. Deprecating Benchmarks: Criteria and Framework

    Ayrton San Joaquin, Rokas Gipiškis, Leon Staufer, Ariel Gil · PDF
  11. Detecting Compute Structuring in AI Governance is likely feasible

    Emmanouil Seferis, Timothy Fist · PDF
  12. Distributed and Decentralised Training: Technical Governance Challenges in a Shifting AI Landscape

    Jakub Kryś, Yashvardhan Sharma, Janet Egan · PDF
  13. Evaluating LLM Agent Adherence to Hierarchical Principles: A Lightweight Benchmark for Verifying AI Safety Plan Components

    Ram Potham · PDF
  14. Expert Survey: Technical AI Safety & Security Research Priorities

    Joe O'Brien, Jeremy Dolan, Jeba Sania, Jay Kim, Rocio Cara Labrador, Jonah Dykhuizen, Sebastian Becker, Jam Kraprayoon · PDF
  15. Exploring an Agenda on Memorization-based Copyright Verification

    Harry H. Jiang, Aster Plotnik, Carlee Joe-Wong · PDF
  16. Exploring Functional Similarities of Backdoored Models

    Yufan Feng, Benjamin Tan, Yani Ioannou · PDF
  17. ExpProof : Operationalizing Explanations for Confidential Models with ZKPs

    Chhavi Yadav, Evan Laufer, Dan Boneh, Kamalika Chaudhuri · PDF
  18. Fallacies of Data Transparency: Rethinking Nutrition Facts for AI

    Judy Hanwen Shen, Ken Liu, Angelina Wang, Sarah H. Cen, Andy K Zhang, Caroline Meinhardt, Daniel Zhang, Kevin Klyman, Rishi Bommasani, Daniel E. Ho · PDF
  19. Fragile by Design: Formalizing Watermarking Tradeoffs via Paraphrasing

    Ali Falahati, Lukasz Golab · PDF
  20. From Individual Experience to Collective Evidence: A Reporting-Based Framework for Identifying Systemic Harms

    Jessica Dai, Paula Gradu, Inioluwa Deborah Raji, Benjamin Recht · PDF
  21. Guaranteeable Memory: An HBM-Based Chiplet for Verifiable AI Workloads

    James Petrie · PDF
  22. Hardware-Enabled Mechanisms for Verifying Responsible AI Development

    Aidan O'Gara, Gabriel Kulp, Will Hodgkins, James Petrie, Vincent Immler, Aydin Aysu, Kanad Basu, Shivam Bhasin, Stjepan Picek, Ankur Srivastava · PDF
  23. In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

    Shayne Longpre, Kevin Klyman, Ruth E. Appel, Sayash Kapoor, Rishi Bommasani, Michelle Sahar, Sean McGregor, Avijit Ghosh, Borhane Blili-Hamelin, Nathan Butters, Alondra Nelson, Dr. Amit Elazari, Andrew Sellars, Casey John Ellis, Dane Sherrets, Dawn Song, Harley Geiger, Ilona Cohen, Lauren McIlvenny, Madhulika Srikumar, Mark M. Jaycox, Markus Anderljung, Nadine Farid Johnson, Nicholas Carlini, Nicolas Miailhe, Nik Marda, Peter Henderson, Rebecca S. Portnoff, Rebecca Weiss, Victoria Westerhoff, Yacine Jernite, Rumman Chowdhury, Percy Liang, Arvind Narayanan · PDF
  24. LibVulnWatch: A Deep Assessment Agent System and Leaderboard for Uncovering Hidden Vulnerabilities in Open-Source AI Libraries

    Zekun Wu, Seonglae Cho, Umar Mohammed, CRISTIAN ENRIQUE MUNOZ VILLALOBOS, Kleyton Da Costa, Xin Guan, Theo King, Ze Wang, Emre Kazim, Adriano Koshiyama · PDF
  25. LLMs Can Covertly Sandbag On Capability Evaluations Against Chain-of-Thought Monitoring

    Chloe Li, Mary Phuong, Noah Y. Siegel · PDF
  26. Locking Open Weight Models with Spectral Deformation

    Domenic Rosati, Sebastian Dionicio, Xijie Zeng, Subhabrata Majumdar, Frank Rudzicz, Hassan Sajjad · PDF
  27. Marginal Risk Relative to What? Distinguishing Baselines in AI Risk Management

    Jide Alaga, Michael Chen · PDF
  28. Measuring What Matters: A Framework for Evaluating Safety Risks in Real-World LLM Applications

    Jia Yi Goh, Shaun Khoo, Nyx Iskandar, Gabriel Chua, Leanne Tan, Jessica Foo · PDF
  29. Meek Models Shall Inherit The Earth

    Hans Gundlach, Jayson Lynch, Neil Thompson · PDF
  30. Methodological Challenges in Agentic Evaluations of AI Systems

    Kevin Wei, Stephen Guth, Gabriel Wu, Patricia Paskov · PDF
  31. Position: Formal Methods are the Principled Foundation of Safe AI

    Gagandeep Singh, Deepika Chawla · PDF
  32. Position: Generative AI Regulation Can Learn from Social Media Regulation

    Ruth Elisabeth Appel · PDF
  33. Practical Principles for AI Cost and Compute Accounting

    Stephen Casper, Luke Bailey, Tim Schreier · PDF
  34. Probing Evaluation Awareness of Language Models

    Jord Nguyen, Hoang Huu Khiem, Carlo Leonardo Attubato, Felix Hofstätter · PDF
  35. Proofs of Autonomy: Scalable and Practical Verification of AI Autonomy

    Artem Grigor, Christian Schroeder de Witt, Ivan Martinovic · PDF
  36. Relative Bias: A Comparative Approach for Quantifying Bias in LLMs

    Alireza Arbabi, Florian Kerschbaum · PDF
  37. Reproducibility: The New Frontier in AI Governance

    Israel Mason-Williams, Gabryel Mason-Williams · PDF
  38. Robust ML Auditing using Prior Knowledge

    Jade Garcia Bourrée, Augustin Godinot, Martijn De Vos, Milos Vujasinovic, Sayan Biswas, Gilles Tredan, Erwan Le Merrer, Anne-Marie Kermarrec · PDF
  39. Scaling Limits to AI Chip Production

    Maximilian Negele, Lennart Heim, Peter Ruschhaupt · PDF
  40. Societal Capacity Assessment Framework: Measuring Advanced AI Implications for Vulnerability, Resilience, and Transformation

    Milan M. Gandhi, Peter Cihon, Owen C. Larter, Rebecca Anselmetti · PDF
  41. Technical Requirements for Halting Dangerous AI Activities

    Peter Barnett, Aaron Scher, David Abecassis · PDF
  42. The Strong, weak and benign Goodhart's law. An independence-free and paradigm-agnostic formalisation

    Adrien Majka, El-Mahdi El-Mhamdi · PDF
  43. Trends in AI Supercomputers

    Konstantin Friedemann Pilz, James Sanders, Robi Rahman, Lennart Heim · PDF
  44. Trends in Frontier AI Model Count: A Forecast to 2028

    Iyngkarran Kumar, Sam Manning · PDF
  45. Watermarking Without Standards Is Not AI Governance

    Alexander Nemecek, Yuzhou Jiang, Erman Ayday · PDF