NeurIPS 2025PastTabular & structured data

NeurIPS 2025 Workshop on Regulatable ML

RegML 2025

Official website ↗OpenReview venue ↗See all NeurIPS workshops →✎ Edit this entry

Submission deadline: Aug 30, 2025, 23:59 UTC
imported from OpenReview — check the website for extensions
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (53)

Fetched from OpenReview (v2) on 2026-06-10.

(When) Should We Delegate AI Governance to AIs? Some Lessons from Administrative Law
Nicholas A. Caputo · PDF
A Framework for the Categorisation of General-Purpose AI Models under the EU AI Act
Lorenzo Pacchiardi, John Burden, Fernando Martínez-Plumed, Jose Hernandez-Orallo, Emilia Gomez, David Fernández-Llorca · PDF
AgentCrypt: Advancing Privacy and (Secure) Computation in AI Agent Collaboration
Harish Karthikeyan, Yue Guo, Udari Madhushani Sehwag, Leo de Castro, Antigoni Polychroniadou, Leo Ardon, Sumitra Ganesh · PDF
AI, Climate, and Transparency: Operationalizing and Improving the AI Act
Nicolas Alder, Kai Ebert, Ralf Herbrich, Philipp Hacker · PDF
Anatomy of a Machine Learning Ecosystem: 2 Million Models on Hugging Face
Hamidah Oderinwale, Benjamin Laufer, Jon Kleinberg · PDF
Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs
Will Cai, Tianneng Shi, Xuandong Zhao, Dawn Song · PDF
Auditable AI Literacy Interventions: Embedding Regulatory Principles into Higher Education
Edisy Kin Wai Chan, Beatrice Yan-yan Dang · PDF
Beware! The AI Act Can Also Apply to Your AI Research Practices
Alina Wernick, Kristof Meding · PDF
Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety
Vamshi Krishna Bonagiri, Ponnurangam Kumaraguru, Khanh Xuan Nguyen, Benjamin Plaut · PDF
Cost Efficient Fairness Audit Under Partial Feedback
Nirjhar Das, Mohit Sharma, Praharsh Nanavati, Kirankumar Shiragur, Amit Deshpande · PDF
Data Forging Attacks on Cryptographic Model Certification
Carter Luck, Olive Franzese, Elisaweta Masserova, Akira Takahashi, Antigoni Polychroniadou · PDF
Debugging Concept Bottleneck Models through Removal and Retraining
Eric Enouen, sainyam galhotra · PDF
Deepfakes in Political Manipulation: Evaluating Risks Under the AI Act
Mst Rafia Islam, Azmine Toushik Wasi · PDF
Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddings
Pura Peetathawatchai, Wei-Ning Chen, Berivan Isik, Sanmi Koyejo, Albert No · PDF
Do AI Companies Make Good on Voluntary Commitments to the White House?
Jennifer Wang, Kayla Huang, Kevin Klyman, Rishi Bommasani · PDF
Emergency Response Measures for Catastrophic Risk
James Zhang, Miles Kodama, Zongze Wu, Michael Chen, Yue Zhu, Geng Hong · PDF
Empirical Evidence for Alignment Faking in a Small LLM and Prompt-Based Mitigation Techniques
Jeanice Koorndijk · PDF
ENCORE: Entropy-guided Reward Composition for Multi-head Safety Reward Models
Xiaomin Li, Xupeng Chen, Jingxuan Fan, Eric Hanchen Jiang, Mingye Gao · PDF
EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law
Ilija Lichkovski, Alexander Müller, Mariam Ibrahim, Tiwai Mhundwa · PDF
Examining the Vulnerability of Multi-Agent Medical Systems to Human Interventions for Clinical Reasoning
Benjamin Liu, Dillon Mehta, Rishi Malhotra, Adam Zobian, Yong Ying Tan, Samir Chopra, Daniella Rand, Natalie Pang, Abhiram Gudimella, Raghav Thallapragada, Derek Jiu, Kevin Zhu · PDF
Explanation-Driven Counterfactual Testing for Faithfulness in Vision-Language Model Explanations
Sihao Ding, Santosh Vasa, Aditi Ramadwar · PDF
From Proposals to Enactment: The Procedural Bottleneck in AI Safety Regulation
Mansur Ali Khan, Mehmet Efe Akengin, Ahmad A Rushdi · PDF
Harmful Information Management Practices in Frontier AI Development
Carson Ezell, Ben Bucknall · PDF
HashMark: Watermarking Tabular/Synthetic Data For Machine Learning Via Cryptographic Hash Functions
Harish Karthikeyan, Leo de Castro, Antigoni Polychroniadou · PDF
How Data-Related AI Research can Support Technical Solutions for Regulatory Compliance
Danilo Brajovic, David A. Kreplin, Marco Huber · PDF
How do data owners say no? A case study of data consent mechanisms in web-scraped vision-language AI training datasets
Chung Peng Lee, Rachel Hong, Harry H. Jiang, Aster Plotnik, William Agnew, Jamie Heather Morgenstern · PDF
Inducing Uncertainty on Open-Weight Models for Test-Time Privacy in Image Recognition
Muhammad H. Ashiq, Peter Triantafillou, Hung Yun Tseng, Grigorios Chrysos · PDF
Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
Xuansheng Wu, Jiayi Yuan, Wenlin Yao, Xiaoming Zhai, Ninghao Liu · PDF
It's complicated. The relationship of algorithmic fairness and non-discrimination regulations for high-risk systems in the EU AI Act
Kristof Meding · PDF
LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation
Huizhen Shu, xuying li, Zhuo Li · PDF
Local Differences, Global Lessons: Insights from Organisation Policies for Legislation
Lucie-Aimée Kaffee, Pepa Atanasova, Anna Rogers · PDF
MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction
Sepideh Abedini, Shubhankar Mohapatra, D. B. Emerson, Masoumeh Shafieinejad, Jesse C. Cresswell, Xi He · PDF
Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure
Timothy R. Dubber, Seth Lazar · PDF
On the Regulatory Potential of User Interfaces for AI Agent Governance
Kevin Feng, Tae Soo Kim, Rock Yuren Pang, Faria Huq, Tal August, Amy X Zhang · PDF
PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming
Wesley Deng, Sunnie S. Y. Kim, Akshita Jha, Ken Holstein, Motahhare Eslami, Lauren Wilcox, Leon Alexander Gatys · PDF
Perspective: Lessons from Cybersecurity for Biological AI Safety and Regulation
Azmine Toushik Wasi, Mst Rafia Islam · PDF
Policy-as-Prompt: Turning AI Governance Rules into Guardrails for AI Agents
Gauri Kholkar, Ratinder Paul Singh Ahuja · PDF
Position: Bridge the Gaps between Machine Unlearning and AI Regulation
Bill Marino, Meghdad Kurmanji, Nicholas D. Lane · PDF
Refining Inverse Constitutional AI for Dataset Validation under the EU AI Act
Carl-Leander Henneking, Claas Beger · PDF
Regulating the Agency of LLM-based Agents
Seán Boddy, Joshua Joseph · PDF
Scratchpad Thinking: Alternation Between Storage and Computation in Latent Reasoning Models
Sayam Goyal, Brad Peters, María Emilia Granda, Akshath Vijayakumar Narmadha, Dharunish Yugeswardeenoo, Callum Stuart McDougall, Sean O'Brien, Ashwinee Panda, Kevin Zhu, Cole Blondin · PDF
SemScore: Practical Explainable AI through Quantitative Methods to Measure Semantic Spuriosity
Jovin Leong, Wei May Chen, Tiong Kai Tan · PDF
SPEAR++: Scaling Gradient Inversion via Sparsely-Used Dictionary Learning
Alexander Bakarsky, Dimitar Iliev Dimitrov, Maximilian Baader, Martin Vechev · PDF
SpecEval: Evaluating Model Adherence to Behavior Specifications
Ahmed M Ahmed, Kevin Klyman, Yi Zeng, Sanmi Koyejo, Percy Liang · PDF
Specifying Computational Compliance for AI: Blueprint for a New Research Domain
Bill Marino, Nicholas D. Lane · PDF
Statutory Construction and Interpretation for Artificial Intelligence
Luxi He, Nimra Nadeem, Michel Liao, Howard Chen, Danqi Chen, Peter Henderson · PDF
StealthEval: A Probe-Rewrite-Evaluate Workflow for Reliable Benchmarks
Lang Xiong, Nishant Bhargava, Jeremy Chang, Jianhang Hong, Haihao Liu, Kevin Zhu · PDF
The Backfiring Effect of Weak AI Safety Regulation
Benjamin Laufer, Jon Kleinberg, Hoda Heidari · PDF
The Contribution of XAI for the Safe Development and Certification of AI: An Expert-Based Analysis
Benjamin Fresz, Vincent Philipp Göbels, Safa Omri, Danilo Brajovic, Andreas Aichele, Janika Kutz, Jens Neuhüttler, Marco Huber · PDF
The Hidden Cost of Modeling $P(X)$: Membership Inference Attacks in Generative Text Classifiers
Owais Makroo, Karan Gupta, Siva Rajesh Kasa, Sumegh Roychowdhury, Pattisapu Nikhil Priyatam, Santhosh Kumar Kasa, Sumit Negi · PDF
The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence
Matt White, Cailean Osborne, Xiao-Yang Liu, Keyi Wang, Sachin Mathew Varghese · PDF
The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models
Yang Xiao, Gen Li, Jie Ji, Ruimeng Ye, Xiaolong Ma, Bo Hui · PDF
ValueDCG: Framework for Investigating Human Value Understanding Ability of Language Models through Discriminator-Critique Gap
Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang · PDF

Accepted papers (53)

☆(When) Should We Delegate AI Governance to AIs? Some Lessons from Administrative Law

☆A Framework for the Categorisation of General-Purpose AI Models under the EU AI Act

☆AgentCrypt: Advancing Privacy and (Secure) Computation in AI Agent Collaboration

☆AI, Climate, and Transparency: Operationalizing and Improving the AI Act

☆Anatomy of a Machine Learning Ecosystem: 2 Million Models on Hugging Face

☆Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

☆Auditable AI Literacy Interventions: Embedding Regulatory Principles into Higher Education

☆Beware! The AI Act Can Also Apply to Your AI Research Practices

☆Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety

☆Cost Efficient Fairness Audit Under Partial Feedback

☆Data Forging Attacks on Cryptographic Model Certification

☆Debugging Concept Bottleneck Models through Removal and Retraining

☆Deepfakes in Political Manipulation: Evaluating Risks Under the AI Act

☆Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddings

☆Do AI Companies Make Good on Voluntary Commitments to the White House?

☆Emergency Response Measures for Catastrophic Risk

☆Empirical Evidence for Alignment Faking in a Small LLM and Prompt-Based Mitigation Techniques

☆ENCORE: Entropy-guided Reward Composition for Multi-head Safety Reward Models

☆EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law

☆Examining the Vulnerability of Multi-Agent Medical Systems to Human Interventions for Clinical Reasoning

☆Explanation-Driven Counterfactual Testing for Faithfulness in Vision-Language Model Explanations

☆From Proposals to Enactment: The Procedural Bottleneck in AI Safety Regulation

☆Harmful Information Management Practices in Frontier AI Development

☆HashMark: Watermarking Tabular/Synthetic Data For Machine Learning Via Cryptographic Hash Functions

☆How Data-Related AI Research can Support Technical Solutions for Regulatory Compliance

☆How do data owners say no? A case study of data consent mechanisms in web-scraped vision-language AI training datasets

☆Inducing Uncertainty on Open-Weight Models for Test-Time Privacy in Image Recognition

☆Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

☆It's complicated. The relationship of algorithmic fairness and non-discrimination regulations for high-risk systems in the EU AI Act

☆LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation

☆Local Differences, Global Lessons: Insights from Organisation Policies for Legislation

☆MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction

☆Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure

☆On the Regulatory Potential of User Interfaces for AI Agent Governance

☆PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming

☆Perspective: Lessons from Cybersecurity for Biological AI Safety and Regulation

☆Policy-as-Prompt: Turning AI Governance Rules into Guardrails for AI Agents

☆Position: Bridge the Gaps between Machine Unlearning and AI Regulation

☆Refining Inverse Constitutional AI for Dataset Validation under the EU AI Act

☆Regulating the Agency of LLM-based Agents

☆Scratchpad Thinking: Alternation Between Storage and Computation in Latent Reasoning Models

☆SemScore: Practical Explainable AI through Quantitative Methods to Measure Semantic Spuriosity

☆SPEAR++: Scaling Gradient Inversion via Sparsely-Used Dictionary Learning

☆SpecEval: Evaluating Model Adherence to Behavior Specifications

☆Specifying Computational Compliance for AI: Blueprint for a New Research Domain

☆Statutory Construction and Interpretation for Artificial Intelligence

☆StealthEval: A Probe-Rewrite-Evaluate Workflow for Reliable Benchmarks

☆The Backfiring Effect of Weak AI Safety Regulation

☆The Contribution of XAI for the Safe Development and Certification of AI: An Expert-Based Analysis

☆The Hidden Cost of Modeling $P(X)$: Membership Inference Attacks in Generative Text Classifiers

☆The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence

☆The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models

☆ValueDCG: Framework for Investigating Human Value Understanding Ability of Language Models through Discriminator-Critique Gap

(When) Should We Delegate AI Governance to AIs? Some Lessons from Administrative Law

A Framework for the Categorisation of General-Purpose AI Models under the EU AI Act

AgentCrypt: Advancing Privacy and (Secure) Computation in AI Agent Collaboration

AI, Climate, and Transparency: Operationalizing and Improving the AI Act

Anatomy of a Machine Learning Ecosystem: 2 Million Models on Hugging Face

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Auditable AI Literacy Interventions: Embedding Regulatory Principles into Higher Education

Beware! The AI Act Can Also Apply to Your AI Research Practices

Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety

Cost Efficient Fairness Audit Under Partial Feedback

Data Forging Attacks on Cryptographic Model Certification

Debugging Concept Bottleneck Models through Removal and Retraining

Deepfakes in Political Manipulation: Evaluating Risks Under the AI Act

Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddings

Do AI Companies Make Good on Voluntary Commitments to the White House?

Emergency Response Measures for Catastrophic Risk

Empirical Evidence for Alignment Faking in a Small LLM and Prompt-Based Mitigation Techniques

ENCORE: Entropy-guided Reward Composition for Multi-head Safety Reward Models

EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law

Examining the Vulnerability of Multi-Agent Medical Systems to Human Interventions for Clinical Reasoning

Explanation-Driven Counterfactual Testing for Faithfulness in Vision-Language Model Explanations

From Proposals to Enactment: The Procedural Bottleneck in AI Safety Regulation

Harmful Information Management Practices in Frontier AI Development

HashMark: Watermarking Tabular/Synthetic Data For Machine Learning Via Cryptographic Hash Functions

How Data-Related AI Research can Support Technical Solutions for Regulatory Compliance

How do data owners say no? A case study of data consent mechanisms in web-scraped vision-language AI training datasets

Inducing Uncertainty on Open-Weight Models for Test-Time Privacy in Image Recognition

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

It's complicated. The relationship of algorithmic fairness and non-discrimination regulations for high-risk systems in the EU AI Act

LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation

Local Differences, Global Lessons: Insights from Organisation Policies for Legislation

MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction

Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure

On the Regulatory Potential of User Interfaces for AI Agent Governance

PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming

Perspective: Lessons from Cybersecurity for Biological AI Safety and Regulation

Policy-as-Prompt: Turning AI Governance Rules into Guardrails for AI Agents

Position: Bridge the Gaps between Machine Unlearning and AI Regulation

Refining Inverse Constitutional AI for Dataset Validation under the EU AI Act

Regulating the Agency of LLM-based Agents

Scratchpad Thinking: Alternation Between Storage and Computation in Latent Reasoning Models

SemScore: Practical Explainable AI through Quantitative Methods to Measure Semantic Spuriosity

SPEAR++: Scaling Gradient Inversion via Sparsely-Used Dictionary Learning

SpecEval: Evaluating Model Adherence to Behavior Specifications

Specifying Computational Compliance for AI: Blueprint for a New Research Domain

Statutory Construction and Interpretation for Artificial Intelligence

StealthEval: A Probe-Rewrite-Evaluate Workflow for Reliable Benchmarks

The Backfiring Effect of Weak AI Safety Regulation

The Contribution of XAI for the Safe Development and Certification of AI: An Expert-Based Analysis

The Hidden Cost of Modeling $P(X)$: Membership Inference Attacks in Generative Text Classifiers

The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence

The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models

ValueDCG: Framework for Investigating Human Value Understanding Ability of Language Models through Discriminator-Critique Gap