ICLR 2025PastOther

ICLR 2025 Workshop: VerifAI: AI Verification in the Wild

ICLR 2025 Workshop VerifAI

Official website ↗OpenReview venue ↗See all ICLR workshops →✎ Edit this entry

Submission deadline: Feb 8, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (32)

Fetched from OpenReview (v2) on 2026-06-10.

ABSINT-AI: Language Models for Abstract Interpretation
Michael Wang, Kexin Pei, Armando Solar-Lezama · PDF
AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement
Pranjal Aggarwal, Bryan Parno, Sean Welleck · PDF
CAPM: Fast and Robust Verification on Maxpool-based CNN via Dual Network
Jia-Hau Bai, Chi-Ting Liu, Yu Wang, Fu-Chieh Chang, Pei-Yuan Wu · PDF
CRANE: Reasoning with constrained LLM generation
Debangshu Banerjee, Tarun Suresh, Shubham Ugare, Sasa Misailovic, Gagandeep Singh · PDF
Exact Certification of (Graph) Neural Networks Against Label Poisoning
Mahalakshmi Sabanayagam, Lukas Gosch, Stephan Günnemann, Debarghya Ghoshdastidar · PDF
Guided Proof Search Using Large Language Models and Lemma Extraction in Coq
Tarun Prasad, Nada Amin · PDF
LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction
Suozhi Huang, Peiyang Song, Robert Joseph George, Anima Anandkumar · PDF
Learning Automata from Demonstrations, Examples, and Natural Language
Marcell Vazquez-Chanlatte, Karim Elmaaroufi, Stefan Witwicki, Matei Zaharia, Sanjit A. Seshia · PDF
Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations
Qian Meng, Jin Peng Zhou, Kilian Q Weinberger, Hadas Kress-Gazit · PDF
Lightweight Latent Verifiers for Efficient Meta-Generation Strategies
Bartosz Piotrowski, Witold Drzewakowski, Konrad Staniszewski, Piotr Miłoś · PDF
LipShiFT: A Certifiably Robust Shift-based Vision Transformer
Rohan Menon, Nicola Franco, Stephan Günnemann · PDF
LLMV-AgE: Verifying LLM-Guided Planning for Agentic Exploration in Open-World RL
Haotian Chi, Songwei Zhao, Ivor Tsang, Yew-Soon Ong, Hechang Chen, Yi Chang, Haiyan Yin · PDF
MathConstruct: Challenging LLM Reasoning with Constructive Proofs
Jasper Dekoninck, Mislav Balunovic, Nikola Jovanović, Ivo Petrov, Martin Vechev · PDF
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers (Abridged)
Shalev Lifshitz, Sheila A. McIlraith, Yilun Du · PDF
Multi-Turn Code Generation Through Single-Step Rewards
Arnav Kumar Jain, Gonzalo Gonzalez-Pumariega, Wayne Chen, Alexander M Rush, Wenting Zhao, Sanjiban Choudhury · PDF
Neural Abstract Interpretation
Shaurya Gomber, Gagandeep Singh · PDF
NO STRESS NO GAIN: STRESS TESTING BASED SELF-CONSISTENCY FOR OLYMPIAD PROGRAMMING
Kunal Singh, Sayandeep Bhowmick, Pradeep Moturi, Siva Kishore Gollapalli · PDF
On the Query Complexity of Verifier-Assisted Language Generation
Edoardo Botta, Yuchen Li, Aashay Mehta, Jordan T. Ash, Cyril Zhang, Andrej Risteski · PDF
Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs
Kavi Gupta, Kate Sanders, Armando Solar-Lezama · PDF
ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration
Minghang Deng, Ashwin Ramachandran, Canwen Xu, Lanxiang Hu, Zhewei Yao, Anupam Datta, Hao Zhang · PDF
Reinforcement Learning with LTL and $\omega$-Regular Objectives via Optimality-Preserving Translation to Average Rewards
Xuan-Bach Le, Dominik Wagner, Leon Witzman, Alexander Rabinovich, Luke Ong · PDF
Scaling Randomized Smoothing to state-of-the-art Vision-Language Models
Emmanouil Seferis · PDF
Scaling Test-Time Compute Without Verification or RL is Suboptimal
Amrith Setlur, Nived Rajaraman, Sergey Levine, Aviral Kumar · PDF
Self-Steering Language Models
Gabriel Grand, Joshua B. Tenenbaum, Vikash Mansinghka, Alexander K. Lew, Jacob Andreas · PDF
Synthesis and Verification of String Stable Control for Interconnected Systems via Neural sISS Certificate
Jingyuan Zhou, Haoze Wu, Longhao Yan, Kaidi Yang · PDF
Tasks, Challenges, and Paths Towards AI for Software Engineering
Alex Gu, Naman Jain, Wen-Ding Li, Manish Shetty, Yijia Shao, Ziyang Li, Diyi Yang, Koushik Sen, Kevin Ellis, Armando Solar-Lezama · PDF
Temporal Consistency for LLM Reasoning Process Error Identification
Jiacheng Guo, Yue Wu, Jiahao Qiu, Kaixuan Huang, Xinzhe Juan, Ling Yang, Mengdi Wang · PDF
Toward Trustworthy Neural Program Synthesis
Wen-Ding Li, Darren Yan Key, Kevin Ellis · PDF
Training and Verifying robust Kolmogorov-Arnold Networks
Björn Heiderich, Max-Lion Schumacher, Marco Huber · PDF
Type-Constrained Code Generation with Language Models
Niels Mündler, Jingxuan He, Hao Wang, Koushik Sen, Dawn Song, Martin Vechev · PDF
Using GPUs And LLMs Can Be Satisfying for Nonlinear Real Arithmetic Problems
Christopher Brix, Julia Walczak, Nils Lommen, Thomas Noll · PDF
Verifying Omega-regular Properties of Neural Network-Controlled Systems via Proof Certificates
Peixin Wang, Jianhao Bai, Dapeng Zhi, Min Zhang, Luke Ong · PDF

Accepted papers (32)

☆ABSINT-AI: Language Models for Abstract Interpretation

☆AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

☆CAPM: Fast and Robust Verification on Maxpool-based CNN via Dual Network

☆CRANE: Reasoning with constrained LLM generation

☆Exact Certification of (Graph) Neural Networks Against Label Poisoning

☆Guided Proof Search Using Large Language Models and Lemma Extraction in Coq

☆LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction

☆Learning Automata from Demonstrations, Examples, and Natural Language

☆Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

☆Lightweight Latent Verifiers for Efficient Meta-Generation Strategies

☆LipShiFT: A Certifiably Robust Shift-based Vision Transformer

☆LLMV-AgE: Verifying LLM-Guided Planning for Agentic Exploration in Open-World RL

☆MathConstruct: Challenging LLM Reasoning with Constructive Proofs

☆Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers (Abridged)

☆Multi-Turn Code Generation Through Single-Step Rewards

☆Neural Abstract Interpretation

☆NO STRESS NO GAIN: STRESS TESTING BASED SELF-CONSISTENCY FOR OLYMPIAD PROGRAMMING

☆On the Query Complexity of Verifier-Assisted Language Generation

☆Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs

☆ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration

☆Reinforcement Learning with LTL and $\omega$-Regular Objectives via Optimality-Preserving Translation to Average Rewards

☆Scaling Randomized Smoothing to state-of-the-art Vision-Language Models

☆Scaling Test-Time Compute Without Verification or RL is Suboptimal

☆Self-Steering Language Models

☆Synthesis and Verification of String Stable Control for Interconnected Systems via Neural sISS Certificate

☆Tasks, Challenges, and Paths Towards AI for Software Engineering

☆Temporal Consistency for LLM Reasoning Process Error Identification

☆Toward Trustworthy Neural Program Synthesis

☆Training and Verifying robust Kolmogorov-Arnold Networks

☆Type-Constrained Code Generation with Language Models

☆Using GPUs And LLMs Can Be Satisfying for Nonlinear Real Arithmetic Problems

☆Verifying Omega-regular Properties of Neural Network-Controlled Systems via Proof Certificates

ABSINT-AI: Language Models for Abstract Interpretation

AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

CAPM: Fast and Robust Verification on Maxpool-based CNN via Dual Network

CRANE: Reasoning with constrained LLM generation

Exact Certification of (Graph) Neural Networks Against Label Poisoning

Guided Proof Search Using Large Language Models and Lemma Extraction in Coq

LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction

Learning Automata from Demonstrations, Examples, and Natural Language

Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

Lightweight Latent Verifiers for Efficient Meta-Generation Strategies

LipShiFT: A Certifiably Robust Shift-based Vision Transformer

LLMV-AgE: Verifying LLM-Guided Planning for Agentic Exploration in Open-World RL

MathConstruct: Challenging LLM Reasoning with Constructive Proofs

Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers (Abridged)

Multi-Turn Code Generation Through Single-Step Rewards

Neural Abstract Interpretation

NO STRESS NO GAIN: STRESS TESTING BASED SELF-CONSISTENCY FOR OLYMPIAD PROGRAMMING

On the Query Complexity of Verifier-Assisted Language Generation

Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs

ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration

Reinforcement Learning with LTL and $\omega$-Regular Objectives via Optimality-Preserving Translation to Average Rewards

Scaling Randomized Smoothing to state-of-the-art Vision-Language Models

Scaling Test-Time Compute Without Verification or RL is Suboptimal

Self-Steering Language Models

Synthesis and Verification of String Stable Control for Interconnected Systems via Neural sISS Certificate

Tasks, Challenges, and Paths Towards AI for Software Engineering

Temporal Consistency for LLM Reasoning Process Error Identification

Toward Trustworthy Neural Program Synthesis

Training and Verifying robust Kolmogorov-Arnold Networks

Type-Constrained Code Generation with Language Models

Using GPUs And LLMs Can Be Satisfying for Nonlinear Real Arithmetic Problems

Verifying Omega-regular Properties of Neural Network-Controlled Systems via Proof Certificates