ICLR 2025 Past Other

ICLR 2025 Workshop: VerifAI: AI Verification in the Wild

ICLR 2025 Workshop VerifAI

Submission deadline
Feb 8, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (32)

Fetched from OpenReview (v2) on 2026-06-10.

  1. ABSINT-AI: Language Models for Abstract Interpretation

    Michael Wang, Kexin Pei, Armando Solar-Lezama · PDF
  2. AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

    Pranjal Aggarwal, Bryan Parno, Sean Welleck · PDF
  3. CAPM: Fast and Robust Verification on Maxpool-based CNN via Dual Network

    Jia-Hau Bai, Chi-Ting Liu, Yu Wang, Fu-Chieh Chang, Pei-Yuan Wu · PDF
  4. CRANE: Reasoning with constrained LLM generation

    Debangshu Banerjee, Tarun Suresh, Shubham Ugare, Sasa Misailovic, Gagandeep Singh · PDF
  5. Exact Certification of (Graph) Neural Networks Against Label Poisoning

    Mahalakshmi Sabanayagam, Lukas Gosch, Stephan Günnemann, Debarghya Ghoshdastidar · PDF
  6. Guided Proof Search Using Large Language Models and Lemma Extraction in Coq

    Tarun Prasad, Nada Amin · PDF
  7. LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction

    Suozhi Huang, Peiyang Song, Robert Joseph George, Anima Anandkumar · PDF
  8. Learning Automata from Demonstrations, Examples, and Natural Language

    Marcell Vazquez-Chanlatte, Karim Elmaaroufi, Stefan Witwicki, Matei Zaharia, Sanjit A. Seshia · PDF
  9. Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

    Qian Meng, Jin Peng Zhou, Kilian Q Weinberger, Hadas Kress-Gazit · PDF
  10. Lightweight Latent Verifiers for Efficient Meta-Generation Strategies

    Bartosz Piotrowski, Witold Drzewakowski, Konrad Staniszewski, Piotr Miłoś · PDF
  11. LipShiFT: A Certifiably Robust Shift-based Vision Transformer

    Rohan Menon, Nicola Franco, Stephan Günnemann · PDF
  12. LLMV-AgE: Verifying LLM-Guided Planning for Agentic Exploration in Open-World RL

    Haotian Chi, Songwei Zhao, Ivor Tsang, Yew-Soon Ong, Hechang Chen, Yi Chang, Haiyan Yin · PDF
  13. MathConstruct: Challenging LLM Reasoning with Constructive Proofs

    Jasper Dekoninck, Mislav Balunovic, Nikola Jovanović, Ivo Petrov, Martin Vechev · PDF
  14. Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers (Abridged)

    Shalev Lifshitz, Sheila A. McIlraith, Yilun Du · PDF
  15. Multi-Turn Code Generation Through Single-Step Rewards

    Arnav Kumar Jain, Gonzalo Gonzalez-Pumariega, Wayne Chen, Alexander M Rush, Wenting Zhao, Sanjiban Choudhury · PDF
  16. Neural Abstract Interpretation

    Shaurya Gomber, Gagandeep Singh · PDF
  17. NO STRESS NO GAIN: STRESS TESTING BASED SELF-CONSISTENCY FOR OLYMPIAD PROGRAMMING

    Kunal Singh, Sayandeep Bhowmick, Pradeep Moturi, Siva Kishore Gollapalli · PDF
  18. On the Query Complexity of Verifier-Assisted Language Generation

    Edoardo Botta, Yuchen Li, Aashay Mehta, Jordan T. Ash, Cyril Zhang, Andrej Risteski · PDF
  19. Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs

    Kavi Gupta, Kate Sanders, Armando Solar-Lezama · PDF
  20. ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration

    Minghang Deng, Ashwin Ramachandran, Canwen Xu, Lanxiang Hu, Zhewei Yao, Anupam Datta, Hao Zhang · PDF
  21. Reinforcement Learning with LTL and $\omega$-Regular Objectives via Optimality-Preserving Translation to Average Rewards

    Xuan-Bach Le, Dominik Wagner, Leon Witzman, Alexander Rabinovich, Luke Ong · PDF
  22. Scaling Randomized Smoothing to state-of-the-art Vision-Language Models

    Emmanouil Seferis · PDF
  23. Scaling Test-Time Compute Without Verification or RL is Suboptimal

    Amrith Setlur, Nived Rajaraman, Sergey Levine, Aviral Kumar · PDF
  24. Self-Steering Language Models

    Gabriel Grand, Joshua B. Tenenbaum, Vikash Mansinghka, Alexander K. Lew, Jacob Andreas · PDF
  25. Synthesis and Verification of String Stable Control for Interconnected Systems via Neural sISS Certificate

    Jingyuan Zhou, Haoze Wu, Longhao Yan, Kaidi Yang · PDF
  26. Tasks, Challenges, and Paths Towards AI for Software Engineering

    Alex Gu, Naman Jain, Wen-Ding Li, Manish Shetty, Yijia Shao, Ziyang Li, Diyi Yang, Koushik Sen, Kevin Ellis, Armando Solar-Lezama · PDF
  27. Temporal Consistency for LLM Reasoning Process Error Identification

    Jiacheng Guo, Yue Wu, Jiahao Qiu, Kaixuan Huang, Xinzhe Juan, Ling Yang, Mengdi Wang · PDF
  28. Toward Trustworthy Neural Program Synthesis

    Wen-Ding Li, Darren Yan Key, Kevin Ellis · PDF
  29. Training and Verifying robust Kolmogorov-Arnold Networks

    Björn Heiderich, Max-Lion Schumacher, Marco Huber · PDF
  30. Type-Constrained Code Generation with Language Models

    Niels Mündler, Jingxuan He, Hao Wang, Koushik Sen, Dawn Song, Martin Vechev · PDF
  31. Using GPUs And LLMs Can Be Satisfying for Nonlinear Real Arithmetic Problems

    Christopher Brix, Julia Walczak, Nils Lommen, Thomas Noll · PDF
  32. Verifying Omega-regular Properties of Neural Network-Controlled Systems via Proof Certificates

    Peixin Wang, Jianhao Bai, Dapeng Zhi, Min Zhang, Luke Ong · PDF