ICML 2025 Past Other

The Exploration in AI Today Workshop at ICML 2025

EXAIT@ICML 2025

Submission deadline
Jun 1, 2025, 12:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (51)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Diffusion Model to Shrink Proteins While Maintaining their Function

    Ethan Baron, Alan Nawzad Amin, Ruben Weitzman, Debora Susan Marks, Andrew Gordon Wilson · PDF
  2. Active Advantage-Aligned Online Reinforcement Learning with Offline Data

    Xuefeng Liu, Hung T. C. Le, Siyu Chen, Rick Stevens, Zhuoran Yang, Matthew Walter, Yuxin Chen · PDF
  3. Align While Search: Belief-Guided Exploratory Inference for Test-Time World Alignment

    Seohui Bae, Jeonghye Kim, Youngchul Sung, Woohyung Lim · PDF
  4. Automated Data Selection for Efficient Cost Model Training to Optimize Sparse Matrix Kernels on Emerging Hardware Accelerators

    Chamika Sudusinghe, Gerasimos Gerogiannis, Damitha Lenadora, Charles Block, Josep Torrellas, Charith Mendis · PDF
  5. Blindfolded Experts Generalize Better: Insights from Robotic Manipulation and Videogames

    Ev Zisselman, Mirco Mutti, Shelly Francis-Meretzki, Elisei Shafer, Aviv Tamar · PDF
  6. Branched Schrödinger Bridge Matching

    Sophia Tang, Yinuo Zhang, Alexander Tong, Pranam Chatterjee · PDF
  7. Central Path Proximal Policy Optimization

    Nikola Milosevic, Johannes Müller, Nico Scherf · PDF
  8. Diffusion-Based Maximum Entropy Reinforcement Learning

    Onur Celik, Zechu Li, Denis Blessing, Ge Li, Daniel Palenicek, Jan Peters, Georgia Chalvatzaki, Gerhard Neumann · PDF
  9. Direct Regret Optimization in Bayesian Optimization

    Fengxue Zhang, Yuxin Chen · PDF
  10. DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

    Leander Diaz-Bone, Marco Bagatella, Jonas Hübotter, Andreas Krause · PDF
  11. Distances for Markov chains from sample streams

    Sergio Calo, Anders Jonsson, Gergely Neu, Ludovic Schwartz, Javier Segovia-Aguas · PDF
  12. Diversity By Design: Leveraging Distribution Matching for Offline Model-Based Optimization

    Michael S Yao, James Gee, Osbert Bastani · PDF
  13. e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs

    Amrith Setlur, Matthew Y. R. Yang, Charlie Victor Snell, Jeremiah Greer, Ian Wu, Virginia Smith, Max Simchowitz, Aviral Kumar · PDF
  14. EVOLvE: Evaluating and Optimizing LLMs ForIn-Context Exploration

    Allen Nie, Yi Su, Bo Chang, Jonathan Lee, Ed H. Chi, Quoc V Le, Minmin Chen · PDF
  15. Exploration by Exploitation: Curriculum Learning for Reinforcement Learning Agents through Competence-Based Curriculum Policy Search

    Tabitha Edith Lee, Nan Rosemary Ke, Sarvesh Patil, Annya Dahmani, Eunice Yiu, Esra'a Saleh, Alison Gopnik, Oliver Kroemer, Glen Berseth · PDF
  16. Fleet of Agents: Coordinated Problem Solving with Large Language Models

    Lars Henning Klein, Nearchos Potamitis, Roland Aydin, Robert West, Caglar Gulcehre, Akhil Arora · PDF
  17. Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning

    Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh, Zebang Shen, Niao He, Andreas Krause · PDF
  18. From Words to Rewards: Leveraging Natural Language for Reinforcement Learning

    Belen Martin Urcelay, Andreas Krause, Giorgia Ramponi · PDF
  19. G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning

    Xiaojun Guo, Ang Li, Yifei Wang, Stefanie Jegelka, Yisen Wang · PDF
  20. Greed is Good: A Unifying Perspective on Guided Generation

    Zander W. Blasingame, Chen Liu · PDF
  21. Improved Exploration in GFlownets via Enhanced Epistemic Neural Networks

    Sajan Muhammad, Salem Lahlou · PDF
  22. Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM

    Thang Duong, Minglai Yang, Chicheng Zhang · PDF
  23. In-Context Learning for Pure Exploration

    Alessio Russo, Ryan Welch, Aldo Pacchiano · PDF
  24. Instance-Dependent Fixed-Budget Pure Exploration in Reinforcement Learning

    Yeongjong Kim, Yeoneung Kim, Kwang-Sung Jun · PDF
  25. Intent Factored Generation: Unleashing the Diversity in Your Language Model

    Eltayeb Ahmed, Uljad Berdica, Martha Elliott, Danijela Horak, Jakob Nicolaus Foerster · PDF
  26. Intrinsic Benefits of Categorical Distributional Loss: Uncertainty-aware Exploration in Reinforcement Learning towards Higher Moment Regularisations

    Ke Sun, Yingnan Zhao, Enze Shi, Yafei Wang, Xiaodong Yan, Bei Jiang, Linglong Kong · PDF
  27. Kevin: Multi-Turn RL for Generating CUDA Kernels

    Carlo Baronio, Pietro Marsella, Ben Pan, Simon Guo, Silas Alberti · PDF
  28. Llama-Nemotron: Efficient Reasoning Models

    Soumye Singhal, Jiaqi Zeng, Alexander Bukharin, Yian Zhang, Gerald Shen, Ameya Sunil Mahabaleshwarkar, Bilal Kartal, Yoshi Suhara, Akhiad Bercovich, Itay Levy, Izik Golan, Mohammed Dabbah, Ran El-Yaniv, Somshubra Majumdar, Igor Gitman, Evelina Bakhturina, Jimmy J. Zhang, Bor-Yiing Su, Guyue Huang, Izzy Putterman, Mostofa Patwary, Oluwatobi Olabiyi, Olivier Delalleau, Bryan Catanzaro, Boris Ginsburg, Oleksii Kuchaiev, Tugrul Konuk · PDF
  29. LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

    Thomas Schmied, Jörg Bornschein, Jordi Grau-Moya, Markus Wulfmeier, Razvan Pascanu · PDF
  30. No-Regret Safety: Balancing Tests and Misclassification in Logistic Bandits

    Tavor Baharav, Spyros Dragazis, Aldo Pacchiano · PDF
  31. Oracle-Efficient Adversarial Reinforcement Learning via Max-Following

    Sikata Bela Sengupta, Zakaria Mhammedi, Teodor Vanislavov Marinov · PDF
  32. Prompts Generalize with Low Data: Non-vacuous Generalization Bounds for Optimizing Prompts with More Informative Priors

    Qiuyi Zhang, David Madras, Joshua Safyan · PDF
  33. Provably Learning from Language Feedback

    Wanqiao Xu, Allen Nie, Ruijie Zheng, Aditya Modi, Adith Swaminathan, Ching-An Cheng · PDF
  34. Reimagining Parameter Space Exploration with Diffusion Models

    Lijun Zhang, Xiao Liu, Hui Guan · PDF
  35. Reinforcement Learning with Action Chunking

    Qiyang Li, Zhiyuan Zhou, Sergey Levine · PDF
  36. Reinforcement Learning with Thompson Sampling: No-Regret Performance over Finite Horizons

    Jasmine Bayrooti, Sattar Vakili, Amanda Prorok, Carl Henrik Ek · PDF
  37. Rethinking Exploration In Asynchronous Bayesian Optimization: Standard Acquisition Is All You Need

    Ben Riegler, James A C Odgers, Vincent Fortuin · PDF
  38. Retrospective and Structurally Informed Exploration via Cross-task Successor Feature Similarity

    Arya Ebrahimi, Jun Jin · PDF
  39. Scalable and Efficient Exploration via Intrinsic Rewards in Continuous-time Dynamical Systems

    Klemens Iten, Andreas Krause · PDF
  40. See it to Place it: Evolving Macro Placements with Vision Language Models

    Ikechukwu Uchendu, Vincent Zhuang, Wenjie Jiang, Kuang-Huei Lee, Ebrahim Songhori, Swati Goel, Karly Hou, Vijay Janapa Reddi · PDF
  41. SOAPIA: Siamese-Guided Generation of Off Target-Avoiding Protein Interactions with High Target Affinity

    Sophia Vincoff, Oscar Davis, Yinuo Zhang, Ismail Ilkan Ceylan, Alexander Tong, Joey Bose, Pranam Chatterjee · PDF
  42. Sparse Optimistic Information Directed Sampling

    Ludovic Schwartz, Hamish Flynn, Gergely Neu · PDF
  43. Stabilizing protein fitness predictors via the PCS framework

    Omer Ronen, Alex Y. Zhao, Ron Boger, Chengzhong Ye, Bin Yu · PDF
  44. StemCell-GPT: A Specialized AI Agent For Human Stem Cell Engineering

    Jingwen Hui, Freja Kjellaug Amalia Ekman, Hana Yousef Ghanim, Sridhar Selvaraj, Yuanhao Qu, Matthew Porteus, Le Cong · PDF
  45. Strategic Vantage Selection for Learning Viewpoint-Agnostic Manipulation Policies

    Sreevishakh Vasudevan, Som Sagar, Ransalu Senanayake · PDF
  46. Testing LLM Understanding of Scientific Literature through Expert-Driven Question Answering: Insights from High-Temperature Superconductivity

    Haoyu Guo, Maria Tikhanovskaya, Paul Raccuglia, Alexey Vlaskin, Christopher Co, Daniel J. Liebling, Scott Ellsworth, Matthew Abraham, Elizabeth Dorfman, N.P. Armitage, John M. Tranquada, Senthil Todadri, Antoine Georges, Subir Sachdev, Steven Kivelson, B. J. Ramshaw, Chunhan Feng, Olivier Gingras, Vadim Oganesyan, Michael Brenner, Subhashini Venugopalan, Eun-Ah Kim · PDF
  47. The Effective Horizon Challenge

    Cassidy Laidlaw, Daniel Khalil, Michelle Li, Laker Newhouse, Stuart Russell, Anca Dragan · PDF
  48. The Road Not Taken: Hindsight Exploration for LLMs in Multi-Turn RL

    Huaxiaoyue Wang, Sanjiban Choudhury · PDF
  49. Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

    Jiaqi WANG, Kevin Qinghong Lin, James Cheng, Mike Zheng Shou · PDF
  50. Toward Efficient Exploration by Large Language Model Agents

    Dilip Arumugam, Thomas L. Griffiths · PDF
  51. Towards Unsupervised Multi-Agent Reinforcement Learning via Task-Agnostic Exploration

    Riccardo Zamboni, Mirco Mutti, Marcello Restelli · PDF