NeurIPS 2025PastAI for science

NeurIPS 2025 AI for Science Workshop

NeurIPS2025-AI4Science

Official website ↗OpenReview venue ↗See all NeurIPS workshops →✎ Edit this entry

Submission deadline: Aug 28, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (230)

Fetched from OpenReview (v2) on 2026-06-10.

10 Million Particle Events: Enabling Foundation Models for Sparse 3D Inverse Problems
Omar Alterkait, Sam Young, Ka Vang Tsang, Junjie Xia, Carolyn H Smith, Taritree Wongjirad, Kazuhiro Terao · PDF
A Foundational Dataset for the Predictive Prevention of Waterborne Disease
Aditya Chaudhary · PDF
A Large Multimodal Molecular Representation Encoder-Decoder Foundation Model for Chemistry
Victor Y. Shirasuna, Emilio Vital Brazil, Eduardo Soares, Nathaniel H. Park, Dmitry Zubarev, Vidushi Sharma, Indra Priyadarsini, Caio Rodrigues Gama, Enzo Reis de Oliveira · PDF
A Multi-Modal Deep Learning Model for Drug Potency Prediction: Leveraging Features from Physics-Based Docking and Advanced Co-Folding Methods
Claire Suen, BoRam Lee, Matthew Adrian, Jeffrey Zhou, Hyeyun Jung, David He, Gean Hu, Kelly Hui, Aditi Jain, Qamil Mirza, Milena Novakovic, Joseph Park, Winston Qian, Aarav Shah, Xina Wang, Yunsie Chung, Alan C Cheng · PDF
A Probabilistic U-Net Approach to Downscaling Climate Simulations
Maryam Alipourhajiagha, Pierre-Louis Lemaire, Youssef Diouane, Julie Carreau · PDF
A study of EHVI vs fixed scalarization for molecule design
Anabel Yong, Austin Tripp, Layla Hosseini-Gerami, Brooks Paige · PDF
A Synthesizability-Guided Pipeline for Materials Discovery
Thorben Prein, Willis O'Leary, Aikaterini Flessa Savvidou, Elchaïma Bourneix, Joonatan E. M. Laulainen · PDF
AC-PKAN: Attention-Enhanced and Chebyshev Polynomial-Based Physics-Informed Kolmogorov–Arnold Networks
Hangwei Zhang, Zhimu Huang, Yan Wang · PDF
Accelerated Isotopologue Reduced Partition Function Ratio Prediction with Orbital-based Deep Learning
Simon Andren, Beom Seok Kang, William Goddard, John Eiler, Anima Anandkumar · PDF
Accelerating Protein Molecular Dynamics Simulation with DeepJump
Allan Dos Santos Costa, Manvitha Ponnapati, Dana Rubin, Tess Smidt, JOSEPH JACOBSON · PDF
Adaptive Transition State Refinement with Learned Equilibrium Flows
Samir Darouich, Vinh Tong, Tanja Bien, Johannes Kästner, Mathias Niepert · PDF
AI for Science Strategic Compass: Aligning Discovery Tensions with Core AI Functions
Ran Liu, Zhibin Lin, Xiaowei Huang · PDF
AI4O3: A Foundational Data Collection for Artificial Intelligence in Tropospheric Ozone Research
Makoto Kelp, Sebastian Hickman, Kazuyuki Miyazaki, Kai-Lan Chang, Paul Griffiths, Qindan Zhu, Gerbrand Koren, Fernando Iglesias-Suarez, Elyse Pennington, Martin Georg Schultz · PDF
AIM: Adaptive Intervention for Deep Multi-task Learning of Molecular Properties
Mason Minot, Gisbert Schneider · PDF
AION-1: Omnimodal Foundation Model for Astronomical Sciences
Liam Holden Parker, Francois Lanusse, Jeff Shen, Ollie Liu, Tom Hehir, Leopoldo Sarra, Lucas Thibaut Meyer, Micah Bowles, Sebastian Wagner-Carena, Helen Qu, Siavash Golkar, Alberto Bietti, Hatim Bourfoune, Pierre Cornette, Keiya Hirashima, Geraud Krawezik, Ruben Ohana, Nicholas Lourie, Michael McCabe, Rudy Morel, Payel Mukhopadhyay, Mariel Pettee, Kyunghyun Cho, Miles Cranmer, Shirley Ho · PDF
Alvessa: An Agentic Evidence-Grounded Research Assistant for Genomics
Ksenia Sokolova, Sanketh Vedula, Keerthana Nallamotu, Guillermo Sapiro, Olga G Troyanskaya · PDF
An Agentic Orchestration System for Heliophysics Tasks
Russell Spiewak, Kevin Lee, James Walsh · PDF
An in-silico integration of neurodevelopmental and dopaminergic views of schizophrenia
Xena Al-Hejji, Jose Guillermo Gomez Castro, Santina Duarte, Edgar Bermudez Contreras, Eric Chalmers · PDF
Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate Downscaling
Carlo Saccardi, Maximilian Pierzyna, Haitz Sáez de Ocáriz Borde, Simone Monaco, Cristian Meo, Pietro Lio, Rudolf Saathof, Geethu Joseph, Justin Dauwels · PDF
Augmenting Research Ideation with Data: An Empirical Investigation in Social Science
Xiao Liu, Xinyi Dong, Xinyang Gao, Yansong Feng, Xun Pang · PDF
AutoChemSchematic AI: Agentic Physics-Aware Automation for Chemical Manufacturing Scale-Up
Sagar Srinivas Sakhinana, Shivam Gupta, Venkataramana Runkana · PDF
Automated scientific minimization of regret for cognitive modeling
Marcel Binz, Akshay Kumar Jagadish, Milena Rmus, Eric Schulz · PDF
BasePrompt: Self-Prompting Genome Language Models for RNA Fitness Prediction
Jin Gao, Zheling Tan, Junhao Shi, Dequan Wang · PDF
Benchmarking LLMs for atomic-level geometric manipulation in crystals
Taoyuze Lv, Alexander Chen, Fengyu Xie, Yingheng Wang, Jeffrey Meng, Bram Hoex, Zhicheng Zhong, Tong Xie · PDF
Benchmarking Machine Learning Potentials for Crystal Structure Relaxation
Kowen Woo, Prashant Govindarajan, Sarath Chandar · PDF
Beyond Atoms: Evaluating Electron Density Representation for 3D Molecular Learning
Patricia Adriana Suriana, Joshua A Rackers, Ewa Nowara, Pedro O. Pinheiro, Vishnu Sresht, John M Nicoludis · PDF
Beyond data subsampling: differentiation as an uncertainty source in equation discovery
Khilchuk Maria Denisovna, Ilya Markov, Alexander Hvatov · PDF
Beyond Ensembles: Simulating All-Atom Protein Dynamics in a Learned Latent Space
Aditya Sengar, Ali Hariri, Pierre Vandergheynst, PATRICK BARTH · PDF
Beyond model organisms: robust prediction of functional properties across protein evolution
Lucas Waldburger, Hunter Nisonoff, Marissa Zintel, Liam D. Kirkpatrick, Angelica Lam, Nathan Lanclos, Jay D. Keasling, Max V. Staller, Patrick M. Shih · PDF
Bigger is not always better: evaluating target-specific dataset design strategies for regioselectivity prediction on complex molecules
Jules Schleinitz, Alba Carretero Cerdán, Anjali Gurajapu, Yonatan Harnik, Carolyn Ruan, Gina Lee, Amitesh Pandey, Anat Milo, Sarah Reisman · PDF
BioMedReasoner: Towards Multi-Hop Reasoning using Path-based Relational Learning on Biomedical Knowledge Graphs
Ahmad Wisnu Mulyadi, Lilija Wehling, Ansh Kumar, Nicolas Boucher, Firas Abdessalem, Sven Jager, Mohammed H. Mosa, Thomas Klabunde, Tommaso Andreani, Gurdeep Singh · PDF
BioVerge: A Comprehensive Benchmark and Study of Self-Evaluating Agents for Biomedical Hypothesis Generation
Fuyi Yang, Chenchen Ye, Mingyu Derek Ma, Yijia Xiao, Matthew Yang, Wei Wang · PDF
Block-wise distillation for lightweight weather models
Daniil Sukhorukov, Andrei Zakharov, Dmitry Zhevnenko, Vladimir Kirilin, Ekaterina Muravleva, Ivan Oseledets, Ilya Makarov · PDF
BLOSUM Is All You Learn — Generative Antibody Models Reflect Evolutionary Priors
Talip Ucar, Pietro Sormanni · PDF
Boundary-Augmented Neural Operators for Better Generalization to Unseen Geometries
Jiayi Zhou, Valentin Duruisseaux, Daniel Zhengyu Huang, Anima Anandkumar · PDF
Bridging Neural Operator and Flow Matching for a Generative PDE Foundation Model
Zituo Chen, Sili Deng · PDF
CALM-PDE: Continuous and Adaptive Convolutions for Latent Space Modeling of Time-dependent PDEs
Jan Hagnberger, Daniel Musekamp, Mathias Niepert · PDF
Can Theoretical Physics Research Benefit from Language Agents?
Sirui Lu, Zhijing Jin, Terry Jingchen Zhang, Pavel Kos, Juan Ignacio Cirac, Bernhard Schölkopf · PDF
CAST: Causal Modeling of Time-Varying Treatment Effects on Head and Neck Cancer
Everest Yang, Ria Vasishtha, Luqman K. Dad, Lisa A. Kachnic, Andrew Hope, Eric Wang, Xiao Wu, Yading Yuan, David J Brenner, Igor Shuryak · PDF
Causal AI Scientist: Facilitating Causal Data Science with Large Language Models
Vishal Verma, Sawal Acharya, Samuel Simko, Devansh Bhardwaj, Anahita Haghighat, Dominik Janzing, Mrinmaya Sachan, Zhijing Jin, Yongjin Yang · PDF
Chemist-aligned retrosynthesis by ensembling diverse inductive bias models
Krzysztof Maziarz, Guoqing Liu, Austin Tripp, Junren Li, Piotr Gaiński, Marwin Segler · PDF
CHEMSETS: How Capable Are Chemistry LLMs?
Christoph Bartmann, Mykyta Ielanskyi, Johannes Schimunek, Philipp Seidl, Günter Klambauer, Sohvi Luukkonen · PDF
CiteGuard: Retrieval-Augmented Citation Verification for LLM-Powered Peer Review
Ishaan Gangwani, Aayam Bansal · PDF
Closing the Omics Gap: A Benchmark for Unified Evaluation of Biomolecular Foundation Models
Joseph G. Wakim, Vinayak Gupta, Jose Manuel Marti, Jonathan E Allen, Brian R. Bartoldson, Bhavya Kailkhura · PDF
CompGen: A Conditional Generation Framework for Inverse Composition Design of Catalytic Surfaces
Shuizhou Chen, Chenghan Sun, ZhiyuanLiu, Andi Han, Ichigaku Takigawa, Quan QIAN · PDF
Conditioned Clifford-Steerable Kernels
Bálint László Szarvas, Maksim Zhdanov · PDF
Connecting Preclinical Models to Patient Outcomes: A Machine Learning Dataset for Predictive Validity in Drug Development
Alexander Honkala · PDF
Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design
Danny Reidenbach, Zhonglin Cao, Zuobai Zhang, Kieran Didi, Tomas Geffner, Guoqing Zhou, Jian Tang, Christian Dallago, Arash Vahdat, Emine Kucukbenli, Karsten Kreis · PDF
Constant-Potential Machine Learning Force Field for Electrochemical Interface
Ruoyu Wang, Shaoheng Fang, Qixing Huang, Yuanyue Liu · PDF
Constructing the Mental Health Phenome: An Open Multimodal Dataset Linking Digital Behavior, Physical Health, and Mental Wellbeing
Shakson Isaac, Ambika Grover, Yentl Collin, John Torous, Chirag Patel · PDF
Control-Augmented Diffusion for Autoregressive Data Assimilation
Prakhar Srivastava, Farrin Marouf Sofian, Francesco Immorlano, Stephan Mandt · PDF
Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling
Srinivas Anumasa, Barath Chandran.C, Tingting Chen, Dianbo Liu · PDF
Data-driven Design as a High-Impact, Ecologically Valid Benchmark for Document Understanding
Sireesh Gururaja, Junwon Seo, Hung-Yi Lin, Jeremiah Milbauer, Anthony Rollett, Emma Strubell · PDF
Data-Driven Solar Surface Flux Transport Modeling with Uncertainty Quantification
Katherine Keegan, Nina Bonaventura, Plinio Guzmán, Nishu Karna, Shea Hess-Webber, Spiridon Kasapis, Bibhuti Kumar Jha, Andrés Muñoz-Jaramillo · PDF
Data-optimal scaling of paired antibody language models
Mahdi Shafiei Neyestanak, Sarah M. Burbach, Karenna Ng, Praneeth Gangavarapu, Jonathan Hurtado, Judie Magura, Nasreen Ismail, Daniel Muema, Thumbi Ndung'u, Andrew B. Ward, Bryan Briney · PDF
De novo generation of functional terpene synthases using TpsGPT
Hamsini Ramanathan, Roman Bushuiev, Matouš Soldát, Jiří Kohout, Téo Hebra, Joshua David Smith, Tomas Pluskal · PDF
Decompose, Adapt, and Evolve: Towards Efficient Scientific Equation Discovery with Large Language Models
Pouya Behzadifar, Parshin Shojaee, Sanchit Kabra, Kazem Meidani, Chandan K. Reddy · PDF
Deep Graph Learning for Industrial Carbon Emission Analysis and Policy Impact
Xuanming Zhang · PDF
Demystifying Protein Generation with Hierarchical Conditional Diffusion Models
Zinan Ling, Yi Shi, Brett A. McKinney, Da Yan, Yang Zhou, Bo Hui · PDF
Differentiable Predictive Control for Precise Oxygen Level Maintenance for Critical Patients
Azmine Toushik Wasi, Md Manjurul Ahsan · PDF
Diffusion for Fusion: Designing Stellarators with Generative AI
Misha A Padidar, Ningyuan Huang, Andrew Giuliani, Marina Spivak · PDF
Dimensionality and Topological Stability of Neural Representations in the Human Brain Predict Learning Outcomes
Junjie Yu, Zihan Deng, Wenxiao Ma, Zhuoli Ouyang, Jianyu Zhang, Yi Guo, Quanying Liu · PDF
DINO: dynamics-informed dataset to overcome the limitations of static molecular data in AI-driven drug discovery
Eva Smorodina, Victor Greiff, Rahmad Akbar · PDF
Discontinuous Epitope Fragments as Sufficient Target Templates for Efficient Binder Design
Zhenfeng Deng, Ruijie Hou, Ningrui Xie, Mike Tyers, Michał Koziarski · PDF
Dissecting Larval Zebrafish Hunting Behavior using Deep Reinforcement Learning trained RNNs
Raaghav Malik, Satpreet Harcharan Singh, Sonja Johnson-Yu, Roy Harpaz, Kanaka Rajan · PDF
DistMLIP: A Distributed Inference Platform for Machine Learning Interatomic Potentials
Kevin Han, Bowen Deng, Amir Barati Farimani, Gerbrand Ceder · PDF
Diverse Topology Optimization using Modulated Neural Fields
Andreas Radler, Eric Volkmann, Johannes Brandstetter, Arturs Berzins · PDF
DMPKBench: A Multi-Modal Benchmark for Evaluating LLMs and Agents in Drug Discovery DMPK Tasks‌
Jie Li, Baiming Chen, Zhiyang Zou, rumin zhang, sheng ding, jinjiang guo · PDF
DMRG Quantum Chemistry Dataset for Multi-Reference Machine Learning
Stefan Gugler, Nina Glaser · PDF
Do Llamas Understand the Periodic Table?
Ge Lei, Samuel J. Cooper · PDF
Does LLM dream of differential equation discovery?
Elizaveta Ivanchik, Timur Bavshin, Alexander Hvatov · PDF
Domain-Invariant Feature Learning for Patient-Level Phenotype Prediction from Single-Cell Data
Mathias Perez, Justin Hong, Aaron Zweig, Elham Azizi · PDF
EARS-UDE : Evaluating Auditory Response in Sensory Overload with Universal Differential Equations
Miheer Salunke, Prathamesh Dinesh Joshi, Raj Dandekar, Rajat Dandekar, Sreedath Panat · PDF
Einstein Fields: A Neural Perspective To Computational General Relativity
Sandeep Suresh Cranganore, Andrei Bodnar, Arturs Berzins, Johannes Brandstetter · PDF
Emergent SO(3)-Invariant Molecular Representations from Multimodal Alignment
Eduardo Soares, Victor Y. Shirasuna, Emilio Vital Brazil, Dmitry Zubarev, Enzo Reis de Oliveira, Caio Rodrigues Gama, Daniel Djinishian de Briquez · PDF
Empowering AI in RNAi Therapeutics: A Foundational Dataset for siRNA Design and Optimization
Xin Guo, Jiyang Li · PDF
EquiHGNN: Scalable Rotationally Equivariant Hypergraph Neural Networks
Tien Dang, Truong-Son Hy · PDF
Every Answer Counts: Enhancing Scientific Discovery with Efficient Entity-Centric Question Answering from Long Contexts
Binyamin Perets, Zohar Shnaider, Shie Mannor, Dvir Aran · PDF
Explainable AI–Guided Virtual Experiments Reveal How DNA Sequence Context Shapes Gene Regulation
Sophia Chen, David M. McCandlish, Justin Kinney, Peter K Koo · PDF
Explaining Temporal Effects in Sepsis Prediction
Chaehyeon Kim, Eric Wong · PDF
Exploring Generative Approaches for Predicting Copolymer Sequences from Reaction Conditions
Guanghui Min, Wenxin Xu, Kateri DuBay, Chen Chen · PDF
FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design
Asal Mehradfar, Xuzhe Zhao, Yilun Huang, Emir Ceyani, Yankai Yang, Shihao han, Hamidreza Aghasi, Salman Avestimehr · PDF
Few-shot Protein Fitness Prediction via In-context Learning and Test-time Training
Felix Teufel, Aaron W Kollasch, Yining Huang, Ole Winther, Kevin K Yang, Pascal Notin, Debora Susan Marks · PDF
First Comprehensive Benchmark for Tailored Small Molecule-Binding Aptamer Design
Mariia Eremeyeva, Nikita Serov · PDF
Foundation Models Enabling Multi-Scale Battery Materials Discovery: From Molecules To Devices
Vidushi Sharma, Maxwell J Giammona, Andy Tek, Murtaza Zohair, Nathaniel H. Park, Tim Erdmann, Eduardo Soares, Linda Sundberg, Khanh Nguyen, Young-Hye Na, Emilio Vital Brazil · PDF
From In Silico to In Vitro: Evaluating Molecule Generative Models for Hit Generation
Nagham Osman, Vittorio Lembo, Giovanni Bottegoni, Laura Toni · PDF
From Molecules to Perception: A Benchmark Dataset for AI in Sensory Science
Dachuan Zhang · PDF
From Static Structures to Ensembles: Studying and Harnessing Protein Structure Tokenization
Zijing Liu, Bin Feng, He CAO, Yu Li · PDF
GAPMAP: Mapping Scientific Knowledge Gaps in Biomedical Literature Using Large Language Models
Nourah Salem, Elizabeth White, Mike Bada, Lawrence Hunter · PDF
GCP-VQVAE: A Geometry-Complete Language for Protein 3D Structure
Mahdi Pourmirzaei, Alex Morehead, Farzaneh Esmaili, Jarett Zida Ren, Mohammadreza Pourmirzaeioliaei, Dong Xu · PDF
Generalization Beyond Benchmarks: Evaluating Learnable Protein-Ligand Scoring Functions on Unseen Targets
Jakub Kopko, David Graber, Saltuk Mustafa Eyrilmez, Stanislav Mazurenko, David Bednar, Jiri Sedlar, Josef Sivic · PDF
Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes
Li Zhang, Basu Jindal, Ahmed Alaa, Robert Weinreb, David Wilson, Eran Segal, James Zou, Pengtao Xie · PDF
Generative Latent Space Dynamics of Electron Density
Yuan Chiang, Youngsoo Choi, Daniel Osei-Kuffuor · PDF
GeoGraph: Geometric and Graph-based Ensemble Descriptors for Intrinsically Disordered Proteins
Eoin Quinn, Marco Carobene, Jean Quentin, Sebastien Boyer, Miguel Arbesú, Oliver Bent · PDF
Geometry Aware Inference of Steady State PDEs Using Equivariant Neural Field Representations
Giovanni Catalani, Xavier BERTRAND, Frédéric TOST, Michael BAUERHEIM, Joseph Morlier · PDF
Gradient-Free Physics-informed Operator Learning using Walk-on-Spheres
Hrishikesh Viswanath, Hong Chul Nam, Julius Berner, Anima Anandkumar, Aniket Bera · PDF
Graph Neural Networks for Interferometer Simulations
Sidharth Kannan, Pooyan Goodarzi, Evangelos E. Papalexakis, Jonathan Richardson · PDF
Hash Collisions in Molecular Fingerprints: Effects on Property Prediction and Bayesian Optimization
Walter Virany, Austin Tripp · PDF
Holonic Science: A New Framework for Benchmarking AI Scientists
Nathan Suri, Savannah Jennifer Thais · PDF
How knowledge discovery and embedded paradigm transform industrial process management: exploring pipeline hydraulic dynamic identification
Du Jian, Haochong Li, Jianqin Zheng, Qi Liao, Jun Shen, Shiyuan Pan, Yongtu Liang · PDF
How to Detect and Defeat Molecular Mirage: A Metric-Driven Benchmark for Hallucination in LLM-based Molecular Comprehension
Li Hao, Liuzhenghao Lv, He CAO, Zijing Liu, Zhiyuan Yan, Yu Wang, Yonghong Tian, Yu Li, Li Yuan · PDF
IM-LPG: Inverse Modeling Approach to Laser Pulse Shape Generation in Inertial Confinement Fusion
Ricardo Luna Gutierrez, Vineet Gundecha, Rahman Ejaz, Varchas Gopalaswamy, Riccardo Betti, Sahand Ghorbanpour, Aarne Lees, Soumyendu Sarkar · PDF
Improved Therapeutic Antibody Reformatting through Multimodal Machine Learning
Jiayi Xin, Aniruddh Raghu, Nick Bhattacharya, Adam Carr, Melanie Montgomery, Hunter Elliott · PDF
Improving RNA Secondary Structure Prediction Through Expanded Training Data
Conner J. Langeberg, Taehan Kim, Roma Nagle, Charlotte Meredith, Dimple Amitha Garuadapuri, Jennifer Doudna, Jamie H. D. Cate · PDF
Is Sequence Information All You Need for Bayesian Optimization of Antibodies?
Sebastian W. Ober, Calvin McCarter, Aniruddh Raghu, Yucen Lily Li, Alan Nawzad Amin, Andrew Gordon Wilson, Hunter Elliott · PDF
Label-free biochemical imaging of neural organoids via deep learning-enhanced Raman microspectroscopy
Dimitar Georgiev, Ruoxiao Xie, Daniel Reumann, Xiaoyu Zhao, Álvaro Fernández-Galiana, Mauricio Barahona, Molly M. Stevens · PDF
Large-scale audio-language datasets for bioacoustics
Gagan Narula, Marius Miron, David Robinson, Milad Alizadeh, Masato Hagiwara, Ellen Gilsenan-McMahon, Sara Keen, Benjamin Hoffman, Maddie Cusimano, Emmanuel Chemla, Matthieu Geist, Olivier Pietquin · PDF
LeafTrackNet: A Deep Learning Framework for Robust Leaf Tracking in Top-Down Plant Phenotyping
Shanghua Liu, Majharulislam Babor, Christoph Verduyn, Breght Vandenberghe, Bruno Betoni Parodi, Cornelia Weltzien, Marina MC Höhne · PDF
Learning Boltzmann Generators via Constrained Mass Transport
Christopher von Klitzing, Denis Blessing, Henrik Schopmans, Pascal Friederich, Gerhard Neumann · PDF
Learning chaotic PDEs with boundedness guarantees
Andrea Goertzen, Sunbochen Tang, Navid Azizan · PDF
Learning Deformable Body Interactions With Adaptive Spatial Tokenization
Hao Wang, Yu Liu, Daniel Biggs, Haoru Wang, Jiandong Yu, Ping Huang · PDF
Learning Protein-Ligand Binding in Hyperbolic Space
Jianhui Wang, Wenyu Zhu, Bowen Gao, Xin Hong, Ya-Qin Zhang, Wei-Ying Ma, Yanyan Lan · PDF
Learning to Compress Plasma Turbulence
Gianluca Galletti, Gerald Gutenbrunner, Fabian Paischer, Sandeep Suresh Cranganore, William Hornsby, Naomi Carey, Lorenzo Zanisi, Stanislas Pamela, Johannes Brandstetter · PDF
LEONARDO: A Physics-Informed Generative Model for Stochastic Nanoparticle Dynamics in Liquid-Phase TEM
Zain Shabeeb, Vida Jamali · PDF
Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design
Nathaniel H. Park, Tiffany Callahan, James L. Hedrick, Tim Erdmann, Sara Capponi · PDF
LINKER: Learning Interactions Between Functional Groups and Residues With Chemical Knowledge-Enhanced Reasoning and Explainability
Phuc Pham, Viet Thanh Duy Nguyen, Truong-Son Hy · PDF
LLM Kernel: an evaluation framework for open-ended scientific interpretation
William Connell, Drishti Guin, Clayton Mellina · PDF
Machine Learning Interatomic Potentials: library for efficient training, model development and simulation of molecular systems
Christoph Brunken, Olivier Peltre, Heloise Chomet, Lucien Walewski, Manus McAuliffe, Valentin Heyraud, Solal Attias, Martin Maarand, Yessine Khanfir, Edan Toledo, Fabio Falcioni, Marie Bluntzer, Silvia Acosta-Gutierrez, Jules Tilly · PDF
MARSHA: Multi-Agent RAG System for Hazard Adaptation
Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, Joshua Bergerson, John K Hutchison, Duane Rudolph Verner, Jordan Branham, M. Ross Alexander, Robert Ross, Yan Feng, Leslie-Anne Levy, Weijie J Su, Camillo Jose Taylor · PDF
Measuring Dependencies between Biological Signals with Self-supervision, and its Limitations
Evangelos Sariyanidi, John D Herrington, Lisa D Yankowitz, Pratik Chaudhari, Theodore D. Satterthwaite, Casey J. Zampella, Robert T Schultz, Russell T. Shinohara, Birkan Tunc · PDF
Mechanistic Reaction Data for Interpretable Deep Learning in Chemistry
Ryan J Miller, Alexander E. Dashuta, Pierre Baldi, David Van Vranken, Ann Marie Carlton · PDF
MEGA: A Large-Scale Molecular Editing Dataset for Guided-Action Optimization
Nelson Fernandez, Maxime Illouz, Luis Pinto, Entao Yang, Habiboulaye Amadou Boubacar · PDF
Memory-Augmented Reinforcement Learning for Hierarchical Graph Optimization of Dynamic Bills of Materials in Sustainable Medical device Product Families
Abdelaziz GUELFANE · PDF
MetaOmics-10T: The Foundational Dataset to Unlock Causal Modeling of Microbial Ecosystems
Arvid E. Gollwitzer, Deepak A. Subramanian, Isaac Tucker, Giovanni Traverso · PDF
Mixture-of-Experts Guided Multi-Omic Integration for Gastrointestinal Cancer Subtype Prediction
Sajib Acharjee Dip, Uddip Acharjee Shuvo, Dipanwita Mallick, Abrar Rahman Abir, Liqing Zhang · PDF
MLIPAudit: A benchmarking tool for Machine Learned Interatomic Potentials
Leon Wehrhan, Lucien Walewski, Marie Bluntzer, Heloise Chomet, Christoph Brunken, Jules Tilly, Silvia Acosta-Gutierrez · PDF
Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model
Dongki Kim, Wonbin Lee, Sung Ju Hwang · PDF
Mol-LLM: Multimodal Generalist Molecular LLM with Improved Graph Utilization
Chanhui Lee, Hanbum Ko, Yuheon Song, Yongjun Jeong, Rodrigo Hormazabal, Sehui Han, Kyunghoon Bae, Sungbin Lim, Sungwoong Kim · PDF
Mol-SGCL: Molecular Substructure-Guided Contrastive Learning for Out-of-Distribution Generalization
Andrew Zhou, Yasha Ektefaie, Maha Farhat · PDF
MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery \\via Hierarchical Search
Zonglin Yang, Wanhao Liu, Ben Gao, Yujie Liu, Wei Li, Tong Xie, Lidong Bing, Wanli Ouyang, Erik Cambria, Dongzhan Zhou · PDF
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback
Wanhao Liu, Zonglin Yang, Jue Wang, Lidong Bing, Di Zhang, Dongzhan Zhou, Yuqiang Li, Houqiang Li, Erik Cambria, Wanli Ouyang · PDF
moPPIt-v3: Motif-Specific Peptides Generated via Multi-Objective-Guided Discrete Flow Matching
Tong Chen, Zachary Quinn, Yinuo Zhang, Pranam Chatterjee · PDF
MORGaN: self-supervised multi-relational graph learning for drug target discovery
Martina Occhetta, Anniek Myatt, Manikhandan A. V. Mudaliar, Conrad Bessant · PDF
MSAFlow: a Unified Approach for MSA Representation, Augmentation, and Family-based Protein Design
Anirudh Venkatraman, Hanqun Cao, Tong Wei, Chaoran Cheng, Ge Liu · PDF
Multi-Graph Meta-Transformer: An Interpretable Framework for Cross-Graph Functional Alignment in Neural Decoding
Zahra Moslemi, Ziyi Liang, Norbert J. Fortin, Babak Shahbaba · PDF
Multi-Modal Attention Framework for Underwater Bioacoustic Denoising and Recognition
Amine Razig, Soulaymani Youssef, Loubna Benabbou, Pierre Cauchy · PDF
Multi-Objective Nanobody Design via Masked Discrete Diffusion with Simplex Refinement
Ruoxi Zhang, Pranam Chatterjee · PDF
Multi-Objective Peptide Design via Token-Aligned Preference Optimization
Michaela Areti Zervou, Felix Teufel, Yannis Pantazis, Panagiotis Tsakalides, Ole Winther · PDF
Multi-Scale Classification of Green Bank Telescope Signals
Jessica E. Liang, Ben Jacobson-Bell, Steve Croft · PDF
Multilevel neural simulation-based inference
Yuga Hikida, Ayush Bharti, Niall Jeffrey, Francois-Xavier Briol · PDF
Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning
Gang Liu, Michael Sun, Wojciech Matusik, Meng Jiang, Jie Chen · PDF
Multiscale Neural PDE Surrogates for Prediction and Downscaling: Application to Ocean Currents
Abdessamad El Kabid, Loubna Benabbou, Redouane Lguensat, Alex Hernández-García · PDF
Neural network distillation of orbital dependent density functional theory
Matija Medvidović, Jaylyn C. Umana, Iman Ahmadabadi, Domenico Di Sante, Johannes Flick, Angel Rubio · PDF
Neural Triangular Transport Maps: A New Approach Towards Sampling in Lattice QCD
Andrey Bryutkin, Youssef Marzouk · PDF
OmniCast: A Masked Latent Diffusion Model for Weather Forecasting Across Time Scales
Tung Nguyen, Tuan Pham, Troy Arcomano, Rao Kotamarthi, Ian Foster, Sandeep Madireddy, Aditya Grover · PDF
OpenCityCorpus: A Large-Scale, Harmonized, and LLM-Ready Corpus of Urban Data for Scientific Research
Junfeng Jiao, Sean Hardesty Lewis, Yiming Xu, Jihyung Park, Connor Phillips · PDF
OpenDiscovery: A Verifiable, Creative Science Problem-Solving Dataset to Forge AI Scientists
Yixuan Weng, QiYao Sun, Minjun Zhu, Yue Zhang · PDF
Pareto-Guided Reinforcement Learning for Multi-Objective ADMET Optimization in Generative Drug Design
Hoang-My Nguyen, Nguyet-Hang Vu, Hoang Thanh Lam, Hoang D. Nguyen · PDF
PatchDNA: A Flexible and Biologically-Informed Alternative to Tokenization for DNA
Alice Del Vecchio, Chantriolnt-Andreas Kapourani, Abdullah M Athar, Agnieszka Dobrowolska, Andrew Anighoro, Benjamin Tenmann, Lindsay Edwards, Cristian Regep · PDF
PEAR: Equal Area Weather Forecasting on the Sphere
Hampus Linander, Christoffer Petersson, Daniel Persson, Jan E Gerken · PDF
PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning
Ruheng Wang, Hang Zhang, Trieu Nguyen, Shasha Feng, Hao-Wei Pang, Xiang Yu, Li Xiao, Peter Zhiping Zhang · PDF
Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research
Xiang Liu, Penglei Sun, Shuyan Chen, Longhan Zhang, Peijie Dong, Huajie You, Yongqi Zhang, Chang YAN, Xiaowen Chu, Tong-yi Zhang · PDF
PhySense: Evaluating LLMs on Foundational Physics Principles
Yinggan XU, Yue Liu, Zhi-Qiang Gao, Changnan Peng, Di Luo · PDF
Physics-Informed Learning Near Critical Transitions: A Comparative Study of UDEs and Neural ODEs
Urvi Mahendra Bora, Prathamesh Dinesh Joshi, Raj Dandekar, Rajat Dandekar, Sreedath Panat · PDF
Physics-Informed Neural Networks with Fourier Features and Attention-Driven Decoding
Rohan Arni, Carlos Blanco · PDF
PhysiX: A Foundation Model for Physics Simulations
Tung Nguyen, Arsh Koneru, Shufan Li, Aditya Grover · PDF
PICore: Physics-Informed Unsupervised Coreset Selection for Data Efficient Neural Operator Training
Anirudh Satheesh, Anant Khandelwal, Mucong Ding, Radu Balan · PDF
PIRF: Physics-Informed Reward Fine-Tuning for Diffusion Models
Mingze Yuan, Pengfei Jin, Na Li, Quanzheng Li · PDF
PKG-DPO: Optimizing Domain-Specific AI systems with Physics Knowledge Graphs and Direct Preference Optimization
Nitin Nagesh Kulkarni, Bryson Wilcox, Max Sawa, Jason Thom · PDF
PLAME: Lightweight MSA Design Advances Protein Folding From Evolutionary Embeddings
Hanqun Cao, Xinyi Zhou, Zijun Gao, Chenyu Wang, Xin Gao, Zhi Zhang, Chunbin Gu, Ge Liu, Pheng-Ann Heng · PDF
Predicting Kinase-Specific Phosphorylation Sites with Pretrained Protein Language Models
Mahdi Pourmirzaei, Farzaneh Esmaili, Kai Chen, Mohammadreza Pourmirzaeioliaei, Mohsen Rezaei, Duolin Wang, Dong Xu · PDF
Predictive Feature Caching for Training-free Acceleration of Molecular Geometry Generation
Johanna Sommer, John Rachwan, Nils Fleischmann, Stephan Günnemann, Bertrand Charpentier · PDF
PrimerCast: Predictive Modeling of PCR Amplification with an AI-Ready Experimental Dataset
S. Chan Baek, Kenneth Bryan Hsu, Yasha Ektefaie, Pardis Sabeti · PDF
Proposal for a Large-scale High-quality Dataset of Activity Cliffs
Xiuyuan Hu, Jingyi Zhao, Guoqing Liu, Yang Zhao, Jieran Li, Hao Zhang, José Miguel Hernández-Lobato · PDF
Protein Design with Agent Rosetta: A Case Study for Specialized Scientific Agents
Jacopo Teneggi, Tanya Marwah, Alberto Bietti, P. Douglas Renfrew, Vikram Khipple Mulligan, Siavash Golkar · PDF
PUBHOMICS: A Multispecies Biological Dataset to Catalyze AI-Driven Toxicity Assessment for Environmental and Public Health
Daniel Chinwendu Ukaegbu · PDF
RAG-Enhanced Collaborative LLM Agents for Drug Discovery
Namkyeong Lee, Edward De Brouwer, Ehsan Hajiramezanali, Tommaso Biancalani, Chanyoung Park, Gabriele Scalia · PDF
Rao-Blackwell Gradient Estimators for Equivariant Denoising Diffusion
Vinh Tong, Dung Trung Hoang, Anji Liu, Guy Van den Broeck, Mathias Niepert · PDF
ReactionReasoner: Towards Reasoning LLM for Chemical Reaction Prediction
Hanbum Ko, Chanhui Lee, Ye Rin Kim, Rodrigo Hormazabal, Sehui Han, Sungbin Lim, Sungwoong Kim · PDF
README: Rapid Equation Discovery with Multimodel Encoders
Gregory Kang Ruey Lau, Yue Ran Kang, Zi-Yu Khoo, Apivich Hemachandra, Ruth Wan Theng Chew, Bryan Kian Hsiang Low · PDF
Reasoning LLMs for Materials Discovery with Physics-aware Rejection Sampling
Lee Hyun, Sohee Yoon, Jinwoo Park, Jooyeon Ahn, Yebin Jung, You Jung Chung, JINA KIM, Hogeun Chang, Myeonginn Kang, Seongeon Park, Sujin Park, Sue In Chae, Ho-Gyeong Kim, Myeonghun Jeong · PDF
RemoteFoldSet: Benchmarking Structural Awareness of Protein Language Models
Zinnia Ma, Neville P. Bethel · PDF
Resilience Outcomes Benchmark: Toward an Outcome-Labeled Coping Strategy Dataset for Precision Mental Health
Saurabh Anand · PDF
Reviewing Scientific Papers for Critical Problems With Reasoning LLMs: Baseline Approaches and Automatic Evaluation
Tianmai M. Zhang, Neil F. Abernethy · PDF
Revive Legacy Scientific Reasoning Benchmark by Growing Perturbation
Terry Jingchen Zhang, Wenyuan Jiang, Yinya Huang · PDF
RNA-Scope: Benchmarking RNA Language Models for RNA Sequence Understanding
Hui Wang, Wenjun Lin, Hongwang Xiao, Qiwei Ye, Yaqing Zhang · PDF
Rodent-Bench
Thomas Heap, Laurence Aitchison, Emma Cahill, Adriana Casado Rodriguez · PDF
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents
Kunlun Zhu, Jiaxun Zhang, Ziheng Qi, Nuoxing Shang, Zijia Liu, Peixuan Han, Yue Su, Haofei Yu, Jiaxuan You · PDF
Sampling 3D Molecular Conformers with Diffusion Transformers
Thorben Frank, Winfried Ripken, Gregor Lied, Klaus Robert Muller, Oliver T. Unke, Stefan Chmiela · PDF
Scaling High-Throughput Experimentation Unlocks Robust Reaction-Outcome Prediction
Michał Sadowski, Lukasz Sztukiewicz, Maria Wyrzykowska, Tadija Radusinović, Piotr Byrski, Paweł Włodarczyk-Pruszyński, Bartosz Matysiak, Jan Kulczycki, Filip Ulatowski, Ruard van Workum, Pawel Dabrowski-Tumanski, Paulina Wach, Filip Chmielewski, Jan Rzymkowski, Mateusz Bruno-Kamiński, Jan Busz, Artur Chołuj, Mateja Duda, Tomasz Dybowski, Marco Farinone, Tomasz Jeliński, Alicja Karczewska, Paweł Kowalczyk, Marek Pietrzak, Łukasz Szczupak, Aleksander Szkółka, Grzegorz Wojciechowski, Stanislaw Kamil Jastrzebski · PDF
Scaling Multi-Modal and Multi-Task Transformers for Small Molecule Drug Discovery
David S. Farina Jr, Sai Krishna Sirumalla, Michiel J.M. Niesen, Daniele A. Di Cesare, Felipe Costa Farias, Michael B. O'Connor, Marcelo Gomes Pereira de Lacerda, Orion Walker Dollar, Peter Bygrave, Thomas Dresselhaus, Zhuoran Qiao, Rishi Shah, Jason Swails, Daniel Miles, Oliver Feighan, Stephen Opalenski, Wallace Derricotte, Feizhi Ding, Matthew Welborn, Fred Manby, Thomas Miller · PDF
scCMap: Connecting Genetic and Chemical Perturbations at Single-Cell Resolution
Yiming Li, Min Zeng, Min Li · PDF
Scientific Machine Learning for Symbolic Recovery of Relativistic Effects in Black Hole Orbits
Pothuraju Naveen Yadav, Prathamesh Dinesh Joshi, Raj Dandekar, Rajat Dandekar, Sreedath Panat, Dinesh Kumar Vishwakarma · PDF
SciKnowEval: A Comprehensive Dataset for Evaluating Scientific Knowledge of Large Language Models
Kehua Feng, Xinyi Shen, Weijie Wang, Xiang Zhuang, Yuqi Tang, Qiang Zhang, Keyan Ding · PDF
SciNav: A General Agent Framework for Scientific Coding Tasks
TIANSHU ZHANG, Huan Sun · PDF
Semantic search for 100M+ galaxy images using AI-generated captions
Nolan Koblischke, Liam Holden Parker, Francois Lanusse, Irina Espejo Morales, Jo Bovy, Shirley Ho · PDF
Sinhala Diachronic Corpus
Nisansa de Silva · PDF
SkillPuzzler: A Self-Evolving Agentic Framework for Materials and Chemistry Research with Minimal Reliance on Predefined Tools
Xu Huang, Junwu Chen, Philippe Schwaller, Gerbrand Ceder · PDF
Smiles2Dock: a large-scale dataset for ML-based docking score prediction using AlphaFold structures
Thomas Le Menestrel, Manuel Rivas Cruz · PDF
Softly Constrained Denoisers for Diffusion Models
Victor M. Yeom-Song, Severi Rissanen, Arno Solin, Samuel Kaski, Mingfei Sun · PDF
SPADE: Inferring Transcriptional Dynamics from Spatial Transcriptomics with Physics-Informed Deep Learning
Xiao Wang, Jia Wang, Yuhui Wei, Yijie Wang, Sha Cao, Chi Zhang · PDF
Sparse Autoencoders for Low-$N$ Protein Function Prediction and Design
Darin Tsui, Kunal Talreja, Amirali Aghazadeh · PDF
Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?
Sukwon Yun, Heming Yao, Burkhard Hoeckendorf, David Richmond, Aviv Regev, Russell Littman · PDF
Spatio-Temporal Graphs Beyond Grids: Benchmark for Maritime Anomaly Detection
Jeehong Kim, Youngseok Hwang, Minchan Kim, Sungho Bae, Hyunwoo Park · PDF
Static and Dynamic Diffusion Emulators: From Sampling Gray Swan Extreme Events to Suffering from Model Collapse
Karan Jakhar, Pedram Hassanzadeh, Björn Lütjens, Jonathan Weare · PDF
Steering the Evolutionary Game: Hierarchical Control of Therapeutic Resistance in Cancer Treatment
Arvid E. Gollwitzer, Deepak A. Subramanian, Isaac Tucker, Giovanni Traverso · PDF
Steering Vector Fields for Property-Controlled Molecular Generation with Chemical Language Models
Aleksandar Dimitrievikj, Jude Wells · PDF
Synergizing Large Language Models and Knowledge Graphs in Science: A Survey
Zhihui Zhu, Yuqi Tang, Qiang Zhang, Keyan Ding · PDF
SynthFair: A Semi-Synthetic Medical Imaging Dataset to Propel Research on Bias Detection & Mitigation
Fabio De Sousa Ribeiro, Estanislao Claucich, Emma A.M. Stanley, Panos Dimitrakopoulos, Sotirios A. Tsaftaris, Enzo Ferrante, Ben Glocker, Rodrigo Echeveste · PDF
TadABench-1M: A Large-Scale Wet-Lab Protein Benchmark For Rigorous OOD Evaluation
Jin Gao, Juntu Zhao, Jiaqi Shen, Junhao Shi, Dukun Zhao, Yuming Lu, Dequan Wang · PDF
Task Alignment Outweighs Framework Choice in Scientific LLM Agents
Nawaf Alampara, Martiño Ríos-García, Chandan Gupta, Sajid Mannan, Santiago Miret, N M Anoop Krishnan, Kevin Maik Jablonka · PDF
TeBaAb: Text-Based Antigen-Conditioned Antibody Redesign via Directed Evolution
Cuong Manh Nguyen, Huy-Hoang Do-Huu, Viet Thanh Duy Nguyen, Truong-Son Hy · PDF
Test-Time Control Over Accuracy-Cost Trade-Offs in Neural Physics Simulators via Recurrent Depth
Harris Abdul Majid, Pietro Sittoni, Francesco Tudisco · PDF
The Darwin–Gödel Discovery Machine: Toward Bounded-Risk Self-Improving AI4Science
Xuening Wu, Xinhang Zhang, Yanlan Kang, Qianya Xu, Honggang Wang, Zeping Chen · PDF
The Loss Landscape of XRD-Based Structure Optimization Is Too Rough for Gradient Descent
Nofit Segal, Akshay Subramanian, Mingda Li, Benjamin Kurt Miller, Rafael Gomez-Bombarelli · PDF
The More You Automate, the Less You See: The Hidden Pitfalls of AI Scientist Systems
Ziming Luo, Atoosa Kasirzadeh, Nihar B Shah · PDF
The Transparent Earth: A Multimodal Foundation Model for the Earth's Subsurface
Arnab Neelim Mazumder, Javier E. Santos, Noah Hobbs, Mohamed Mehana, Daniel O'Malley · PDF
Thinking like a CHEMIST: Combined Heterogeneous Embedding Model Integrating Structure and Tokens
Nikolai Rekut, Alexey Orlov, Klea Ziu, Elizaveta Starykh, Martin Takáč, Aleksandr Beznosikov · PDF
Token-Level Early Fusion Model Bridging Text and 3D Electron Density Grids in Chemistry
Eduardo Soares, Emilio Vital Brazil, Victor Y. Shirasuna, Henrique de Morais Porto, Enzo Reis de Oliveira, Caio Rodrigues Gama, Daniel Djinishian de Briquez, Sandro Rama Fiorini, Marcelo Nery dos Santos, Nathaniel H. Park, Dmitry Zubarev · PDF
Token-Level Guided Discrete Diffusion for Membrane Protein Design
Shrey Goel, Peregrine Michael Schray, Yinuo Zhang, Sophia Vincoff, Huong T. Kratochvil, Pranam Chatterjee · PDF
Topological defects propagate information in deep neural networks
Nabil Iqbal, Max Welling · PDF
Topological Feature Compression for Molecular Graph Neural Networks
Rahul Khorana · PDF
Topological Graph Generative Model for Ecological Design
Zitong S. Chen, Oana Carja · PDF
TorchQuantumDistributed
Oliver Knitter, Jonathan Mei, Masako Yamada, Martin Roetteler · PDF
Towards Accurate Test-Time Adaptation for Neural Surrogates
Anna Zimmel, Gianluca Galletti, Paul Setinek, Johannes Brandstetter, Werner Zellinger · PDF
Towards Generating Stable Materials via Large Language Models with Reinforcement Learning Finetuning
Zhang-Wei Hong, Nofit Segal, Aviv Netanyahu, Hoje Chun, Rafael Gomez-Bombarelli, Pulkit Agrawal · PDF
Towards Multi-Fidelity Scaling Laws of Neural Surrogates in CFD
Paul Setinek, Gianluca Galletti, Johannes Brandstetter · PDF
Training Dynamics of Learning 3D-Rotational Equivariance
Max W Shen, Ewa Nowara, Michael Maser, Kyunghyun Cho · PDF
TroubleRAG: Evaluating Retrieval Pipelines for Real-World Chemistry Troubleshooting
Mahsa Monshizadeh, Xiaoyi Chen, Haixu Tang, Yuzhen Ye · PDF
Trustworthy Retrosynthesis: Eliminating Hallucinations with a Diverse Ensemble of Reaction Scorers
Michał Sadowski, Maria Wyrzykowska, Lukasz Sztukiewicz, Tadija Radusinović, Jan Rzymkowski, Paweł Włodarczyk-Pruszyński, Mikołaj Sacha, Piotr Kozakowski, Ruard van Workum, Stanislaw Kamil Jastrzebski · PDF
Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery
Robert Yang · PDF
Urban Climate Counterfactuals: A Causal Dataset for Street-Level Heat Mitigation Interventions
Ahanaf Hasan Ariq · PDF
Using Deep Reinforcement Learning to Understand Odor Plume Tracking in Walking and Flying Agents
Aarav Sinha, Satpreet Harcharan Singh · PDF
WhaleLM: Finding Structure and Information in Sperm Whale Vocalizations and Behavior with Machine Learning
Pratyusha Sharma, Shane Gero, Daniela Rus, Antonio Torralba, Jacob Andreas · PDF
When Do LLMs Improve Bayesian Optimization? A Systematic Comparison Across Molecular and Protein Design
Mattias Akke, Soojung Yang, Jurģis Ruža, Jinyeop Song, Elton Pan, Rafael Gomez-Bombarelli · PDF
WildSci: Advancing Scientific Reasoning from In-the-Wild Literature
Tengxiao Liu, Deepak Nathani, Zekun Li, Kevin Yang, William Yang Wang · PDF
Without Safeguards, AI-Biology Integration Risks Accelerating Future Pandemics
Dianzhuo Wang, Marian Huot, Zechen Zhang, Kaiyi Jiang, Eugene Shakhnovich, Kevin M. Esvelt · PDF
Wrong Model, Right Uncertainty: Spatial Associations for Discrete Data with Misspecification
David R. Burt, Renato Berlinghieri, Tamara Broderick · PDF
Zephyrus: An Agentic Framework for Weather Science
Sumanth Varambally, Marshall Fisher, Jas Thakker, Yiwei Chen, Zhirui Xia, Ruijia Niu, Yasaman Jafari, Veeramakali Vignesh Manivannan, Zachary Novack, Luyu Han, Srikar Eranky, Salva Rühling Cachay, Taylor Berg-Kirkpatrick, Duncan Watson-Parris, Yian Ma, Rose Yu · PDF
Zero-Shot Protein–Ligand Binding-Residue Prediction from Sequence and SMILES
Mahdi Pourmirzaei, Salhuldin Alqarghuli, Kai Chen, Mohammadreza Pourmirzaeioliaei, Dong Xu · PDF

Accepted papers (230)

☆10 Million Particle Events: Enabling Foundation Models for Sparse 3D Inverse Problems

☆A Foundational Dataset for the Predictive Prevention of Waterborne Disease

☆A Large Multimodal Molecular Representation Encoder-Decoder Foundation Model for Chemistry

☆A Multi-Modal Deep Learning Model for Drug Potency Prediction: Leveraging Features from Physics-Based Docking and Advanced Co-Folding Methods

☆A Probabilistic U-Net Approach to Downscaling Climate Simulations

☆A study of EHVI vs fixed scalarization for molecule design

☆A Synthesizability-Guided Pipeline for Materials Discovery

☆AC-PKAN: Attention-Enhanced and Chebyshev Polynomial-Based Physics-Informed Kolmogorov–Arnold Networks

☆Accelerated Isotopologue Reduced Partition Function Ratio Prediction with Orbital-based Deep Learning

☆Accelerating Protein Molecular Dynamics Simulation with DeepJump

☆Adaptive Transition State Refinement with Learned Equilibrium Flows

☆AI for Science Strategic Compass: Aligning Discovery Tensions with Core AI Functions

☆AI4O3: A Foundational Data Collection for Artificial Intelligence in Tropospheric Ozone Research

☆AIM: Adaptive Intervention for Deep Multi-task Learning of Molecular Properties

☆AION-1: Omnimodal Foundation Model for Astronomical Sciences

☆Alvessa: An Agentic Evidence-Grounded Research Assistant for Genomics

☆An Agentic Orchestration System for Heliophysics Tasks

☆An in-silico integration of neurodevelopmental and dopaminergic views of schizophrenia

☆Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate Downscaling

☆Augmenting Research Ideation with Data: An Empirical Investigation in Social Science

☆AutoChemSchematic AI: Agentic Physics-Aware Automation for Chemical Manufacturing Scale-Up

☆Automated scientific minimization of regret for cognitive modeling

☆BasePrompt: Self-Prompting Genome Language Models for RNA Fitness Prediction

☆Benchmarking LLMs for atomic-level geometric manipulation in crystals

☆Benchmarking Machine Learning Potentials for Crystal Structure Relaxation

☆Beyond Atoms: Evaluating Electron Density Representation for 3D Molecular Learning

☆Beyond data subsampling: differentiation as an uncertainty source in equation discovery

☆Beyond Ensembles: Simulating All-Atom Protein Dynamics in a Learned Latent Space

☆Beyond model organisms: robust prediction of functional properties across protein evolution

☆Bigger is not always better: evaluating target-specific dataset design strategies for regioselectivity prediction on complex molecules

☆BioMedReasoner: Towards Multi-Hop Reasoning using Path-based Relational Learning on Biomedical Knowledge Graphs

☆BioVerge: A Comprehensive Benchmark and Study of Self-Evaluating Agents for Biomedical Hypothesis Generation

☆Block-wise distillation for lightweight weather models

☆BLOSUM Is All You Learn — Generative Antibody Models Reflect Evolutionary Priors

☆Boundary-Augmented Neural Operators for Better Generalization to Unseen Geometries

☆Bridging Neural Operator and Flow Matching for a Generative PDE Foundation Model

☆CALM-PDE: Continuous and Adaptive Convolutions for Latent Space Modeling of Time-dependent PDEs

☆Can Theoretical Physics Research Benefit from Language Agents?

☆CAST: Causal Modeling of Time-Varying Treatment Effects on Head and Neck Cancer

☆Causal AI Scientist: Facilitating Causal Data Science with Large Language Models

☆Chemist-aligned retrosynthesis by ensembling diverse inductive bias models

☆CHEMSETS: How Capable Are Chemistry LLMs?

☆CiteGuard: Retrieval-Augmented Citation Verification for LLM-Powered Peer Review

☆Closing the Omics Gap: A Benchmark for Unified Evaluation of Biomolecular Foundation Models

☆CompGen: A Conditional Generation Framework for Inverse Composition Design of Catalytic Surfaces

☆Conditioned Clifford-Steerable Kernels

☆Connecting Preclinical Models to Patient Outcomes: A Machine Learning Dataset for Predictive Validity in Drug Development

☆Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design

☆Constant-Potential Machine Learning Force Field for Electrochemical Interface

☆Constructing the Mental Health Phenome: An Open Multimodal Dataset Linking Digital Behavior, Physical Health, and Mental Wellbeing

☆Control-Augmented Diffusion for Autoregressive Data Assimilation

☆Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling

☆Data-driven Design as a High-Impact, Ecologically Valid Benchmark for Document Understanding

☆Data-Driven Solar Surface Flux Transport Modeling with Uncertainty Quantification

☆Data-optimal scaling of paired antibody language models

☆De novo generation of functional terpene synthases using TpsGPT

☆Decompose, Adapt, and Evolve: Towards Efficient Scientific Equation Discovery with Large Language Models

☆Deep Graph Learning for Industrial Carbon Emission Analysis and Policy Impact

☆Demystifying Protein Generation with Hierarchical Conditional Diffusion Models

☆Differentiable Predictive Control for Precise Oxygen Level Maintenance for Critical Patients

☆Diffusion for Fusion: Designing Stellarators with Generative AI

☆Dimensionality and Topological Stability of Neural Representations in the Human Brain Predict Learning Outcomes

☆DINO: dynamics-informed dataset to overcome the limitations of static molecular data in AI-driven drug discovery

☆Discontinuous Epitope Fragments as Sufficient Target Templates for Efficient Binder Design

☆Dissecting Larval Zebrafish Hunting Behavior using Deep Reinforcement Learning trained RNNs

☆DistMLIP: A Distributed Inference Platform for Machine Learning Interatomic Potentials

☆Diverse Topology Optimization using Modulated Neural Fields

☆DMPKBench: A Multi-Modal Benchmark for Evaluating LLMs and Agents in Drug Discovery DMPK Tasks‌

☆DMRG Quantum Chemistry Dataset for Multi-Reference Machine Learning

☆Do Llamas Understand the Periodic Table?

☆Does LLM dream of differential equation discovery?

☆Domain-Invariant Feature Learning for Patient-Level Phenotype Prediction from Single-Cell Data

☆EARS-UDE : Evaluating Auditory Response in Sensory Overload with Universal Differential Equations

☆Einstein Fields: A Neural Perspective To Computational General Relativity

☆Emergent SO(3)-Invariant Molecular Representations from Multimodal Alignment

☆Empowering AI in RNAi Therapeutics: A Foundational Dataset for siRNA Design and Optimization

☆EquiHGNN: Scalable Rotationally Equivariant Hypergraph Neural Networks

☆Every Answer Counts: Enhancing Scientific Discovery with Efficient Entity-Centric Question Answering from Long Contexts

☆Explainable AI–Guided Virtual Experiments Reveal How DNA Sequence Context Shapes Gene Regulation

10 Million Particle Events: Enabling Foundation Models for Sparse 3D Inverse Problems

A Foundational Dataset for the Predictive Prevention of Waterborne Disease

A Large Multimodal Molecular Representation Encoder-Decoder Foundation Model for Chemistry

A Multi-Modal Deep Learning Model for Drug Potency Prediction: Leveraging Features from Physics-Based Docking and Advanced Co-Folding Methods

A Probabilistic U-Net Approach to Downscaling Climate Simulations

A study of EHVI vs fixed scalarization for molecule design

A Synthesizability-Guided Pipeline for Materials Discovery

AC-PKAN: Attention-Enhanced and Chebyshev Polynomial-Based Physics-Informed Kolmogorov–Arnold Networks

Accelerated Isotopologue Reduced Partition Function Ratio Prediction with Orbital-based Deep Learning

Accelerating Protein Molecular Dynamics Simulation with DeepJump

Adaptive Transition State Refinement with Learned Equilibrium Flows

AI for Science Strategic Compass: Aligning Discovery Tensions with Core AI Functions

AI4O3: A Foundational Data Collection for Artificial Intelligence in Tropospheric Ozone Research

AIM: Adaptive Intervention for Deep Multi-task Learning of Molecular Properties

AION-1: Omnimodal Foundation Model for Astronomical Sciences

Alvessa: An Agentic Evidence-Grounded Research Assistant for Genomics

An Agentic Orchestration System for Heliophysics Tasks

An in-silico integration of neurodevelopmental and dopaminergic views of schizophrenia

Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate Downscaling

Augmenting Research Ideation with Data: An Empirical Investigation in Social Science

AutoChemSchematic AI: Agentic Physics-Aware Automation for Chemical Manufacturing Scale-Up

Automated scientific minimization of regret for cognitive modeling

BasePrompt: Self-Prompting Genome Language Models for RNA Fitness Prediction

Benchmarking LLMs for atomic-level geometric manipulation in crystals

Benchmarking Machine Learning Potentials for Crystal Structure Relaxation

Beyond Atoms: Evaluating Electron Density Representation for 3D Molecular Learning

Beyond data subsampling: differentiation as an uncertainty source in equation discovery

Beyond Ensembles: Simulating All-Atom Protein Dynamics in a Learned Latent Space

Beyond model organisms: robust prediction of functional properties across protein evolution

Bigger is not always better: evaluating target-specific dataset design strategies for regioselectivity prediction on complex molecules

BioMedReasoner: Towards Multi-Hop Reasoning using Path-based Relational Learning on Biomedical Knowledge Graphs

BioVerge: A Comprehensive Benchmark and Study of Self-Evaluating Agents for Biomedical Hypothesis Generation

Block-wise distillation for lightweight weather models

BLOSUM Is All You Learn — Generative Antibody Models Reflect Evolutionary Priors

Boundary-Augmented Neural Operators for Better Generalization to Unseen Geometries

Bridging Neural Operator and Flow Matching for a Generative PDE Foundation Model

CALM-PDE: Continuous and Adaptive Convolutions for Latent Space Modeling of Time-dependent PDEs

Can Theoretical Physics Research Benefit from Language Agents?

CAST: Causal Modeling of Time-Varying Treatment Effects on Head and Neck Cancer

Causal AI Scientist: Facilitating Causal Data Science with Large Language Models

Chemist-aligned retrosynthesis by ensembling diverse inductive bias models

CHEMSETS: How Capable Are Chemistry LLMs?

CiteGuard: Retrieval-Augmented Citation Verification for LLM-Powered Peer Review

Closing the Omics Gap: A Benchmark for Unified Evaluation of Biomolecular Foundation Models

CompGen: A Conditional Generation Framework for Inverse Composition Design of Catalytic Surfaces

Conditioned Clifford-Steerable Kernels

Connecting Preclinical Models to Patient Outcomes: A Machine Learning Dataset for Predictive Validity in Drug Development

Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design

Constant-Potential Machine Learning Force Field for Electrochemical Interface

Constructing the Mental Health Phenome: An Open Multimodal Dataset Linking Digital Behavior, Physical Health, and Mental Wellbeing

Control-Augmented Diffusion for Autoregressive Data Assimilation

Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling

Data-driven Design as a High-Impact, Ecologically Valid Benchmark for Document Understanding

Data-Driven Solar Surface Flux Transport Modeling with Uncertainty Quantification

Data-optimal scaling of paired antibody language models

De novo generation of functional terpene synthases using TpsGPT

Decompose, Adapt, and Evolve: Towards Efficient Scientific Equation Discovery with Large Language Models

Deep Graph Learning for Industrial Carbon Emission Analysis and Policy Impact

Demystifying Protein Generation with Hierarchical Conditional Diffusion Models

Differentiable Predictive Control for Precise Oxygen Level Maintenance for Critical Patients

Diffusion for Fusion: Designing Stellarators with Generative AI

Dimensionality and Topological Stability of Neural Representations in the Human Brain Predict Learning Outcomes

DINO: dynamics-informed dataset to overcome the limitations of static molecular data in AI-driven drug discovery

Discontinuous Epitope Fragments as Sufficient Target Templates for Efficient Binder Design

Dissecting Larval Zebrafish Hunting Behavior using Deep Reinforcement Learning trained RNNs

DistMLIP: A Distributed Inference Platform for Machine Learning Interatomic Potentials

Diverse Topology Optimization using Modulated Neural Fields

DMPKBench: A Multi-Modal Benchmark for Evaluating LLMs and Agents in Drug Discovery DMPK Tasks‌

DMRG Quantum Chemistry Dataset for Multi-Reference Machine Learning

Do Llamas Understand the Periodic Table?

Does LLM dream of differential equation discovery?

Domain-Invariant Feature Learning for Patient-Level Phenotype Prediction from Single-Cell Data

EARS-UDE : Evaluating Auditory Response in Sensory Overload with Universal Differential Equations

Einstein Fields: A Neural Perspective To Computational General Relativity

Emergent SO(3)-Invariant Molecular Representations from Multimodal Alignment

Empowering AI in RNAi Therapeutics: A Foundational Dataset for siRNA Design and Optimization

EquiHGNN: Scalable Rotationally Equivariant Hypergraph Neural Networks

Every Answer Counts: Enhancing Scientific Discovery with Efficient Entity-Centric Question Answering from Long Contexts

Explainable AI–Guided Virtual Experiments Reveal How DNA Sequence Context Shapes Gene Regulation

Explaining Temporal Effects in Sepsis Prediction