ICML 2026 Past AgentsGenerative models

The 2026 Workshop on Generative and Agentic AI for Biology

GenBio 2026

Submission deadline
May 9, 2026, 12:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (153)

Fetched from OpenReview (v2) on 2026-06-10.

  1. 3D Molecule Generation from Rigid Motifs via $\mathrm{SE}(3)$ Flows

    Roman Poletukhin, Marcel Kollovieh, Eike Eberhard, Stephan Günnemann · PDF
  2. A Deep Generative Mixture Model for Enhancing Circulating Tumor DNA Estimation

    Mathilde Hartvig Diekema, Christian Marius Lillelund, Mads Heilskov Rasmussen, Claus Lindbjerg Andersen, Jakob Skou Pedersen · PDF
  3. A supervised ontology-aware cell annotation method for single-cell transcriptomic data

    Nimish Magre, Ebtisam Alshehri, Fedor Grab, Yerdos Ordabayev, Mehrtash Babadi, Stephen J. Fleming · PDF
  4. ACER: Towards Generalizable Protein-ligand Co-folding

    Nopsinth Vithayapalert, Francesca Grisoni · PDF
  5. Active Flow Expansion for Out-of-Distribution Discovery: from Theory to Molecules

    Riccardo De Santi, Bruce D Lee, Cristian Perez Jensen, Kimon Protopapas, Sophia Tang, Cheng-Hao Liu, Pranam Chatterjee, Yisong Yue, Andreas Krause · PDF
  6. Affinage: Genome-Scale Mechanistic Gene Annotation from the Published Literature

    Matteo Di Bernardo, Iain M. Cheeseman · PDF
  7. Agent-Guided De Novo Design of Nanobody Binders Against a Novel Cancer Target

    Yue Zhao, Melih Yilmaz, Edward Lee, Chuanyui Teh, Lan Guo, Kemal Sonmez, Luca Giancardo, Gordon Trang, Fangda Xu, Madelyn Espinosa-Cotton, Nai-Kong Cheung, Jiwon Kim, Nina Cheng · PDF
  8. Agentic Discovery of Non-Canonical Antimicrobial Peptides with AMPGAN v3

    Jay Hwasung Jung, Xiaohan Zhang, Shenghan Song, Mahmoud Sayedahmed, Chijian Xiang, Yunong Xu, Ahmed AbdelKhalek, Severin T. Schneebeli, Matthew J. Wargo, Jianing Li, Safwan Wshah · PDF
  9. AgentPLM: Agentic Protein Language Models with Reasoning-Augmented Decoding for Protein Sequence Design

    Sahil Rahman, Maxx Richard Rahman · PDF
  10. AIR: Inference-Time Refinement for Discrete-Diffusion Antibody Humanization

    Anna Karpova, Andrey Shevtsov, Viacheslav Meshchaninov, Pavel Strashnov · PDF
  11. AIVARI Agent: An Evidence-Grounded Agentic LLM for Variant Reportability and Interpretation

    Min Kang, Seungwoo Kim, Ki Woong Kwon, Dongseok Moon · PDF
  12. AlloGen: Conformation-Selective Binder Design with Differential State Scoring

    Hanqun Cao, Aastha Pal, Sumi Kimura, Yesol Kim, Jingjie Zhang, Pheng-Ann Heng, Pranam Chatterjee · PDF
  13. AMP-DiT: Antimicrobial Peptide Design with AMP-classifier Conditional Diffusion Transformers

    ALIREZA NOROOZI, Attila Gürsoy · PDF
  14. annDNA: Learning Annotation-Aware Genomic Representations via Knowledge Distillation

    Hwanseok Sim, Yujin Kim, Soo-Whee Kim, Yeojin Ryu, Joon-Yong An · PDF
  15. AnomalyModifier: Suppressor Modifier Discovery in Familial Hypercholesterolemia via One-Class Anomaly Detection

    Inpyo Hong, Joohyun Han, Min Kang, Doyeon Ha · PDF
  16. Antibody Generation via Redistributed Latent Diffusion

    Andrey Shevtsov, Viacheslav Meshchaninov, Pavel Strashnov, Dmitry Vetrov · PDF
  17. Ares: Loss-Free Mixture-of-Experts Routing for Bidirectional Protein Encoders

    Hazem Alsamkary · PDF
  18. AURORA: Alignment-Guided Mutation Proposal for Protein Engineering

    Katie Spivakovsky, Jeannie She, Nathan P. Shapiro, Krithik Ramesh · PDF
  19. Autoregressive Models Enable Efficient Conditional 3D Molecular Generation

    Song Kim, Ameya Daigavane, Parthasarathy Suryanarayanan, Tess Smidt · PDF
  20. Base-and-Sugar Dual-Frame Flow Matching for RNA Co-Design

    Junzhe Li, Lijian Peng, Yuhao Li, Yize Zhou, Hanqun Cao, Cheng Tan, Shengchao Liu · PDF
  21. Beyond Nativeness: Viral Proteins in Protein Language Models

    Arthur Bigot, Harmon Bhasin, Core Francisco Park, Eugene Shakhnovich, Dianzhuo Wang · PDF
  22. BGC-Master: Detecting Novel Biosynthetic Gene Clusters with DNA Foundation Models

    Fenyi Liu, Yuhong SUN, Fan Zhang, Siheng Chen · PDF
  23. BioSkillSafety: A Systematic Benchmark for Evaluating Agent Skill Safety in Bioinformatics

    Bioclaw Team · PDF
  24. bish-bash-fold: what are protein structure prediction models learning?

    Soo-Jeong Kim · PDF
  25. Boltz-1 as a force field -- why co-folding models struggle with learning physics and how to fix it

    Urszula Julia Komorowska, Vsevolod Viliuga, Leif Seute, Pietro Lio, Mateja Jamnik · PDF
  26. Boltz-Jump: Accelerated Sampling of the Conformational Landscape of Biomolecular Structure Prediction Models

    Ameya Daigavane, Shashank Sule, Saeed Saremi, Andrew Martin Watkins, Joseph Kleinhenz, Tess Smidt, Bodhi P. Vani · PDF
  27. Boltz-Perturb: The Path Not Taken. Unlocking Generative Diversity in Co-Folding Models via Training-Free Conditioning Perturbation

    Hyeyun Jung, Alan C Cheng, BoRam Lee · PDF
  28. Bridging Gene Regulatory Networks and Causal Representation Learning in Single-Cell Genomics Data

    Vincenzo Lagani, Giorgi Sokhadze, Liliia Nigmetzianova, Robert Lehmann, Yiling Ma, Sumeer Ahmad Khan, Xabier Martinez de Morentin, Narsis A. Kiani, Mikel Hernaez, Alexander A. Lukyanov, Jesper Tegnér, David Gomez-Cabrero · PDF
  29. Can AI Scientist Agents Learn from Lab-in-the-Loop Feedback? Evidence from Iterative Perturbation Discovery

    Gilles Wainrib, Barbara Bodinier, Haithem Dakhli, Almudena Espin-Perez, Roberta Codato, John Klein · PDF
  30. Can AI Scientists Discover Neural Mechanisms? Evaluating Agentic Biological Discovery in a Digital Fly.

    Aarav Sinha · PDF
  31. Canopy: A Heterograph Foundation Model for Metabolic Engineering

    Jake Bowden, Laurence Legon, Satnam S. Surae · PDF
  32. Cell-Level Virtual Screening

    Caleb Ellington, Sohan Addagudi, Jiaqi Wang, Ben Lengerich, Eric P. Xing · PDF
  33. CIDER: Conformal Information-Directed Agents for Low-Budget Protein Engineering

    Rishabh Bhattacharya · PDF
  34. CLAMP: Steady-State ODE Inference of Gene Regulatory Networks from Single-Cell Perturbations

    Amaya Gallagher-Syed, Aaron Wenteler, Santiago A Martinez, Gregory Slabaugh · PDF
  35. Coder as Editor: Code-driven Interpretable Molecular Editing

    Wenyu Zhu, Chengzhu Li, Xiaohe Tian, Yifan Wang, Yinjun Jia, Jianhui Wang, Bowen Gao, Haichuan Tan, Ya-Qin Zhang, Wei-Ying Ma, Yanyan Lan · PDF
  36. COMPASS: Decoupled Latent Steering for Protein Conformational Transitions

    Changyu Lee, Sunghee Choi, Gyu Rie Lee · PDF
  37. Confidence-Weighted Elastic Gaussian Networks To Predict Protein Flexibility

    Matheus Ferraz, Francesco Alesiani, Henrik Christiansen · PDF
  38. ConTact: Contact-First Antibody CDR Design via Explicit Interface Reasoning

    Mansoor Ahmed, Spencer VonBank, Nadeem Taj, Sujin Lee, Naila Jan, Murray Patterson · PDF
  39. Contextualizing Biological Language Models across Modalities via Logit-Space Contrastive Alignment

    Yanjun Shao, Yundi Chen, Yashvi Patel, Aurelien Pelissier, María Rodríguez Martínez · PDF
  40. CPgen: Heterochiral Cyclic Peptide Ensemble Generation and Ensemble-Based Sequence Design

    Maxim Secor, Jovan Damjanovic, Veronika Thost, Sebastian Swanson, Kristine Deibler, Jesper Ferkinghoff-Borg · PDF
  41. CupOFLATTE: Coupled Objective-Guided Discrete Flows via Linker Assembly for Targeted PROTAC Engineering

    Ziang Li, Ruoxi Zhang, Pranam Chatterjee · PDF
  42. DBMol: Design of High-Affinity, Target-Specific Small Molecules through Structure Prediction Model

    Yiming QIN, Kai Yi, Miruna Cretu, Sjors HW Scheres, Pietro Lio, Pascal Frossard · PDF
  43. De Novo Generation of Odorant Molecules with Targeted Olfactory Receptor Activation Patterns

    Kexin Zhang, Manhao Guan · PDF
  44. Decoding Loss-of-Function Variants with Sparse Concept Features of ESM-2

    sungnam kim, Doyeon Ha · PDF
  45. Deep Generative Models for Phylogenetic Inference with Complex Evolutionary Processes

    Ethan Baron, Alan Nawzad Amin, Andrew Gordon Wilson · PDF
  46. DeepRoot: A KG-Coordinated Multi-Agent System for Therapeutic Reasoning over Historical Medical Texts

    Zijian Carl Ma, Sean J. Wang, Sijbren Manuel Kramer, Li Erran Li · PDF
  47. DELBERT-2: Pretrained Fingerprint Language Models for DEL Protein Binder Prediction

    Bing Xu Hu, Sun Sun, Shaik salman basha, Anita Layton, Helen Hong Chen · PDF
  48. Density-guided AlphaFold reveals unmodeled alternative turn conformations in protein structures

    Shuqin Zhang, Sai Advaith Maddipatla, Aviv A. Rosenberg, Sanketh Vedula, Alexander Bronstein, Ailie Marx · PDF
  49. Design-CP: Context Parallelism for Design of Protein Nanoparticles

    Lorenzo Tarricone, Helen Elizabeth Eisenach, Aiko Muraishi, Charlotte Deane · PDF
  50. Diamond Maps for Protein Binder Design: Inference-Time Scaling Survives Stochastic Flow Map Distillation

    Arthur Liang · PDF
  51. DNA Compression with Genomic Language Models: Tokenization, Benchmarking, and an Information-Content Map

    Vojtěch Máčala, Petr Simecek · PDF
  52. DrugSAGE: Self-evolving Agent Experience for Efficient State-of-the-Art Drug Discovery

    Yikun Zhang, Xiwei Cheng, Tianyu Liu, Yuanqi Du, Wengong Jin · PDF
  53. ELISA: An Interpretable Hybrid Generative AI Agent for Expression-Grounded Discovery in Single-Cell Genomics

    Omar Coser · PDF
  54. Elucidating the Design Space of Generative Models for Single-Cell Perturbation Prediction

    Sanjukta Bhattacharya, Christian Gensbigler, Shaamil Karim · PDF
  55. EpiCLIP: Learning Antibody-Antigen Interactions from Approximate Interfaces

    Joseph Boen, Imee Sinha, Samuel Don Stanton, Pranav Rao, Robert G Alberstein, Simon Kelow · PDF
  56. Evaluating H5N1 Vaccine Durability using Computationally-Designed Proteins

    Navami Jain, Noor Youssef, Sarah Gurev, Debora Susan Marks · PDF
  57. Evaluating out of distribution generalization of protein language models

    Kumaresh Krishnan, Arpan Sarkar, Sean R Eddy · PDF
  58. Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design

    Shriram Chennakesavalu, Kirill Shmilovich, Hayley Weir, Colin A Grambow, John Bradshaw, Patricia Suriana, Chen Cheng, Kangway V. Chuang · PDF
  59. EvoStruct: Bridging Evolutionary and Structural Priors for Antibody CDR Design via Protein Language Model Adaptation

    Mansoor Ahmed, Sujin Lee, Umar Khayaz, Murray Patterson · PDF
  60. Factorized Search and Cartography of Synthon-Based Chemical Spaces

    Miroslav Lžičař · PDF
  61. Few-Step Cofolding with All-Atom Flow Maps

    Gianluca Scarpellini, Ron Shprints, Peter Holderrieth, Juno Nam, Pranav Murugan, Rafael Gomez-Bombarelli, Tommi Jaakkola, Maruan Al-Shedivat, Nicholas Matthew Boffi, Joey Bose · PDF
  62. FORGE: Fragment-Oriented Ranking and Generation for Context-Aware Molecular Optimization

    Qingchuan Zhang, He CAO, Hao Li, Yanjun Shao, Zhiyuan Liu, Shihang Wang, Shufang Xie, Shenghua Gao, Xinwu Ye · PDF
  63. GDTR: Layer-wise Settling Depth Reveals Biological Grammar in Genomic Foundation Models

    Yoonjin Cho, Jiheon Kang, Subin Park, Sangwoo Kim · PDF
  64. GEMS: Molecular Structure Identification via Geodesic Navigation of the Isomer Manifold

    Utku Umur ACIKALIN, Yingheng Wang, Goncalo J. Gouveia, Tyler Schwertfeger, Frank C Schroeder, Carla P Gomes · PDF
  65. Gene-Embedding Perturbation Operators for Zero-Shot and Transferable Prediction of Transcriptional Responses

    Bryan Cheng, Austin Jin, Jasper Zhang · PDF
  66. Generalise or Memorise? Benchmarking Ligand-Conditioned Protein Generation

    Alex Vicente-Sola, Joan Coines, Lars J. Dornfeld, Noelia Ferruz · PDF
  67. Generalization of Protein Foundation Models for Engineered Fluorescent Biosensors

    Anirudh Palutla, Caroline Malin-Mayor, John N. Koberstein, Alison G. Tebo, Srinivas C Turaga · PDF
  68. Generating and decoding methylated DNA with a Human Epigenetic Foundation Model

    Pouya Niki, Christoforos Nalmpantis, Javkhlan-Ochir Ganbat, Donal Byrne, Pooja Kathail, Husam Babikir, Anjeet Jhutty, Andrey Karailiev, Francisco M Martín-Zamora, Luca Giacomoni, Timing Liu, Netanel Loyfer, Ivan Koychev, Hannah Madan, Jonathan C M Wan, Ravi Solanki · PDF
  69. Generative design of intrinsically disordered protein regions with IDiom

    Jason Liu, Sebastian Ibarraran, Frank Hu, Abigail Park, Alexander R. Dunn, Grant M. Rotskoff · PDF
  70. Generative Modeling of Solvated Biomolecules

    MinGyu Choi, Derek Chen, Gloria Ma, Tommi Jaakkola, Regina Barzilay · PDF
  71. Generative Priors for Cryo-EM Image Reconstruction

    Zain Shabeeb, Daniel Saeedi, Darin Tsui, Vida Jamali, Amirali Aghazadeh · PDF
  72. GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining

    Shaoheng Yan, Zian Li, Muhan Zhang · PDF
  73. GOAgent: Tool-Orchestrating Language Agents for Protein Function Annotation

    Manvitha Ponnapati, Brian Lynch, JOSEPH JACOBSON · PDF
  74. GPA: Generative Population Annealing for Test-Time Sequence Design

    Anirban Sarkar, Alejandra Duran, Peter K Koo · PDF
  75. Harmonic Torsional Diffusion for Protein-Ligand Flexible Docking

    Maksim Zhdanov, Pavel Strashnov, Vladislav Kurenkov · PDF
  76. Hepa-RAFT: Retrieval-Augmented Virtual Hepatocyte Responses for Hepatotoxicity Prediction

    CHOI YOUNG SEOK · PDF
  77. How Do Co-folding Models Organize Structural Information?

    Minha Park, Shinwoo Kim, Seokhyun Moon, Hyeongwoo Kim, Gyuho Jeon, Woo Youn Kim · PDF
  78. Hybrid Flow Matching in Billera-Holmes-Vogtmann Tree Space for Generative Phylogenetic Inference

    Yasha Ektefaie, Jiabo Cui, Shrey Jain, Marinka Zitnik, Pardis Sabeti · PDF
  79. Identification of Heterogeneous Erlotinib Response Gene Sets Using Sample-Specific Counterfactual Causal Attribution

    Hyekyoung Lee, Seungjin Choi, Hyunjin Shin · PDF
  80. Improving the Efficacy of Test-Time Steering in Masked Diffusion Models with Parallel Tempering

    Po-Yi Lu, Hsuan-Tien Lin, Shih-Hsin Wang · PDF
  81. IRIS: An Agentic Multi-Phase Framework for Automated Scientific Literature Review

    Sergey Kolchenko, Mahdi Zamanighomi, Amir Bayegan · PDF
  82. IsoPLM: Isolating the Impacts of Architecture on Protein Language Models

    Yash Semlani, Ruitong Li, Frederick Hoffman, Nauman Javed, Krithik Ramesh · PDF
  83. Just Add Structure: Protein Language Models Combined with Structural Equivariance Excel at Protein Tasks

    Qurat ul ain, Yee Whye Teh, Carlos Outeiral, Matteo Cagiada, Charlotte Deane · PDF
  84. Knowing When to Stop: Pertura for Graph-Enforced PI Gating in Perturb-seq Agents

    Tianyi Wang · PDF
  85. Large-scale sequence modeling of antibody-antigen binding specificity

    Fiona Qu, Sarah Gurev, Noor Youssef, Murphy Angelo, Debora Susan Marks · PDF
  86. Learning Clinical-Trial Strategy: Offline Policy Training for Decision Agents

    William James Bolton, Philip Torr · PDF
  87. LeFlur: A Biomolecular Design Model with Latent Structure Tokens

    Sidney L Lisanza, Karina Zadorozhny, Frederic A Dreyer, Kyunghyun Cho · PDF
  88. LLM-Assisted versus Agentic Approaches to De Novo Minibinder Design for a KRAS G12D Neoantigen

    Yilan Wang, Aaron W Kollasch · PDF
  89. LLM-guided acquisition improves pathway-specific Perturb-seq design under experimental budgets

    Malaika Aiyar, Kanglu Pei, Sisi Qu, Philip Torr, Christian Schroeder de Witt, William James Bolton, Jonathan G. Hedley · PDF
  90. MassSpecGym in the Wild: Uncovering and Correcting Evaluation Pitfalls in AI-Driven Molecule Discovery

    Hongxuan Liu, Roman Bushuiev, Ivy Lightheart, Mrunali Manjrekar, Anton Bushuiev, Magdalena Lederbauer, Filip Jozefov, Yinkai Wang, Soha Hassoun, Josef Sivic, James Taylor, Runzhong Wang, David Healey, Tomas Pluskal, Connor W. Coley · PDF
  91. Measure-to-measure Regression with Transformers

    Matthew Vandergrift, Martha White, Yury Polyanskiy, Philippe Rigollet, Lazar Atanackovic · PDF
  92. Mechanisms Matter: Transportability of Cellular Perturbation Effects

    Shi-ang Qi, Paidamoyo Chapfuwa · PDF
  93. MoCDiff: Efficient Motif-Constrained Discrete Diffusion for Molecule Generation

    Tasfia Nuzhat Ornee, Mohammad Jahid Ibna Basher, Siddhi Kanta Mishra, Ozlem Garibay, Ivan Garibay, Niloofar Yousefi · PDF
  94. MolOpt-Eval: Can Frontier LLMs Perform Structure-Based Hit-to-Lead Optimization?

    Chengzhu Li, Haichuan Tan, Wenyu Zhu, Bowen Gao, Jiqing Zheng, Ya-Qin Zhang, Wei-Ying Ma, Yanyan Lan · PDF
  95. MotifCraft: scalable functional protein binder design with AlphaFold2 hallucination

    Océane Follonier, Torsten Schwede, Janani Durairaj · PDF
  96. Multi-Scale Flow Matching for Continuous-time Generative Modeling of Spatiotemporal Tissue Dynamics from Spatial Transcriptomics

    Pinar Demetci, Timothy P Guan, Bo Xia, Lazar Atanackovic · PDF
  97. NanoFold: Designing Reproducible Protein Structure Benchmarks through Principled Sampling

    Chris Hayduk, Krithik Ramesh · PDF
  98. Natural-Language-Guided Generator-Agnostic Shortlisting for Protein Binder Design

    Gyubok Lee, Kiwoong Yoo, Jimin Seo, Kyunghoon Hur, Edward Choi · PDF
  99. Order-Agnostic Decoding for Sample-Efficient RNA Inverse Folding

    Antonia Panescu, Shujun He, Yixuan He, Rex Ying · PDF
  100. PACE: Geometry-Aware Bridge Transport for Single-Cell Trajectory Inference

    Chenglei Yu, Chuanrui Wang, Bangyan Liao, Tailin Wu · PDF
  101. PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling

    Xinyu Yuan, Xixian Liu, Ya Shi Zhang, Zuobai Zhang, Hongyu Guo, Jian Tang · PDF
  102. Phase-Calibrated Steering of Protein Diffusion Language Models

    Kun Hyung Roh, Vincent Yip, Sunghoon Rho · PDF
  103. Phenotype-Conditioned Drug Repurposing for Undiagnosed Rare Disease Patients via Graph Neural Networks and LLM Hybridization

    Beatrice Bihui Chen, Qiaohan Xu, Jiale Yang · PDF
  104. PIGEON: Pocket-Inferred Geometric Ensemble Flexible Docking

    Tong Chen, Maximilian Holsman, Pranam Chatterjee · PDF
  105. PlasmidLM: A Promptable DNA Language Model via Verifiable-Reward Post-Training

    McClain Thiel, Chris P Barnes · PDF
  106. pLM-Guided Inverse Folding for Antibody Sequence Design

    Valentin Noske, Felix Koulischer, Kathleen Marchal, Thomas Demeester · PDF
  107. Position: AI for Drug Discovery Models Often Do Not Learn as Expected and How to Diagnose These Failure Modes

    Nikhil Branson, Aaron Wenteler, Guy Durant, Charlotte Deane · PDF
  108. PRiMeFlow: capturing complex expression heterogeneity in perturbation response modelling

    Zichao Yan, Yan Wu, Mica Xu Ji, Chaitra Agrahar, Esther Wershof, Marcel Nassar, Mehrshad Sadria, Ridvan Eksi, Vladimir Trifonov, Ignacio L. Ibarra, Telmo Felgueira, Błażej Osiński, Rory Stark · PDF
  109. Probing coexistence of robust threshold and ultrasensitivity in molecular switches and cascades

    Rushmila Shehreen Khan, Md. Shahriar Karim · PDF
  110. Progressive Multi-Agent Reasoning for Biological Perturbation Prediction

    Hyomin Kim, Sang-Yeon Hwang, Jaechang Lim, Yinhua Piao, Yunhak Oh, Woo Youn Kim, Chanyoung Park, Sungsoo Ahn, Junhyeok Jeon · PDF
  111. PROPHET: Phylogenetically Robust Antiviral Peptide Design Against Heterogeneous Evolutionary Trajectories

    Kimberly Liang, Navya Nori, Elizabeth H Mahood, Yinuo Zhang, Pranam Chatterjee · PDF
  112. Proteo-R1: Reasoning Foundation Models for De Novo Antibody Design

    Fang Wu, Weihao Xuan, Heli Qi, Heng-Jui Chang, Hanqun Cao, Zeqi Zhou, Haokai Zhao, Jian Ma, Zijian Carl Ma, Heng-Jui Chang, Xiangru Tang, Zehong Wang, Kuan Pang, Hanchen, Kejun Ying, Chiho Im, Yinxi Li, Tinson Xu, Deyao Zhu, Peng Xia, Seungju Han, Pan Lu, Guanlue Li, Pheng-Ann Heng, Naoto Yokoya, Masashi Sugiyama, Li Erran Li, Jure Leskovec, Yejin Choi · PDF
  113. ProteomeLM: A Proteome-Scale Language Model Enables Accurate and Rapid Prediction of Protein-Protein Interactions and Gene Essentiality Across Taxa

    Cyril Malbranke, Gionata Paolo Zalaffi, Anne-Florence Bitbol · PDF
  114. Proteomic Divergence in the Trisomic Mouse Cortex: Machine Learning Identifies Tau, APP, and ADARB1 as Key Genotype Signatures and Reveals Limited Proteomic Response to Memantine

    Calvin H. Cho · PDF
  115. ProtoCol: Late Interaction Retrieval for Protein Homolog Search

    Gabrielle Cohn, Rohan Gumaste, Minh Hoang, Vihan Lakshman · PDF
  116. ProtQueSt: Query-Conditioned Retrieval-Augmented Generation for Protein Function Annotation

    Linrui Ma, Yiwei Liang, Yishu Yu, Chuhan Joyce Qi · PDF
  117. Pushing Biomolecular Utility-Diversity Frontiers with Supergroup Relative Policy Optimization

    Xinwu Ye, He CAO, Hao Li, Bin Feng, Zijing Liu, Xiangru Tang, Yu Li, Shenghua Gao · PDF
  118. Representative vs. Load-bearing Layers: A Dissociation in Genomic Foundation Models

    Yoonjin Cho, Min Seok Kim, Sangwoo Kim · PDF
  119. Rethinking Diffusion Models with Symmetries through Canonicalization with Applications to Molecular Graph Generation

    Cai Zhou, Zijie Chen, Zian Li, Jike Wang, Kaiyi Jiang, Pan Li, Rose Yu, Muhan Zhang, Stephen Bates, Tommi Jaakkola · PDF
  120. Rethinking Self-Consistency in Protein Generative Models

    Minji Lee, Jaeyeon Kim, Yeqing Lin, Mohammed AlQuraishi · PDF
  121. RobustDock: Robust Generative Flexible Docking with Long-Tailed Data

    Wenyin Zhou, Erik Englesson, Mert Can Kurucu, Hossein Azizpour · PDF
  122. Scaling Pocket Docking with Data Augmentation and Heterogeneous Equivariant Graph Attention

    Huyen Nguyen, Tuan Le, Kim-Loc Nguyen, Yuxing Peng, Emine Kucukbenli, Mai Thi Hien, Steven Truong, Van Ha Tang · PDF
  123. Self-Distillation for Continual Learning in Masked Language Models

    Ali Saadat, Jacques Fellay · PDF
  124. Self-Supervised Contextual Representation Learning for Transcriptomic Generative AI

    SeyedMohsen Hosseini, Divya Sharma · PDF
  125. Shifting a Molecular Generator Toward Developability with Iterative Importance Fine-Tuning

    Alex Berlaga, Andrew Ferguson · PDF
  126. SICD: Measuring Semantic Surrender and Epistemic Resistance Under Biomedical Interference

    Jacob Dang, Patrick Mazza, Shouraya Pendgaonkar · PDF
  127. Simplified motif background model provides significant speed-up for regulatory activity inference

    Asta Mannstaedt Rasmussen, Jakob Skou Pedersen · PDF
  128. Site4Drug: Predicting Drug-Binding Target Sites with an AI Agent

    Taehan Kim, Sarrah Mikhail Leung, Bharat Mekala, Jeongbin Park · PDF
  129. SMDD-Bench: Can LLMs Solve Real-World Small Molecule Drug Design Tasks?

    Kevin Han, Renfei Zhang, Kathy Y Wei, Hamed Mahdavi, Niloofar Mireshghallah, Amir Barati Farimani · PDF
  130. SOAPIA: Specificity-Guided Generation of Off-Target-Avoiding Protein Interactions with High Target Affinity

    Sophia Vincoff, Pranam Chatterjee · PDF
  131. Spectral Diffusion for Protein Dynamics

    Hew Phipps, Matteo Cagiada, Santiago David Villalba, Charlotte Deane · PDF
  132. SPROUT: Steered Plant Promoter Editing via Rollout-Guided Utility Tilting of Edit Flows

    Elizabeth H Mahood, Pranam Chatterjee · PDF
  133. ST-JEPA: Joint-Embedding Predictive Architecture for Spatial Transcriptomics

    Sebastian Birk, Amirhossein Vahidi, Mohammad Vali Sanian, Arpit Merchant, Mohammad Lotfollahi · PDF
  134. Steering Sequence Generation in Protein Language Models through Iterative Lookback Monte Carlo Sampling

    Francesco Calvanese, Gianluca Lombardi, Martin Weigt, Jorge FERNANDEZ-DE-COSSIO-DIAZ · PDF
  135. Stochastic Path Integral Formalism of Causal Field Theory

    Stefan Groha, Ching-Hao Wang, Bernhard Schölkopf, Arash Mehrjou · PDF
  136. Structure-Guided Reinforcement Learning for High-Affinity Antibody Design

    Hanqun Cao, Shuaike Shen, Weihao Xuan, Jian Ma, Pheng-Ann Heng, Fang Wu · PDF
  137. SurfDesign: Effective Protein Design on Molecular Surfaces

    Fang Wu, Shuting Jin, Xiangru Tang, Mark Gerstein, Xiangxiang Zeng, Yejin Choi, Jure Leskovec, Jinbo Xu · PDF
  138. Synthesis Tamper-evident Attestation and Molecular Provenance (STAMP): Cryptographic Molecular Barcoding for DNA Synthesizers

    Nicole Lai-Lopez, Ethan Lai · PDF
  139. Synthon Contrastive Learning for Synthesizable 3D Molecule Generation

    Nahyun Kim, Seul Lee, Sung Ju Hwang · PDF
  140. SynthonBench: Benchmarking Sample-Efficient Optimization in Combinatorial Chemical Spaces

    Miroslav Lžičař · PDF
  141. The Hallucination Dependence Index: A Cross-Condition Diagnostic for Clinical-LLM Faithfulness

    Ishan Gonehal, Hanson Wen, Bowman Novey · PDF
  142. Token-Only Adaptation of Frozen Self-Supervised Vision Foundation Models for Cross-Species Animal Pose: A Pareto-Frontier Characterization Across Eight Held-Out Mammal Species

    Ethan Y Wang, Aayan Alwani · PDF
  143. Token-Wise Residual Latent Adapters: Steering Seq2Seq Models for Protein Fitness Extrapolation

    Steven Wu, Mostafa Karimi, Sharmi Banerjee, Peng Gao, Jonah Noh, Jiachen Li, Robert Jiang, Bella Dubrov, Shang Shang, Hao Song · PDF
  144. ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery

    Andrew Y. Zhou, Sharvaree Vadgama, Sumanth Varambally, Peter Eckmann, Michael K Gilson, Rose Yu · PDF
  145. Towards an Agentic AI Framework for Generating, Optimizing and Filtering Protein Binders

    Mohammad Amaan Sayeed, Boulbaba Ben Amor · PDF
  146. Towards Autonomous Mechanistic Reasoning in Virtual Cells

    Yunhui Jang, Lu Zhu, Jake Fawkes, Alisandra Kaye Denton, Dominique Beaini, Emmanuel Noutahi · PDF
  147. Toxin Feature Hierarchy in ESM-2: Mechanistic Interpretability reveals Why Frozen Probes Resist ProteinMPNN Redesign

    Manan Wadhwa, Shivam Dubey · PDF
  148. Two-Stage Fine-Tuning for Protein Sequence Generation with Targeted Amino-Acid Composition

    Violeta Basten Romero, VICTOR GUALLAR, Isaac Filella-Merce, Rubén Muñoz-Tafalla, Anna M. Diaz-Rovira, Bertran Miquel-Oliver · PDF
  149. Uncertainty-Aware Oracle-Concordance Steering for Reliable Generative Design

    Wenhui Sophia Lu, Xiaowei Zhang, Xiaojing J Gao, Dominik Rothenhaeusler, Wing Hung Wong · PDF
  150. Unified sampling framework and benchmarking of sequence- and structure-based protein models

    Aviv Spinner, Pascal Notin, Samuel P Berry, Dana Cortade, Zach Sisson, Svetlana P Ikonomova, David Ross, Debora Susan Marks · PDF
  151. VarLitBench and VarLitAgent for Benchmarking and Automating LLM-Assisted Functional Evidence Curation in Genomic Variant Interpretation

    Ali Saadat, Jacques Fellay · PDF
  152. What Does a Chromatin Foundation Model Know About a Petri Dish? Sparse Autoencoders Reveal In Vitro vs. In Vivo Context in EPIBERT

    Nicole Ching, Ayushi Mehrotra · PDF
  153. Where Simple Baselines Fail: Mapping the Modeling Frontier of Perturbation Prediction

    Anna Kalygina, Alexander Theus, Marina Esteban-Medina, Valentina Boeva · PDF