ICML 2026 Past Large language modelsAI for scienceMultimodal

ICML 2026 3rd Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences

ICML 2026 FM4LS Workshop

Submission deadline
May 11, 2026, 16:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (83)

Fetched from OpenReview (v2) on 2026-06-10.

  1. AgriPerceiver: A Parameter-Efficient Vision-Language Model for Structured Macroscopic Crop Phenotyping

    Vatsal Khanna, Davinder Singh
  2. AMR-Bench-mini: A Diagnostic Benchmark for Agentic Mechanistic AMR Reasoning under Evidence Insufficiency

    Ali Ihsan Nergiz, Erva Nur Cinar
  3. ARO: Aligned Representation learning for multi-Omics data

    Amogh Singh, Yash Shah, Chiara D'Ercoli, Arash Mehrjou, Patrick Schwab, Timothy M. Jones, Pietro Lio
  4. Auditing Generative Graph Foundation Models for Connectomics: A Score × Predictor × Sampler Decomposition on Real Sparse Directed Connectomes

    Ethan Y Wang, Aayan Alwani
  5. Beyond Nativeness: Viral Proteins in Protein Language Models

    Arthur Bigot, Harmon Bhasin, Core Francisco Park, Eugene Shakhnovich, Dianzhuo Wang · PDF
  6. Biophysics-Guided Genomic Foundation Models via Attention Gating

    Bryan Cheng
  7. Bolek: A Multimodal Language Model for Molecular Reasoning

    Frederic Grabowski, Jacek Szczerbiński, Maciej Jaśkowski, Kalina Jasinska-Kobus, Pawel Dabrowski-Tumanski, Tomasz Jetka, Bartosz Topolski
  8. Causal-IQD-DTA: Counterfactual Interaction-Quality Disentanglement for Robust Drug–Target Affinity Prediction

    khan huang, Chang Li
  9. Cell Painting Generates Single-Cell Transcriptomics via Conditional Diffusion

    Reed Naidoo, Jingyu Hu, Giuseppe Tripodi, Chris Bakal, Tapabrata Chakraborti
  10. ConfPert: Distribution-Free Conformal Coverage for Single-Cell Perturbation Predictors

    Aayan Alwani, Ethan Y Wang
  11. Contrastive Learning for Gene Set Enrichment Analysis Post-Processing

    Leonardo P.A. Biral, Sandeep Dave · PDF
  12. Cross-modal transfer learning for mapping bulk transcriptomes at cellular level

    Aarthi Venkat, Zhiwen Jiang, Daniel Marbach, Nir Hacohen, Marinka Zitnik · PDF
  13. DELBERT-2: Pretrained Fingerprint Language Models for DEL Protein Binder Prediction

    Bing Xu Hu, Sun Sun, Shaik salman basha, Anita Layton, Helen Hong Chen
  14. Detecting Sparse Colorectal Cancer Signals from Multi-Modal Cell-Free DNA Representations Using Modern Hopfield Attention

    Michael Widrich, Anooj Patel, Elisabeth Rumetshofer, Peter Ulz, Kaitlyn Coil, Thomas Royce, Cheng-Ho Jimmy Lin, Richard Bourgon, Anindita Dutta
  15. Do Clinical VLMs Need Dense Visual Tokens? Probing Spatial Grounding in Radiology Report Generation

    Leopoldo Julian Lechuga Lopez, Tim G. J. Rudner, Farah E. Shamout
  16. DrugAgent: Reliable Multi-Agent Aggregation under Conflicting Biomedical Evidence

    Yoshitaka Inoue, Tianci Song, Xinling Wang, Augustin Luna, Tianfan Fu · PDF
  17. E1: Retrieval-Augmented Protein Encoder Models

    Sarthak Jain, Joel Beazer, Jeffrey A. Ruffolo, Aadyot Bhatnagar, Ali Madani · PDF
  18. Empirical Observations on Parameter Scaling in Chemical Language Models

    Artur Safrastyan, Hrant Khachatrian · PDF
  19. ERVNet: A Three-Module Framework for Predicting Endogenous Retrovirus Reactivation, Gene Propagation, and Immunogenicity

    Bryan Cheng, Austin Jin, Jasper Zhang, Brendan Lo, Arnav Pemmaraju, Joshua Chang
  20. esm-bind: How much protein–RNA binding signal is already in frozen ESM-2 + RNA-FM representations?

    Ethan B Arnold
  21. Exploring Set-Aggregated Genome Embeddings for Microbiome Abundance Prediction

    Younhun Kim, Georg K. Gerber, Travis E Gibson · PDF
  22. Few-Shot Biomedical Image Classification by Alignment of Independently Pretrained Encoders

    Daniel Shalam, Simon Korman
  23. Gene-Embedding Perturbation Operators for Zero-Shot and Transferable Prediction of Transcriptional Responses

    Bryan Cheng, Austin Jin, Jasper Zhang
  24. Generalization of Protein Foundation Models for Engineered Fluorescent Biosensors

    Anirudh Palutla, Caroline Malin-Mayor, John N. Koberstein, Alison G. Tebo, Srinivas C Turaga
  25. GeneZip: Region-Aware Compression for Long Context DNA Modeling

    Jianan Zhao, Xixian Liu, Zhihao Zhan, Xinyu Yuan, Hongyu Guo, Jian Tang
  26. GFETM: Genome Foundation-based Embedded Topic Model for scATAC-seq Modeling

    Yimin Fan, Yu Li, Jun Ding, Yue Li
  27. Group Contrastive Learning for Weakly Paired Multimodal Data

    Aditya Gorla, Hugues Van Assel, Jan-Christian Huetter, Heming Yao, Kyunghyun Cho, Aviv Regev, Russell Littman
  28. HealthBot: An Open-Source AI Assistant for Longitudinal Personal Health Management

    Zheng Yu, Yanyuan Qiao, Qi Wu, Yutong Xie · PDF
  29. HistoTx: Early fusion of H&E images and spatial transcriptomics at varying spatial transcriptomics resolution with self-supervised learning

    Alice Driessen, Steffen Wolf, Emil Malmsten, Bernd Illing, Isinsu Katircioglu, Adriano Martinelli, Raphael Gottardo, Marianna Rapsomaniki, Fei Tang
  30. How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images

    Guimeng Liu, Tianze Yu, Somayeh Ebrahimkhani, Lin Zhi Zheng Shawn, Kok Pin Ng, Ngai-Man Cheung
  31. ImmunoFoundation: A Multimodal Foundation Model for Immunogenicity Prediction and Peptide Optimization

    João Felipe Rocha, Hiren Madhu, Jenny Yongjia Liu, Apurva Mishra, Chen Liu, Rishabh Anand, Rex Ying, Smita Krishnaswamy · PDF
  32. Is PEFT Enough for Cell Segmentation? An Empirical No-Go Result on Frozen Foundation Models

    Joonsuk Hwang, Hyowon Woo, Hyobin Seok, Jangwon Seo, Jeong Hoon Lee
  33. Learning Protein Fitness Landscapes with Multimodal Stability Priors

    Shannon SQ Zhang, Yunan Luo · PDF
  34. Lightweight Alignment of Unimodal Foundation Models for Metabolite Identification

    Paul Krzakala, Gabriel Melo, Camille Lançon, Charlotte Laclau, Rémi Flamary, Etienne A. Thévenot, Florence d'Alché-Buc · PDF
  35. LLMs Can Learn the Language of the Microbiome

    Neythen J Treloar, Jenny Yang, Saif Ur-Rehman · PDF
  36. Local-Atlas Control-Anchored Flow Matching for Unpaired Single-Cell Perturbation Prediction

    khan huang, Chang Li
  37. Marking the Wrong Symptoms: Evaluating LLM Watermarks in Medical Texts

    Melanie Rieff, Robin Staab, Thibaud Gloaguen, Stefan Hegselmann, Martin Vechev
  38. Mechanistic Synergy in Multi-Modal VEP: DNA Context Complements PLMs under Biophysical Constraints

    Yoojin Kim, Doyeon Ha · PDF
  39. Medmarks: An Open-Source LLM Benchmark Suite for Medical Tasks

    Benjamin Warner, Ratna Sagari Grandhi, Max Kieffer, Aymane Ouraq, Saurav Panigrahi, Geetu Ambwani, Kunal Bagga, Nikhil Khandekar, Arya Hariharan, Nishant Mishra, Manish Ram, Shamus Sim Zi Yang, Ahmed Essouaied, Adepoju Jeremiah Moyondafoluwa, Robert Scholz, Bofeng Huang, Molly Beavers, Srishti Gureja, Anish Mahishi, Sameed Khan, Maxime Griot, Hunar Batra, Jean-Benoit Delbrouck, Siddhant Bharadwaj, Ronald Clark, Ashish Vashist, Anas Zafar, Leema Krishna Murali, Harsh Deshpande, Ameen Patel, William Brown, Johannes Hagemann, Connor Lane, Paul Steven Scotti, Tanishq Mathew Abraham
  40. MESH-HR: Multimodal Fusion of Somatic DNA Profiles and Histopathology for Continuous Breast Cancer Receptor Subtyping via LLM-Assisted Annotation

    Shaye Carver, Kodi Taraszka, Intae Moon, Alexander Gusev
  41. Mode-Aware Phenotype Profiling from Korean Clinical Reports: An LLM-Derived Two-Layer Fingerprint for Autism Characterization

    Yeojin Ryu, Miae Oh, Joon-Yong An
  42. MolEmb: Multimodal Large Language Models Can Be Strong Molecular Embedding Models

    Xinjian Zhao, Xiangru Jian, Yaoyao Xu, Xiaozhuang Song, Wei Pang, LEI BAI, Tianshu Yu
  43. Neuro-Anatomy–Informed Self-Supervised Learning for Structural Brain MRI

    Sean Chen Jiale, Tianze Yu, Somayeh Ebrahimkhani, Lin Zhi Zheng Shawn, Kok Pin Ng, Ngai-Man Cheung
  44. OmicsDefense: The First Unified Framework for Defending Against Backdoor Attacks in Single-cell Foundation Models

    Bioclaw Team, Peiting Shi
  45. OmicsLM: A Multimodal Large Language Model for Multi-Sample Omics Reasoning

    Maciej Sypetkowski, Joanna Krawczyk, Łukasz Smoliński, Remigiusz Kinas, Przemysław Pietrzak, Tomasz Jetka, Rafal Powalski
  46. PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder

    Yancheng Liu, Kenichi Maeda, Manan Pancholy
  47. PaSTel: Anchoring Histology in Spatial Transcriptomics via Multi-Scale Hierarchical Bio-Prior Contrastive Pretraining

    Azim Dehghani Amirabad, Junchao Zhu, Pushpak Pati, Walid Abdelmoula, Tommaso Mansi, Rui Liao
  48. PertReasonQA: A Knowledge-Grounded Benchmark and Framework for Cell-State–Conditioned Mechanistic Reasoning of Perturbation Effects

    Dongkwan Kim, Yiming Gao, Yining Yang, Yang Shen
  49. PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling

    Xinyu Yuan, Xixian Liu, Ya Shi Zhang, Zuobai Zhang, Hongyu Guo, Jian Tang
  50. Position: AI for Drug Discovery Models Often Do Not Learn as Expected and How to Diagnose These Failure Modes

    Nikhil Branson, Aaron Wenteler, Guy Durant, Charlotte Deane
  51. Position: Multi-Modal LLMs for Video Behavioral Coding in High-Stakes Decision-Making Are Bounded by Polysemy, Not by Model Scale

    Zezheng Lin, Jinhao Gan
  52. Position: Multi-Omics Foundation Models Need a Modality Identifiability Standard, Not Just Aggregate Accuracy

    Zezheng Lin, Fengming Liu
  53. Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability

    Zezheng Lin, Fengming Liu
  54. Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability

    Zezheng Lin, Fengming Liu
  55. Pre-training on noncovalent interactions from synthetic protein-ligand structures to better predict binding affinity

    Arun Raja, Garrett A. Stevenson, Garrett M Morris, Jonathan E Allen, Hyojin Kim
  56. Predicting host-pathogen interactions using a proteome-scale language model

    Cyril Malbranke, Cecilia Fruet, Anne-Florence Bitbol
  57. PRIMA: a bidirectional state-space architecture and training approach for sequence modelling of protein-protein interactions

    Arturo Fiorellini-Bernardis, Sebastien Boyer, Jean QUENTIN, Ghassene Jebali, Oliver Bent · PDF
  58. Probing, Fusion, and Trustworthiness: A Systematic Evaluation of Foundation Model Representations for Multimodal Cancer Analysis

    Jingyu Hu, Giuseppe Tripodi, Reed Naidoo, Sarah F McGough, Tapabrata Chakraborti
  59. ProSAM: Modular and Energy-Guided Fine-Tuning of Protein Language Models for Structure Prediction

    Yiyang Zheng, Zhiguo Tao
  60. ProteinJEPA: Latent prediction complements protein language models

    Dan Ofer, Dafna Shahaf, Michal Linial
  61. ProteomeLM: A Proteome-Scale Language Model Enables Accurate and Rapid Prediction of Protein-Protein Interactions and Gene Essentiality Across Taxa

    Cyril Malbranke, Gionata Paolo Zalaffi, Anne-Florence Bitbol
  62. PROTEUS: Predicting How Post-Translational Modifications Alter Drug Binding Affinity

    Bryan Cheng, Austin Jin, Jasper Zhang, Arnav Pemmaraju, Joshua Chang, Brendan Lo
  63. ProtQueSt: Query-Conditioned Retrieval-Augmented Generation for Protein Function Annotation

    Linrui Ma, Yiwei Liang, Yishu Yu, Chuhan Joyce Qi · PDF
  64. ProtSent: Protein Sentence Transformers

    Dan Ofer, Oriel Perets, Michal Linial, Nadav Rappoport
  65. Retrieval-Augmented Foundation Model Enhances Risk Prediction Using Electronic Health Records

    Saeed Shurrab, Mariam Al-Omari, Dana El Samad, Farah E. Shamout
  66. SaNano - Structure Aware Transfer Learning For Data Limited Protein Modality

    Hugo Frelin, Andrei Kamenski, Paolo Marcatili · PDF
  67. Search, Edit, and Fold: LLM-Guided MSA Optimization for Protein Conformation Prediction

    Yu Pei, Jiangtao Feng, Hao Wang, Dongyu Xue, Keyue Qiu, Hao Zhou, Wei-Ying Ma
  68. Selective Benefits of Sequence-Drug Multimodal Learning for Antimicrobial Resistance Prediction

    Mateusz A. Wlodarski
  69. SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome

    Dabin Jeong, Amirhossein Vahidi, Ciro Ramírez-Suástegui, Marie Moullet, Kevin Ly, Mohammad Vali Sanian, Sebastian Birk, Yinshui Chang, Adam Boxall, Daniyal Jafree, Lloyd Steele, Vijaya Baskar MS, Muzlifah Haniffa, Mohammad Lotfollahi
  70. Single-Cell Cross-Modal Transfer by Adversarial Fine-Tuning of Foundation Models

    Joseph Boyd, Matthew Lyon, Martino Mansoldo, Christian John Hurry, Finnian Firth · PDF
  71. Structural Bottleneck Reasoning: Efficient Medical VQA via Concept Alignment

    Hoang Nguyen, Tran Anh Dang Le, Ta Duc Huy, Nguyen Bui Ngoc Han, Vy Tuong Dang, Duy Duong-Tran, Khoa D Doan, Anji Liu, Li Shen, Pengtao Xie, Mathias Niepert, Duy Minh Ho Nguyen, James Zou, Daniel Sonntag
  72. Survival-Relevant Directional Pathology–Omics Discordance from Frozen Whole-Slide Foundation Embeddings

    KuanTing Wu
  73. Synergy-Aware Contrastive Pretraining for Co-recorded Physiological Signals

    Lei Chen, Junetae Kim · PDF
  74. The Hallucination Dependence Index: A Cross-Condition Diagnostic for Clinical-LLM Faithfulness

    Ishan Gonehal, Hanson Wen, Bowman Novey
  75. Transcriptomics-Conditioned Virtual Tissue Synthesis via Diffusion Transformers

    Pantelis Vlachas, Kalin Nonchev, Viktor Koelzer, Gunnar Ratsch · PDF
  76. Transferable Lesion-Supervised Speech Representations for Post-Stroke Modelling

    Giulia Sanguedolce, Niamh Parkinson, Fatemeh Geranmayeh, Patrick Naylor
  77. TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction

    Seungik Cho · PDF
  78. Uncovering evolutionarily remote and highly potent antimicrobial peptides with protein language models

    Qinze Yu, Yu Li
  79. VFUSE: Virulent Feature Understanding with Sparse autoEncoders

    Michael Yu, Matthew Lyle Olson
  80. Virtual Cell Models Inflate Perturbation Effect Sizes and Undermine Causal Gene Regulatory Network Recovery

    Aayan Alwani, Ethan Y Wang
  81. VirtualCeLLM: A Comprehensive Benchmark and Guidance for Large Language Models in Cellular Biology

    Jingbo Zhou, Zhongqi Wang, Zhen Lei, Stan Z. Li, Chang Yu
  82. WACA-DTA: Water-Aware Geometric Biases for Structure-Conditioned Drug-Target Affinity Prediction

    khan huang, Chang Li
  83. What Makes a Virtual Cell a World Model? Three Gaps, Three Axes, and a Roadmap

    Chang Yu, Jingbo Zhou, Zhongqi Wang, Zhen Lei, Stan Z. Li