ICML 2026PastLarge language modelsAI for scienceMultimodal

ICML 2026 3rd Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences

ICML 2026 FM4LS Workshop

Official website ↗OpenReview venue ↗See all ICML workshops →✎ Edit this entry

Submission deadline: May 11, 2026, 16:59 UTC
OpenReview-synced 2026-05-11 16:59 UTC (as of 2026-06-23) — extensions on OpenReview are applied automatically; verify on the website.
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (83)

Fetched from OpenReview (v2) on 2026-06-10.

AgriPerceiver: A Parameter-Efficient Vision-Language Model for Structured Macroscopic Crop Phenotyping
Vatsal Khanna, Davinder Singh
AMR-Bench-mini: A Diagnostic Benchmark for Agentic Mechanistic AMR Reasoning under Evidence Insufficiency
Ali Ihsan Nergiz, Erva Nur Cinar
ARO: Aligned Representation learning for multi-Omics data
Amogh Singh, Yash Shah, Chiara D'Ercoli, Arash Mehrjou, Patrick Schwab, Timothy M. Jones, Pietro Lio
Auditing Generative Graph Foundation Models for Connectomics: A Score × Predictor × Sampler Decomposition on Real Sparse Directed Connectomes
Ethan Y Wang, Aayan Alwani
Beyond Nativeness: Viral Proteins in Protein Language Models
Arthur Bigot, Harmon Bhasin, Core Francisco Park, Eugene Shakhnovich, Dianzhuo Wang · PDF
Biophysics-Guided Genomic Foundation Models via Attention Gating
Bryan Cheng
Bolek: A Multimodal Language Model for Molecular Reasoning
Frederic Grabowski, Jacek Szczerbiński, Maciej Jaśkowski, Kalina Jasinska-Kobus, Pawel Dabrowski-Tumanski, Tomasz Jetka, Bartosz Topolski
Causal-IQD-DTA: Counterfactual Interaction-Quality Disentanglement for Robust Drug–Target Affinity Prediction
khan huang, Chang Li
Cell Painting Generates Single-Cell Transcriptomics via Conditional Diffusion
Reed Naidoo, Jingyu Hu, Giuseppe Tripodi, Chris Bakal, Tapabrata Chakraborti
ConfPert: Distribution-Free Conformal Coverage for Single-Cell Perturbation Predictors
Aayan Alwani, Ethan Y Wang
Contrastive Learning for Gene Set Enrichment Analysis Post-Processing
Leonardo P.A. Biral, Sandeep Dave · PDF
Cross-modal transfer learning for mapping bulk transcriptomes at cellular level
Aarthi Venkat, Zhiwen Jiang, Daniel Marbach, Nir Hacohen, Marinka Zitnik · PDF
DELBERT-2: Pretrained Fingerprint Language Models for DEL Protein Binder Prediction
Bing Xu Hu, Sun Sun, Shaik salman basha, Anita Layton, Helen Hong Chen
Detecting Sparse Colorectal Cancer Signals from Multi-Modal Cell-Free DNA Representations Using Modern Hopfield Attention
Michael Widrich, Anooj Patel, Elisabeth Rumetshofer, Peter Ulz, Kaitlyn Coil, Thomas Royce, Cheng-Ho Jimmy Lin, Richard Bourgon, Anindita Dutta
Do Clinical VLMs Need Dense Visual Tokens? Probing Spatial Grounding in Radiology Report Generation
Leopoldo Julian Lechuga Lopez, Tim G. J. Rudner, Farah E. Shamout
DrugAgent: Reliable Multi-Agent Aggregation under Conflicting Biomedical Evidence
Yoshitaka Inoue, Tianci Song, Xinling Wang, Augustin Luna, Tianfan Fu · PDF
E1: Retrieval-Augmented Protein Encoder Models
Sarthak Jain, Joel Beazer, Jeffrey A. Ruffolo, Aadyot Bhatnagar, Ali Madani · PDF
Empirical Observations on Parameter Scaling in Chemical Language Models
Artur Safrastyan, Hrant Khachatrian · PDF
ERVNet: A Three-Module Framework for Predicting Endogenous Retrovirus Reactivation, Gene Propagation, and Immunogenicity
Bryan Cheng, Austin Jin, Jasper Zhang, Brendan Lo, Arnav Pemmaraju, Joshua Chang
esm-bind: How much protein–RNA binding signal is already in frozen ESM-2 + RNA-FM representations?
Ethan B Arnold
Exploring Set-Aggregated Genome Embeddings for Microbiome Abundance Prediction
Younhun Kim, Georg K. Gerber, Travis E Gibson · PDF
Few-Shot Biomedical Image Classification by Alignment of Independently Pretrained Encoders
Daniel Shalam, Simon Korman
Gene-Embedding Perturbation Operators for Zero-Shot and Transferable Prediction of Transcriptional Responses
Bryan Cheng, Austin Jin, Jasper Zhang
Generalization of Protein Foundation Models for Engineered Fluorescent Biosensors
Anirudh Palutla, Caroline Malin-Mayor, John N. Koberstein, Alison G. Tebo, Srinivas C Turaga
GeneZip: Region-Aware Compression for Long Context DNA Modeling
Jianan Zhao, Xixian Liu, Zhihao Zhan, Xinyu Yuan, Hongyu Guo, Jian Tang
GFETM: Genome Foundation-based Embedded Topic Model for scATAC-seq Modeling
Yimin Fan, Yu Li, Jun Ding, Yue Li
Group Contrastive Learning for Weakly Paired Multimodal Data
Aditya Gorla, Hugues Van Assel, Jan-Christian Huetter, Heming Yao, Kyunghyun Cho, Aviv Regev, Russell Littman
HealthBot: An Open-Source AI Assistant for Longitudinal Personal Health Management
Zheng Yu, Yanyuan Qiao, Qi Wu, Yutong Xie · PDF
HistoTx: Early fusion of H&E images and spatial transcriptomics at varying spatial transcriptomics resolution with self-supervised learning
Alice Driessen, Steffen Wolf, Emil Malmsten, Bernd Illing, Isinsu Katircioglu, Adriano Martinelli, Raphael Gottardo, Marianna Rapsomaniki, Fei Tang
How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images
Guimeng Liu, Tianze Yu, Somayeh Ebrahimkhani, Lin Zhi Zheng Shawn, Kok Pin Ng, Ngai-Man Cheung
ImmunoFoundation: A Multimodal Foundation Model for Immunogenicity Prediction and Peptide Optimization
João Felipe Rocha, Hiren Madhu, Jenny Yongjia Liu, Apurva Mishra, Chen Liu, Rishabh Anand, Rex Ying, Smita Krishnaswamy · PDF
Is PEFT Enough for Cell Segmentation? An Empirical No-Go Result on Frozen Foundation Models
Joonsuk Hwang, Hyowon Woo, Hyobin Seok, Jangwon Seo, Jeong Hoon Lee
Learning Protein Fitness Landscapes with Multimodal Stability Priors
Shannon SQ Zhang, Yunan Luo · PDF
Lightweight Alignment of Unimodal Foundation Models for Metabolite Identification
Paul Krzakala, Gabriel Melo, Camille Lançon, Charlotte Laclau, Rémi Flamary, Etienne A. Thévenot, Florence d'Alché-Buc · PDF
LLMs Can Learn the Language of the Microbiome
Neythen J Treloar, Jenny Yang, Saif Ur-Rehman · PDF
Local-Atlas Control-Anchored Flow Matching for Unpaired Single-Cell Perturbation Prediction
khan huang, Chang Li
Marking the Wrong Symptoms: Evaluating LLM Watermarks in Medical Texts
Melanie Rieff, Robin Staab, Thibaud Gloaguen, Stefan Hegselmann, Martin Vechev
Mechanistic Synergy in Multi-Modal VEP: DNA Context Complements PLMs under Biophysical Constraints
Yoojin Kim, Doyeon Ha · PDF
Medmarks: An Open-Source LLM Benchmark Suite for Medical Tasks
Benjamin Warner, Ratna Sagari Grandhi, Max Kieffer, Aymane Ouraq, Saurav Panigrahi, Geetu Ambwani, Kunal Bagga, Nikhil Khandekar, Arya Hariharan, Nishant Mishra, Manish Ram, Shamus Sim Zi Yang, Ahmed Essouaied, Adepoju Jeremiah Moyondafoluwa, Robert Scholz, Bofeng Huang, Molly Beavers, Srishti Gureja, Anish Mahishi, Sameed Khan, Maxime Griot, Hunar Batra, Jean-Benoit Delbrouck, Siddhant Bharadwaj, Ronald Clark, Ashish Vashist, Anas Zafar, Leema Krishna Murali, Harsh Deshpande, Ameen Patel, William Brown, Johannes Hagemann, Connor Lane, Paul Steven Scotti, Tanishq Mathew Abraham
MESH-HR: Multimodal Fusion of Somatic DNA Profiles and Histopathology for Continuous Breast Cancer Receptor Subtyping via LLM-Assisted Annotation
Shaye Carver, Kodi Taraszka, Intae Moon, Alexander Gusev
Mode-Aware Phenotype Profiling from Korean Clinical Reports: An LLM-Derived Two-Layer Fingerprint for Autism Characterization
Yeojin Ryu, Miae Oh, Joon-Yong An
MolEmb: Multimodal Large Language Models Can Be Strong Molecular Embedding Models
Xinjian Zhao, Xiangru Jian, Yaoyao Xu, Xiaozhuang Song, Wei Pang, LEI BAI, Tianshu Yu
Neuro-Anatomy–Informed Self-Supervised Learning for Structural Brain MRI
Sean Chen Jiale, Tianze Yu, Somayeh Ebrahimkhani, Lin Zhi Zheng Shawn, Kok Pin Ng, Ngai-Man Cheung
OmicsDefense: The First Unified Framework for Defending Against Backdoor Attacks in Single-cell Foundation Models
Bioclaw Team, Peiting Shi
OmicsLM: A Multimodal Large Language Model for Multi-Sample Omics Reasoning
Maciej Sypetkowski, Joanna Krawczyk, Łukasz Smoliński, Remigiusz Kinas, Przemysław Pietrzak, Tomasz Jetka, Rafal Powalski
PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder
Yancheng Liu, Kenichi Maeda, Manan Pancholy
PaSTel: Anchoring Histology in Spatial Transcriptomics via Multi-Scale Hierarchical Bio-Prior Contrastive Pretraining
Azim Dehghani Amirabad, Junchao Zhu, Pushpak Pati, Walid Abdelmoula, Tommaso Mansi, Rui Liao
PertReasonQA: A Knowledge-Grounded Benchmark and Framework for Cell-State–Conditioned Mechanistic Reasoning of Perturbation Effects
Dongkwan Kim, Yiming Gao, Yining Yang, Yang Shen
PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling
Xinyu Yuan, Xixian Liu, Ya Shi Zhang, Zuobai Zhang, Hongyu Guo, Jian Tang
Position: AI for Drug Discovery Models Often Do Not Learn as Expected and How to Diagnose These Failure Modes
Nikhil Branson, Aaron Wenteler, Guy Durant, Charlotte Deane
Position: Multi-Modal LLMs for Video Behavioral Coding in High-Stakes Decision-Making Are Bounded by Polysemy, Not by Model Scale
Zezheng Lin, Jinhao Gan
Position: Multi-Omics Foundation Models Need a Modality Identifiability Standard, Not Just Aggregate Accuracy
Zezheng Lin, Fengming Liu
Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability
Zezheng Lin, Fengming Liu
Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability
Zezheng Lin, Fengming Liu
Pre-training on noncovalent interactions from synthetic protein-ligand structures to better predict binding affinity
Arun Raja, Garrett A. Stevenson, Garrett M Morris, Jonathan E Allen, Hyojin Kim
Predicting host-pathogen interactions using a proteome-scale language model
Cyril Malbranke, Cecilia Fruet, Anne-Florence Bitbol
PRIMA: a bidirectional state-space architecture and training approach for sequence modelling of protein-protein interactions
Arturo Fiorellini-Bernardis, Sebastien Boyer, Jean QUENTIN, Ghassene Jebali, Oliver Bent · PDF
Probing, Fusion, and Trustworthiness: A Systematic Evaluation of Foundation Model Representations for Multimodal Cancer Analysis
Jingyu Hu, Giuseppe Tripodi, Reed Naidoo, Sarah F McGough, Tapabrata Chakraborti
ProSAM: Modular and Energy-Guided Fine-Tuning of Protein Language Models for Structure Prediction
Yiyang Zheng, Zhiguo Tao
ProteinJEPA: Latent prediction complements protein language models
Dan Ofer, Dafna Shahaf, Michal Linial
ProteomeLM: A Proteome-Scale Language Model Enables Accurate and Rapid Prediction of Protein-Protein Interactions and Gene Essentiality Across Taxa
Cyril Malbranke, Gionata Paolo Zalaffi, Anne-Florence Bitbol
PROTEUS: Predicting How Post-Translational Modifications Alter Drug Binding Affinity
Bryan Cheng, Austin Jin, Jasper Zhang, Arnav Pemmaraju, Joshua Chang, Brendan Lo
ProtQueSt: Query-Conditioned Retrieval-Augmented Generation for Protein Function Annotation
Linrui Ma, Yiwei Liang, Yishu Yu, Chuhan Joyce Qi · PDF
ProtSent: Protein Sentence Transformers
Dan Ofer, Oriel Perets, Michal Linial, Nadav Rappoport
Retrieval-Augmented Foundation Model Enhances Risk Prediction Using Electronic Health Records
Saeed Shurrab, Mariam Al-Omari, Dana El Samad, Farah E. Shamout
SaNano - Structure Aware Transfer Learning For Data Limited Protein Modality
Hugo Frelin, Andrei Kamenski, Paolo Marcatili · PDF
Search, Edit, and Fold: LLM-Guided MSA Optimization for Protein Conformation Prediction
Yu Pei, Jiangtao Feng, Hao Wang, Dongyu Xue, Keyue Qiu, Hao Zhou, Wei-Ying Ma
Selective Benefits of Sequence-Drug Multimodal Learning for Antimicrobial Resistance Prediction
Mateusz A. Wlodarski
SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome
Dabin Jeong, Amirhossein Vahidi, Ciro Ramírez-Suástegui, Marie Moullet, Kevin Ly, Mohammad Vali Sanian, Sebastian Birk, Yinshui Chang, Adam Boxall, Daniyal Jafree, Lloyd Steele, Vijaya Baskar MS, Muzlifah Haniffa, Mohammad Lotfollahi
Single-Cell Cross-Modal Transfer by Adversarial Fine-Tuning of Foundation Models
Joseph Boyd, Matthew Lyon, Martino Mansoldo, Christian John Hurry, Finnian Firth · PDF
Structural Bottleneck Reasoning: Efficient Medical VQA via Concept Alignment
Hoang Nguyen, Tran Anh Dang Le, Ta Duc Huy, Nguyen Bui Ngoc Han, Vy Tuong Dang, Duy Duong-Tran, Khoa D Doan, Anji Liu, Li Shen, Pengtao Xie, Mathias Niepert, Duy Minh Ho Nguyen, James Zou, Daniel Sonntag
Survival-Relevant Directional Pathology–Omics Discordance from Frozen Whole-Slide Foundation Embeddings
KuanTing Wu
Synergy-Aware Contrastive Pretraining for Co-recorded Physiological Signals
Lei Chen, Junetae Kim · PDF
The Hallucination Dependence Index: A Cross-Condition Diagnostic for Clinical-LLM Faithfulness
Ishan Gonehal, Hanson Wen, Bowman Novey
Transcriptomics-Conditioned Virtual Tissue Synthesis via Diffusion Transformers
Pantelis Vlachas, Kalin Nonchev, Viktor Koelzer, Gunnar Ratsch · PDF
Transferable Lesion-Supervised Speech Representations for Post-Stroke Modelling
Giulia Sanguedolce, Niamh Parkinson, Fatemeh Geranmayeh, Patrick Naylor
TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction
Seungik Cho · PDF
Uncovering evolutionarily remote and highly potent antimicrobial peptides with protein language models
Qinze Yu, Yu Li
VFUSE: Virulent Feature Understanding with Sparse autoEncoders
Michael Yu, Matthew Lyle Olson
Virtual Cell Models Inflate Perturbation Effect Sizes and Undermine Causal Gene Regulatory Network Recovery
Aayan Alwani, Ethan Y Wang
VirtualCeLLM: A Comprehensive Benchmark and Guidance for Large Language Models in Cellular Biology
Jingbo Zhou, Zhongqi Wang, Zhen Lei, Stan Z. Li, Chang Yu
WACA-DTA: Water-Aware Geometric Biases for Structure-Conditioned Drug-Target Affinity Prediction
khan huang, Chang Li
What Makes a Virtual Cell a World Model? Three Gaps, Three Axes, and a Roadmap
Chang Yu, Jingbo Zhou, Zhongqi Wang, Zhen Lei, Stan Z. Li

Accepted papers (83)

☆AgriPerceiver: A Parameter-Efficient Vision-Language Model for Structured Macroscopic Crop Phenotyping

☆AMR-Bench-mini: A Diagnostic Benchmark for Agentic Mechanistic AMR Reasoning under Evidence Insufficiency

☆ARO: Aligned Representation learning for multi-Omics data

☆Auditing Generative Graph Foundation Models for Connectomics: A Score × Predictor × Sampler Decomposition on Real Sparse Directed Connectomes

☆Beyond Nativeness: Viral Proteins in Protein Language Models

☆Biophysics-Guided Genomic Foundation Models via Attention Gating

☆Bolek: A Multimodal Language Model for Molecular Reasoning

☆Causal-IQD-DTA: Counterfactual Interaction-Quality Disentanglement for Robust Drug–Target Affinity Prediction

☆Cell Painting Generates Single-Cell Transcriptomics via Conditional Diffusion

☆ConfPert: Distribution-Free Conformal Coverage for Single-Cell Perturbation Predictors

☆Contrastive Learning for Gene Set Enrichment Analysis Post-Processing

☆Cross-modal transfer learning for mapping bulk transcriptomes at cellular level

☆DELBERT-2: Pretrained Fingerprint Language Models for DEL Protein Binder Prediction

☆Detecting Sparse Colorectal Cancer Signals from Multi-Modal Cell-Free DNA Representations Using Modern Hopfield Attention

☆Do Clinical VLMs Need Dense Visual Tokens? Probing Spatial Grounding in Radiology Report Generation

☆DrugAgent: Reliable Multi-Agent Aggregation under Conflicting Biomedical Evidence

☆E1: Retrieval-Augmented Protein Encoder Models

☆Empirical Observations on Parameter Scaling in Chemical Language Models

☆ERVNet: A Three-Module Framework for Predicting Endogenous Retrovirus Reactivation, Gene Propagation, and Immunogenicity

☆esm-bind: How much protein–RNA binding signal is already in frozen ESM-2 + RNA-FM representations?

☆Exploring Set-Aggregated Genome Embeddings for Microbiome Abundance Prediction

☆Few-Shot Biomedical Image Classification by Alignment of Independently Pretrained Encoders

☆Gene-Embedding Perturbation Operators for Zero-Shot and Transferable Prediction of Transcriptional Responses

☆Generalization of Protein Foundation Models for Engineered Fluorescent Biosensors

☆GeneZip: Region-Aware Compression for Long Context DNA Modeling

☆GFETM: Genome Foundation-based Embedded Topic Model for scATAC-seq Modeling

☆Group Contrastive Learning for Weakly Paired Multimodal Data

☆HealthBot: An Open-Source AI Assistant for Longitudinal Personal Health Management

☆HistoTx: Early fusion of H&E images and spatial transcriptomics at varying spatial transcriptomics resolution with self-supervised learning

☆How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images

☆ImmunoFoundation: A Multimodal Foundation Model for Immunogenicity Prediction and Peptide Optimization

☆Is PEFT Enough for Cell Segmentation? An Empirical No-Go Result on Frozen Foundation Models

☆Learning Protein Fitness Landscapes with Multimodal Stability Priors

☆Lightweight Alignment of Unimodal Foundation Models for Metabolite Identification

☆LLMs Can Learn the Language of the Microbiome

☆Local-Atlas Control-Anchored Flow Matching for Unpaired Single-Cell Perturbation Prediction

☆Marking the Wrong Symptoms: Evaluating LLM Watermarks in Medical Texts

☆Mechanistic Synergy in Multi-Modal VEP: DNA Context Complements PLMs under Biophysical Constraints

☆Medmarks: An Open-Source LLM Benchmark Suite for Medical Tasks

☆MESH-HR: Multimodal Fusion of Somatic DNA Profiles and Histopathology for Continuous Breast Cancer Receptor Subtyping via LLM-Assisted Annotation

☆Mode-Aware Phenotype Profiling from Korean Clinical Reports: An LLM-Derived Two-Layer Fingerprint for Autism Characterization

☆MolEmb: Multimodal Large Language Models Can Be Strong Molecular Embedding Models

☆Neuro-Anatomy–Informed Self-Supervised Learning for Structural Brain MRI

☆OmicsDefense: The First Unified Framework for Defending Against Backdoor Attacks in Single-cell Foundation Models

☆OmicsLM: A Multimodal Large Language Model for Multi-Sample Omics Reasoning

☆PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder

☆PaSTel: Anchoring Histology in Spatial Transcriptomics via Multi-Scale Hierarchical Bio-Prior Contrastive Pretraining

☆PertReasonQA: A Knowledge-Grounded Benchmark and Framework for Cell-State–Conditioned Mechanistic Reasoning of Perturbation Effects

☆PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling

☆Position: AI for Drug Discovery Models Often Do Not Learn as Expected and How to Diagnose These Failure Modes

☆Position: Multi-Modal LLMs for Video Behavioral Coding in High-Stakes Decision-Making Are Bounded by Polysemy, Not by Model Scale

☆Position: Multi-Omics Foundation Models Need a Modality Identifiability Standard, Not Just Aggregate Accuracy

☆Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability

☆Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability

☆Pre-training on noncovalent interactions from synthetic protein-ligand structures to better predict binding affinity

☆Predicting host-pathogen interactions using a proteome-scale language model

☆PRIMA: a bidirectional state-space architecture and training approach for sequence modelling of protein-protein interactions

☆Probing, Fusion, and Trustworthiness: A Systematic Evaluation of Foundation Model Representations for Multimodal Cancer Analysis

☆ProSAM: Modular and Energy-Guided Fine-Tuning of Protein Language Models for Structure Prediction

☆ProteinJEPA: Latent prediction complements protein language models

☆ProteomeLM: A Proteome-Scale Language Model Enables Accurate and Rapid Prediction of Protein-Protein Interactions and Gene Essentiality Across Taxa

☆PROTEUS: Predicting How Post-Translational Modifications Alter Drug Binding Affinity

☆ProtQueSt: Query-Conditioned Retrieval-Augmented Generation for Protein Function Annotation

☆ProtSent: Protein Sentence Transformers

☆Retrieval-Augmented Foundation Model Enhances Risk Prediction Using Electronic Health Records

☆SaNano - Structure Aware Transfer Learning For Data Limited Protein Modality

☆Search, Edit, and Fold: LLM-Guided MSA Optimization for Protein Conformation Prediction

☆Selective Benefits of Sequence-Drug Multimodal Learning for Antimicrobial Resistance Prediction

☆SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome

☆Single-Cell Cross-Modal Transfer by Adversarial Fine-Tuning of Foundation Models

☆Structural Bottleneck Reasoning: Efficient Medical VQA via Concept Alignment

☆Survival-Relevant Directional Pathology–Omics Discordance from Frozen Whole-Slide Foundation Embeddings

☆Synergy-Aware Contrastive Pretraining for Co-recorded Physiological Signals

☆The Hallucination Dependence Index: A Cross-Condition Diagnostic for Clinical-LLM Faithfulness

☆Transcriptomics-Conditioned Virtual Tissue Synthesis via Diffusion Transformers

☆Transferable Lesion-Supervised Speech Representations for Post-Stroke Modelling

☆TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction

☆Uncovering evolutionarily remote and highly potent antimicrobial peptides with protein language models

☆VFUSE: Virulent Feature Understanding with Sparse autoEncoders

AgriPerceiver: A Parameter-Efficient Vision-Language Model for Structured Macroscopic Crop Phenotyping

AMR-Bench-mini: A Diagnostic Benchmark for Agentic Mechanistic AMR Reasoning under Evidence Insufficiency

ARO: Aligned Representation learning for multi-Omics data

Auditing Generative Graph Foundation Models for Connectomics: A Score × Predictor × Sampler Decomposition on Real Sparse Directed Connectomes

Beyond Nativeness: Viral Proteins in Protein Language Models

Biophysics-Guided Genomic Foundation Models via Attention Gating

Bolek: A Multimodal Language Model for Molecular Reasoning

Causal-IQD-DTA: Counterfactual Interaction-Quality Disentanglement for Robust Drug–Target Affinity Prediction

Cell Painting Generates Single-Cell Transcriptomics via Conditional Diffusion

ConfPert: Distribution-Free Conformal Coverage for Single-Cell Perturbation Predictors

Contrastive Learning for Gene Set Enrichment Analysis Post-Processing

Cross-modal transfer learning for mapping bulk transcriptomes at cellular level

DELBERT-2: Pretrained Fingerprint Language Models for DEL Protein Binder Prediction

Detecting Sparse Colorectal Cancer Signals from Multi-Modal Cell-Free DNA Representations Using Modern Hopfield Attention

Do Clinical VLMs Need Dense Visual Tokens? Probing Spatial Grounding in Radiology Report Generation

DrugAgent: Reliable Multi-Agent Aggregation under Conflicting Biomedical Evidence

E1: Retrieval-Augmented Protein Encoder Models

Empirical Observations on Parameter Scaling in Chemical Language Models

ERVNet: A Three-Module Framework for Predicting Endogenous Retrovirus Reactivation, Gene Propagation, and Immunogenicity

esm-bind: How much protein–RNA binding signal is already in frozen ESM-2 + RNA-FM representations?

Exploring Set-Aggregated Genome Embeddings for Microbiome Abundance Prediction

Few-Shot Biomedical Image Classification by Alignment of Independently Pretrained Encoders

Gene-Embedding Perturbation Operators for Zero-Shot and Transferable Prediction of Transcriptional Responses

Generalization of Protein Foundation Models for Engineered Fluorescent Biosensors

GeneZip: Region-Aware Compression for Long Context DNA Modeling

GFETM: Genome Foundation-based Embedded Topic Model for scATAC-seq Modeling

Group Contrastive Learning for Weakly Paired Multimodal Data

HealthBot: An Open-Source AI Assistant for Longitudinal Personal Health Management

HistoTx: Early fusion of H&E images and spatial transcriptomics at varying spatial transcriptomics resolution with self-supervised learning

How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images

ImmunoFoundation: A Multimodal Foundation Model for Immunogenicity Prediction and Peptide Optimization

Is PEFT Enough for Cell Segmentation? An Empirical No-Go Result on Frozen Foundation Models

Learning Protein Fitness Landscapes with Multimodal Stability Priors

Lightweight Alignment of Unimodal Foundation Models for Metabolite Identification

LLMs Can Learn the Language of the Microbiome

Local-Atlas Control-Anchored Flow Matching for Unpaired Single-Cell Perturbation Prediction

Marking the Wrong Symptoms: Evaluating LLM Watermarks in Medical Texts

Mechanistic Synergy in Multi-Modal VEP: DNA Context Complements PLMs under Biophysical Constraints

Medmarks: An Open-Source LLM Benchmark Suite for Medical Tasks

MESH-HR: Multimodal Fusion of Somatic DNA Profiles and Histopathology for Continuous Breast Cancer Receptor Subtyping via LLM-Assisted Annotation

Mode-Aware Phenotype Profiling from Korean Clinical Reports: An LLM-Derived Two-Layer Fingerprint for Autism Characterization

MolEmb: Multimodal Large Language Models Can Be Strong Molecular Embedding Models

Neuro-Anatomy–Informed Self-Supervised Learning for Structural Brain MRI

OmicsDefense: The First Unified Framework for Defending Against Backdoor Attacks in Single-cell Foundation Models

OmicsLM: A Multimodal Large Language Model for Multi-Sample Omics Reasoning

PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder

PaSTel: Anchoring Histology in Spatial Transcriptomics via Multi-Scale Hierarchical Bio-Prior Contrastive Pretraining

PertReasonQA: A Knowledge-Grounded Benchmark and Framework for Cell-State–Conditioned Mechanistic Reasoning of Perturbation Effects

PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling

Position: AI for Drug Discovery Models Often Do Not Learn as Expected and How to Diagnose These Failure Modes

Position: Multi-Modal LLMs for Video Behavioral Coding in High-Stakes Decision-Making Are Bounded by Polysemy, Not by Model Scale

Position: Multi-Omics Foundation Models Need a Modality Identifiability Standard, Not Just Aggregate Accuracy

Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability

Position: Saturation in Single-Cell Foundation Model Benchmarks Signals Identifiability Failure, Not Solved Capability

Pre-training on noncovalent interactions from synthetic protein-ligand structures to better predict binding affinity

Predicting host-pathogen interactions using a proteome-scale language model

PRIMA: a bidirectional state-space architecture and training approach for sequence modelling of protein-protein interactions

Probing, Fusion, and Trustworthiness: A Systematic Evaluation of Foundation Model Representations for Multimodal Cancer Analysis

ProSAM: Modular and Energy-Guided Fine-Tuning of Protein Language Models for Structure Prediction

ProteinJEPA: Latent prediction complements protein language models

ProteomeLM: A Proteome-Scale Language Model Enables Accurate and Rapid Prediction of Protein-Protein Interactions and Gene Essentiality Across Taxa

PROTEUS: Predicting How Post-Translational Modifications Alter Drug Binding Affinity

ProtQueSt: Query-Conditioned Retrieval-Augmented Generation for Protein Function Annotation

ProtSent: Protein Sentence Transformers

Retrieval-Augmented Foundation Model Enhances Risk Prediction Using Electronic Health Records

SaNano - Structure Aware Transfer Learning For Data Limited Protein Modality

Search, Edit, and Fold: LLM-Guided MSA Optimization for Protein Conformation Prediction

Selective Benefits of Sequence-Drug Multimodal Learning for Antimicrobial Resistance Prediction

SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome

Single-Cell Cross-Modal Transfer by Adversarial Fine-Tuning of Foundation Models

Structural Bottleneck Reasoning: Efficient Medical VQA via Concept Alignment

Survival-Relevant Directional Pathology–Omics Discordance from Frozen Whole-Slide Foundation Embeddings

Synergy-Aware Contrastive Pretraining for Co-recorded Physiological Signals

The Hallucination Dependence Index: A Cross-Condition Diagnostic for Clinical-LLM Faithfulness

Transcriptomics-Conditioned Virtual Tissue Synthesis via Diffusion Transformers

Transferable Lesion-Supervised Speech Representations for Post-Stroke Modelling

TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction

Uncovering evolutionarily remote and highly potent antimicrobial peptides with protein language models

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

Virtual Cell Models Inflate Perturbation Effect Sizes and Undermine Causal Gene Regulatory Network Recovery