ICLR 2025 · Past · Genomics · Healthcare & biology

Workshop on Machine Learning for Genomics Explorations

MLGenX

Unverified seed entry. Some fields are estimates — confirm everything on the official website before planning a submission.

Submission deadline: Feb 10, 2025, 23:59 AoE (UTC−12) (seed estimate of the historical deadline; verify)
Workshop day: Apr 27, 2025
Submission portal: OpenReview
Notes: Seed data. Name and website are taken from the OpenReview venue record; dates are estimated and should be verified.

Accepted papers (86)

Fetched from OpenReview (v2) on 2026-06-10.

  1. 2DE: a probabilistic method for differential expression across niches in spatial transcriptomics data

    Nathan Levy, Florian Ingelfinger, Artemy Bakulin, Giacomo Cinnirella, Pierre Boyeau, Can Ergen, Nir Yosef · PDF
  2. A data-driven recommendation framework for genomic discovery

    Ying Yang, Zhaoying Pan, Jinge Ma, Daniel J. Klionsky · PDF
  3. A Scalable LLM Framework for Therapeutic Biomarker Discovery: Grounding Q/A Generation in Knowledge Graphs and Literature

    Marc Boubnovski Martell, Kaspar Märtens, Lawrence Phillips, Daniel Keitley, Maria Dermit, Julien Fauqueur · PDF
  4. A Topologically Guided Machine Learning Framework for Enhanced Fine-Mapping in Whole-Genome Bacterial Studies

    Tamsin Emily James, Peter Tino, Nicole E Wheeler · PDF
  5. AI AGENT FOR DATA-DRIVEN HYPOTHESIS EXPLORATION IN SINGLE-CELL TRANSCRIPTOMICS

    Artemy Bakulin, Pierre Boyeau, Nir Yosef · PDF
  6. AI-Powered Virtual Tissues from Spatial Proteomics for Clinical Diagnostics and Biomedical Discovery

    Johann Wenckstern, Eeshaan Jain, Kiril Vasilev, Matteo Pariset, Andreas Wicki, Gabriele Gut, Charlotte Bunne · PDF
  7. Aligning Molecules and Fragments in a Shared Embedding Space for RL-Based Molecule Generation

    Youngkuk Kim, Yinhua Piao, Sangseon Lee, Sun Kim · PDF
  8. Benchmarking Fine-Tuned RNA Language Models for Intronic Branch Point Prediction

    Pablo Rodenas Ruiz, Ali Saadat, Timothy T. Tran, Oliver Müller Smedt, Peng Zhang, Jacques Fellay · PDF
  9. BEYOND SEQUENCE-ONLY MODELS: LEVERAGING STRUCTURAL CONSTRAINTS FOR ANTIBIOTIC RESISTANCE PREDICTION IN SPARSE GENOMIC DATASETS

    Mahbuba Tasmin, Anna G. Green · PDF
  10. Beyond the Factual vs. Hallucinatory Dichotomy: A Refined Taxonomy for LLM Medical Response Categorization

    Saleh Afroogh, Yasser Poreesmaiel, Junfeng Jiao · PDF
  11. BirdieDNA: Reward-Based Pre-Training for Genomic Sequence Modeling

    Sam Blouir, Defne Circi, Asher Moldwin, Amarda Shehu · PDF
  12. Building Foundation Models to Characterize Cellular Interactions via Geometric Self-Supervised Learning on Spatial Genomics

    Yuning You, Zitong Jerry Wang, Kevin Fleisher, Rex Liu, Matt Thomson · PDF
  13. Capturing functional context of genetic pathways through hyperedge disentanglement

    Yoonho Lee, Junseok Lee, Sangwoo Seo, Sungwon Kim, Yeongmin Kim, Chanyoung Park · PDF
  14. CellMemory: Hierarchical Interpretation of Out-of-Distribution Cells Using Bottlenecked Transformer

    Qifei Wang · PDF
  15. COLOR: A COMPOSITIONAL LINEAR OPERATION BASED REPRESENTATION OF PROTEIN SEQUENCES FOR IDENTIFICATION OF MONOMER CONTRIBUTIONS TO PROPERTIES

    Akash Pandey, Wei Chen, Sinan Keten · PDF
  16. Curly Flow Matching for Learning Non-gradient Field Dynamics

    Katarina Petrović, Lazar Atanackovic, Kacper Kapusniak, Michael M. Bronstein, Joey Bose, Alexander Tong · PDF
  17. Decision Tree Induction with Dynamic Feature Generation: A Framework for Interpretable DNA Sequence Analysis

    Nicolas Huynh, Krzysztof Kacprzyk, Ryan M Sheridan, David L. Bentley, Mihaela van der Schaar · PDF
  18. Detecting cell level transcriptomic changes of Perturb-seq using Contrastive Fine-tuning of Single-Cell Foundation Models

    Wenmin Zhao, Ana Solaguren-Beascoa, Grant Neilson, Regina Reynolds, Louwai Muhammed, Liisi Laaniste, Sera Aylin Cakiroglu · PDF
  19. DrugAgent: Multi-Agent Large Language Model-Based Reasoning for Drug-Target Interaction Prediction

    Yoshitaka Inoue, Tianci Song, Xinling Wang, Augustin Luna, Tianfan Fu · PDF
  20. ECG-Nest-FM: A Frequency-Focused ECG Foundation Model with Nested Embeddings

    Abhishek Sharma, Lin Yang, Cory Y McLean, Justin Cosentino, Farhad I Hormozdiari · PDF
  21. EFFICIENT FINE-TUNING OF SINGLE-CELL FOUNDATION MODELS ENABLES ZERO-SHOT MOLECULAR PERTURBATION PREDICTION

    Sepideh Maleki, Jan-Christian Huetter, David Richmond, Kangway V. Chuang, Gabriele Scalia, Tommaso Biancalani · PDF
  22. Enhancing DNA Foundation Models to Address Masking Inefficiencies

    Monireh Safari, Pablo Andres Millan Arias, Scott C. Lowe, Lila Kari, Angel X Chang, Graham W. Taylor · PDF
  23. Enhancing Downstream Analysis in Genome Sequencing: Species Classification While Basecalling

    Riselda Kodra, Hadjer Benmeziane, Irem Boybat, William Andrew Simon · PDF
  24. Enhancing E. coli Genomic Analysis with Retrieval-Augmented Generation

    Kritika Chugh · PDF
  25. ESM-Effect: An Effective and Efficient Fine-Tuning Framework towards accurate prediction of Mutation's Functional Effect

    Moritz Glaser, Johannes Brägelmann · PDF
  26. Exploring the potential of genetic variation and zygosity in DNA language models

    Ali Saadat, Jacques Fellay · PDF
  27. FACA-GEN: Investigating Bias and Generalization in Active Learning for Genomics AI

    Amber Qayum Hawabaz · PDF
  28. Featurization of single cell trajectories through kernel mean embedding of optimal transport maps

    Alec Plotkin, Justin Milner, Natalie Stanley · PDF
  29. Flexible Models of Functional Annotations to Variant Effects using Accelerated Linear Algebra

    Alan Nawzad Amin, Andres Potapczynski, Andrew Gordon Wilson · PDF
  30. GENATATOR: de novo Gene Annotation With DNA Language Model

    Aleksei Shmelev, Artem Shadskiy, Yuri Kuratov, Mikhail Burtsev, Olga Kardymon, Veniamin Fishman · PDF
  31. Gene Set Function Discovery with LLM-Based Agents and Knowledge Retrieval

    Daniela Pinto Veizaga, Aécio Santos, Juliana Freire, Wenke Liu, Sarah Keegan, David Fenyo · PDF
  32. Gradient-Based Gene Selection for Multimodal scRNA-seq Foundation Models

    Pakaphol Thadawasin, Farhan Khodaee, Rohola Zandie, Elazer R Edelman · PDF
  33. Graph Pseudotime Analysis and Neural Stochastic Differential Equations for Analyzing Retinal Degeneration Dynamics and Beyond

    Dai Shi, Kuan Yan, Lequan Lin, Yue Zeng, Ting Zhang, Jialing Zhang, Matsypura Dmytro, Mark C. Gillies, Ling Zhu, Junbin Gao · PDF
  34. GraphPINE: Graph importance propagation for interpretable drug response prediction

    Yoshitaka Inoue, Tianfan Fu, Augustin Luna · PDF
  35. HARMONY: A Multi-Representation Framework for RNA Property Prediction

    Junjie Xu, Artem Moskalev, Tommaso Mansi, Mangal Prakash, Rui Liao · PDF
  36. Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics

    Matthew Wood, Mathieu Klop, Maxime Allard · PDF
  37. Hierarchical Assembly of Long DNA Libraries from Short Oligonucleotide Pools

    Shaozhong Zou, Zhien Wu, Chunfu Xu · PDF
  38. HybriDNA: A Hybrid Transformer-Mamba2 Long-Range DNA Language Model

    Mingqian Ma, Guoqing Liu, Chuan Cao, Pan Deng, Tri Dao, Albert Gu, Peiran Jin, Zhao Yang, Yingce Xia, Renqian Luo, Pipi Hu, Zun Wang, Yuan-Jyue Chen, Haiguang Liu, Tao Qin · PDF
  39. InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference

    Tianyu Cui, Song-Jun Xu, Artem Moskalev, Shuwei Li, Tommaso Mansi, Mangal Prakash, Rui Liao · PDF
  40. Integrating Protein Language Model and Active Learning for Few-Shot Viral Variant Detection

    Marian Huot, Dianzhuo Wang, Jiacheng Liu, Eugene Shakhnovich · PDF
  41. Interpretable prediction of DNA replication origins in S. cerevisiae using attention-based motif discovery

    Zohreh Piroozeh, Ildem Akerman, Stefan Kesselheim, Olga Kalinina, Alina Bazarova · PDF
  42. Knockoff Statistics-Driven Interpretable Deep Learning Models for Uncovering Potential Biomarkers for COVID-19 Severity Prediction

    Qian Liu, Daryl Fung, Huanjing Liu, Pingzhao Hu · PDF
  43. LangPert: LLM-Driven Contextual Synthesis for Unseen Perturbation Prediction

    Kaspar Märtens, Marc Boubnovski Martell, Cesar A. Prada-Medina, Rory Donovan-Maiye · PDF
  44. Large Language Models for Zero-shot Inference of Causal Structures in Biology

    Izzy Newsham, Luka Kovačević, Richard Moulange, Nan Rosemary Ke, Sach Mukherjee · PDF
  45. Learning Non-Equilibrium Signaling Dynamics in Single-Cell Perturbation Dynamics

    Heman Shakeri · PDF
  46. Learning Representations of Instruments for Partial Identification of Treatment Effects

    Jonas Schweisthal, Dennis Frauen, Maresa Schröder, Konstantin Hess, Niki Kilbertus, Stefan Feuerriegel · PDF
  47. Leveraging GPT Continual Fine-Tuning for Improved RNA Editing Site Prediction

    Zohar Rosenwasser, Erez Levanon, Michael Levitt, Gal Oren · PDF
  48. LIMEADE: Local Interpretable Manifold Explanations for Dimension Evaluations

    Tarek M Zikry, Genevera I. Allen · PDF
  49. LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs – Evaluation through Synthetic Data Generation

    Tejumade Afonja, Ivaxi Sheth, Ruta Binkyte, Waqar Hanif, Shubhi Ambast, Charles Mwangi Kaumbutha, Matthias Becker, Mario Fritz · PDF
  50. LoFTPat: Low-Rank Subspace Optimization for Parameter-Efficient Fine-Tuning of Genomic Language Models in Pathogenicity Identification

    Sajib Acharjee Dip · PDF
  51. MolCap-Arena: A Comprehensive Captioning Benchmark on Language-Enhanced Molecular Property Prediction

    Carl Edwards, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Heng Ji, Gabriele Scalia · PDF
  52. Multi-modal single-cell foundation models via dynamic token adaptation

    Wenmin Zhao, Ana Solaguren-Beascoa, Grant Neilson, Louwai Muhammed, Liisi Laaniste, Aylin Cakiroglu · PDF
  53. Multi-omic Causal Discovery using Genotypes and Gene Expression

    Stephen M. Asiedu, David Watson · PDF
  54. MutEmbed: Self-Supervised Learning of Biological Latent Embeddings from Cancer Mutational Profiles

    Aakansha Narain, Wu Jialun Andy, Hannan Wong, Vedant Sandhu, Jason J. Pitt · PDF
  55. NOLAN: SELF-SUPERVISED FRAMEWORK FOR MAPPING CONTINUOUS TISSUE ORGANIZATION

    Artemy Bakulin, Nathan Levy, Can Ergen, Jonas Maaskola, Nir Yosef · PDF
  56. Pathway-Attentive GAN for Interpretable Biomolecular Design

    Azmine Toushik Wasi, Mahfuz Ahmed Anik · PDF
  57. Piloting Structure-Based Drug Design via Modality-Specific Optimal Schedule

    Keyue Qiu, Yuxuan Song, Zhehuan Fan, Peidong Liu, Zhe Zhang, Mingyue Zheng, Hao Zhou, Wei-Ying Ma · PDF
  58. PIONEER: a virtual platform for iterative improvement of genomic deep learning

    Alessandro Crnjar, John J Desmarais, Justin Kinney, Peter K Koo · PDF
  59. Predicting Drug-likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization

    Dongmin Bang, Inyoung Sung, Yinhua Piao, Sangseon Lee, Sun Kim · PDF
  60. PREDICTING TIME-VARYING METABOLIC DYNAMICS USING STRUCTURED NEURAL ODE PROCESSES

    Santanu Rathod, Pietro Lio, Xiao Zhang · PDF
  61. PRISM: Enhancing Protein Inverse Folding through Fine-Grained Retrieval on Structure-Sequence Multimodal Representations

    Sazan Mahbub, Souvik Kundu, Eric P. Xing · PDF
  62. ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding

    Yijia Xiao, Edward Sun, Yiqiao Jin, Qifan Wang, Wei Wang · PDF
  63. RAG-Enhanced Collaborative LLM Agents for Drug Discovery

    Namkyeong Lee, Edward De Brouwer, Ehsan Hajiramezanali, Tommaso Biancalani, Chanyoung Park, Gabriele Scalia · PDF
  64. RAG-ESM: Improving pretrained protein language models via sequence retrieval

    Damiano Sgarbossa, Anne-Florence Bitbol · PDF
  65. Reference-free cell-type annotation with LLM agents

    Yidi Huang, Ivan Cohen, Van Quynh-Thi Truong, Pedram B Bayat, Sameer A Bhatti, Luca Paruzzo, Mark M. Painter, Shirong Zheng, Derek Alan Oldridge, Joost Wagenaar, Allison R Greenplate, Dokyoon Kim · PDF
  66. Relaxed Equivariance via Multitask Learning

    Ahmed A. A. Elhag, T. Konstantin Rusch, Francesco Di Giovanni, Michael M. Bronstein · PDF
  67. RNAGym: Benchmarks for RNA Fitness and Structure Prediction

    Rohit Arora, Murphy Angelo, Christian Andrew Choe, Aaron W Kollasch, Fiona Qu, Courtney A. Shearer, Ruben Weitzman, Artem Gazizov, Sarah Gurev, Erik Xie, Debora Susan Marks, Pascal Notin · PDF
  68. Sampling Protein Language Models for Functional Protein Design

    Jeremie Theddy Darmawan, Yarin Gal, Pascal Notin · PDF
  69. Searching for Phenotypic Needles in Genomic Haystacks: DNA Language Models for Sex Prediction

    Alla Chepurova, Yuri Kuratov, Polina Belokopytova, Mikhail Burtsev, Veniamin Fishman · PDF
  70. ShortListing Model: A Streamlined Simplex Diffusion for Biological Sequence Generation

    Yuxuan Song, Zhe Zhang, Yu Pei, Jingjing Gong, Mingxuan Wang, Hao Zhou, Jingjing Liu, Wei-Ying Ma · PDF
  71. SPACE: Your Genomic Profile Predictor is a Powerful DNA Foundation Model

    Jiwei Zhu, Zhao Yang, Bing Su · PDF
  72. SpaceDX: A Bayesian test for localized differential expression in population-level spatial transcriptomics datasets

    Niklas Stotzem, Simon Chang, Na Cai, Francesco Paolo Casale · PDF
  73. Spatially-Informed Sampling Enables Accurate Prediction of Large-Scale Mutational Effects

    Maxime Basse, Dianzhuo Wang, Eugene Shakhnovich · PDF
  74. SPELL: Spatial Prompting with Chain-of-Thought for Zero-Shot Learning in Spatial Transcriptomics

    Sumeer Ahmad Khan, Xabier Martínez de Morentin, Vincenzo Lagani, Robert Lehmann, Abdel Rahman Alsabbagh, Mahmoud Zahran, Narsis A. Kiani, David Gomez-Cabrero, Jesper Tegnér · PDF
  75. stDiffusion: A Diffusion Based Model for Generative Spatial Transcriptomics

    Sumeer Ahmad Khan, Xabier Martínez de Morentin, Vincenzo Lagani, Robert Lehmann, Narsis A. Kiani, David Gomez-Cabrero, Jesper Tegnér · PDF
  76. Structure-based metabolite function prediction using graph neural networks

    Tancredi Cogne, Mariam Ait Oumelloul, Ali Saadat, Janna Hastings, Jacques Fellay · PDF
  77. Supervised Contrastive Block Disentanglement

    Taro Makino, Ji Won Park, Natasa Tagasovska, Takamasa Kudo, Paula Coelho, Heming Yao, Jan-Christian Huetter, Ana Carolina Leote, Burkhard Hoeckendorf, Stephen Ra, David Richmond, Kyunghyun Cho, Aviv Regev, Romain Lopez · PDF
  78. Talk2Biomodels and Talk2KnowledgeGraph: AI agent-based application for prediction of patient biomarkers and reasoning over biomedical knowledge graphs

    Gurdeep Singh, Lilija Wehling, Ahmad Wisnu Mulyadi, Rakesh Hadne Sreenath, Thomas Klabunde, Tommaso Andreani, Douglas McCloskey · PDF
  79. Test-Time View Selection for Multi-Modal Decision Making

    Eeshaan Jain, Johann Wenckstern, Benedikt von Querfurth, Charlotte Bunne · PDF
  80. To trap or not to trap--analyzing the trade-offs in diffusion transport models

    Rushmila Shehreen Khan, Md. Shahriar Karim · PDF
  81. Transferring Preclinical Drug Response to Patient via Tumor Heterogeneity-Aware Alignment and Perturbation Modeling

    Inyoung Sung, Dongmin Bang, Sun Kim, Sangseon Lee · PDF
  82. Uncertainty-aware genomic deep learning with knowledge distillation

    Jessica Zhou, Kaeli Rizzo, Ziqi Tang, Peter K Koo · PDF
  83. Uncovering BioLOGICAL Motifs and Syntax via Sufficient and Necessary Explanations

    Beepul Bharti, Gabriele Scalia, Tommaso Biancalani, Alex M Tseng · PDF
  84. WASSERSTEIN CYCLEGAN FOR SINGLE-CELL RNA-SEQ DATA GENERATION USING CROSS-MODALITY TRANSLATION

    Sajib Acharjee Dip, Liqing Zhang · PDF
  85. What do single-cell models already know about perturbations?

    Andreas Bjerregaard, Vivek Das, Anders Krogh · PDF
  86. When repeats drive the vocabulary: a Byte-Pair Encoding analysis of T2T primate genomes

    Marina Popova, Iaroslav Chelombitko, Aleksey Komissarov · PDF