ICML 2025 Past Large language models

1st ICML Workshop on Foundation Models for Structured Data

FMSD @ ICML 2025

Submission deadline
May 24, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (70)

Fetched from OpenReview (v2) on 2026-06-10.

  1. AdaRec: Adaptive Recommendation with LLMs via Narrative Profiling and Dual-Channel Reasoning

    Meiyun Wang, Charin Polpanumas · PDF
  2. Are Time Series Foundation Models Ready for Zero-Shot Forecasting?

    Yunkai Zhang, Qi Zeng, Yawen Zhang, Zhijie Xu, Ming Zheng, Chongyang Gao, Muyan Jiang, Zeyu Zheng · PDF
  3. Assessing the Robustness of Tabular Prior-Data Fitted Network Classifier

    Ali Nawaz, Amir Ahmad, Shehroz S. Khan · PDF
  4. Calibration Properties of Time Series Foundation Models

    Coen Adler, Yuxin Chang, Samar Abdi, Padhraic Smyth · PDF
  5. CauKer: Classification Time Series Foundation Models Can Be Pretrained on Synthetic Data only

    Shifeng Xie, Vasilii Feofanov, Marius Alonso, Ambroise Odonnat, Jianfeng Zhang, Ievgen Redko · PDF
  6. Causal Foundation Models: Disentangling Physics from Instrument Properties

    Jeroen Audenaert, Daniel Muthukrishna, Paul F. X. Gregory, David W Hogg, V Ashley Villar · PDF
  7. CLEAR: Contextual Logic-based Explanations for Anomaly Reasoning

    Vikash Sharma, Vipul Joshi, Anurag Tripathi, Mayank Jauhari, Amir Raza · PDF
  8. ConTextTab: A Semantics-Aware Tabular In-Context Learner

    Marco Spinaci, Marek Polewczyk, Maximilian Schambach, Sam Thelin · PDF
  9. Do Large Foundation Models Improve Time Series Segmentation? An Industrial Case Study in Oil and Gas Drilling

    Imane Khaouja, Amine EL KHAIR, Abdallah Benzine, Sebastiaan Buiting, Soumyadipta Sengupta, Youssef Tamaazousti · PDF
  10. Do You Really Need Public Data? Surrogate Public Data for Differential Privacy on Tabular Data

    Shlomi Hod, Lucas Rosenblatt, Julia Stoyanovich · PDF
  11. Do-PFN: In-Context Learning for Causal Effect Estimation

    Jake Robertson, Arik Reuter, Siyuan Guo, Noah Hollmann, Frank Hutter, Bernhard Schölkopf · PDF
  12. DriMM: Drilling Multimodal Model for Time-Series and Text in the Era of Large Models

    Sebastiaan Buiting, Soumyadipta Sengupta, Abdallah Benzine, Amine EL KHAIR, Imane Khaouja, Youssef Tamaazousti · PDF
  13. Dual Adaptation of Time-Series Foundation Models for Financial Forecasting

    Fatemeh Chitsaz, Saman Haratizadeh · PDF
  14. Early Stopping Tabular In-Context Learning

    Jaris Küken, Lennart Purucker, Frank Hutter · PDF
  15. Efficient Table Generation for Zero-Shot Column Type Annotation

    Ehsan Hoseinzade, Ke Wang · PDF
  16. Eliciting Numerical Predictive Distributions of LLMs Without Auto-Regression

    Julianna Piskorz, Kasia Kobalczyk, Mihaela van der Schaar · PDF
  17. Explore the Time Series Forecasting Potential of TabPFN Leveraging the Intrinsic Periodicity of Data

    Sibo Cai, Xi Sun, Hui Zhong · PDF
  18. Exploring Relational Database Foundation Models from a Graph Perspective

    Yanbo Wang, Xiyuan Wang, Quan Gan, Minjie Wang, Qibin Yang, David Wipf, Muhan Zhang · PDF
  19. Filter, Augment, Forecast: Online Data Selection for Robust Time Series Forecasting

    Ege Onur Taga, Halil Alperen Gozeten, Kutay Tire, Rahul Dalvi, Reinhard Heckel, Samet Oymak · PDF
  20. FoMo-0D: A Foundation Model for Zero-shot Outlier Detection

    Yuchen Shen, Haomin Wen, Leman Akoglu · PDF
  21. Foundation Models for Clinical Records at Health System Scale

    Haresh Rengaraj Rajamohan, Xiang Gao, Weicheng Zhu, Shih-Lun Huang, Long Chen, Kyunghyun Cho, Cem M Deniz, Narges Razavian · PDF
  22. Foundation models for time series forecasting and policy evaluation in infectious disease epidemics: a modelling study

    Suprabhath Kalahasti, Benjamin Faucher, boxuan wang, Claudio Ascione, Federico Baldo, Eugenio Valdano · PDF
  23. From Structured Data to Clinical Notes: Robust Clinical Decision Support with Fine-Tuned LLMs

    Frederike Lübeck, Jonas Bernhard Wildberger, Frederik Träuble, Maximilian Mordig, Sergios Gatidis, Andreas Krause, Bernhard Schölkopf · PDF
  24. From Tabular to Time Series: Can TabPFN Handle Mixed Data? A Study on PhysioNet

    Zichao Li, Bingyang Wang, Zong Ke · PDF
  25. From Video Classification to Action Detection: Foundation vs. Task-Specific Models

    Goncalo Mesquita, Ana Rita Cóias, Alexandre Bernardino, Artur Dubrawski · PDF
  26. G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning

    Xiaojun Guo, Ang Li, Yifei Wang, Stefanie Jegelka, Yisen Wang · PDF
  27. Gateformer: Advancing Multivariate Time Series Forecasting via Temporal and Variate-Wise Attention with Gated Representations

    Yu-Hsiang Lan · PDF
  28. GATS: A Time-Series Dataset for Addressing General Aviation Flight Safety

    Aidan LaBella, Charles Duong, Pak Iong Long, Nathan DePiero, Aditya Iyer, Elise Carman, Randall Balestriero, Travis Desell · PDF
  29. GIT-BO: High-Dimensional Bayesian Optimization with Tabular Foundation Models

    Rosen Ting-Ying Yu, Cyril Picard, Faez Ahmed · PDF
  30. Improving Treatment Effect Estimation with LLM-Based Data Augmentation

    Nicolas Huynh, Julianna Piskorz, Jeroen Berrevoets, Max Ruiz Luyten, Mihaela van der Schaar · PDF
  31. In-context Pre-trained Time-Series Foundation Models adapt to Unseen Tasks

    Shangqing Xu, Harshavardhan Kamarthi, Haoxin Liu, B. Aditya Prakash · PDF
  32. Instruction Tuning of Large Language Models for Tabular Data Generation—in One Day

    Milad Abdollahzadeh, Abdul Raheem, Zilong Zhao, Uzair Javaid, Kevin Yee, Nalam Venkata Abhishek, Tram Truong-Huu, Biplab Sikdar · PDF
  33. LEAD - Framework for efficient time-series anomaly detection on large scale data using LLMs

    Akash Chandrayan, Amir ZIDI, Matthew Reimherr, Anis Mjirda, Abhinav Pradhan · PDF
  34. Learning What Matters First: Sequential Adaptation of Time Series Foundation Models for Robust Financial Forecasting

    Fatemeh Chitsaz, Saman Haratizadeh · PDF
  35. Lights Out, Tabs On: Advancing Row-Column Encoding for Tabular LLMs

    Yi-Kai Zhang, Huai-Hong Yin, Xin Li, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun, De-Chuan Zhan, Han-Jia Ye · PDF
  36. LLM Agents Struggle at Time Series Machine Learning Engineering

    Yifu Cai, Xinyu Li, Mononito Goswami, Michał Wiliński, Gus Welter, Artur Dubrawski · PDF
  37. LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis

    Berkay Döner, Thorir Mar Ingolfsson, Luca Benini, Yawei Li · PDF
  38. Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard

    Si-Yang Liu, Qile Zhou, Han-Jia Ye · PDF
  39. Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification

    Vasilii Feofanov, Marius Alonso, Songkang Wen, Romain Ilbert, Hongbo Guo, Malik Tiomoko, Lujia Pan, Jianfeng Zhang, Ievgen Redko · PDF
  40. MORPHEUS : A Foundation Model for Multivariate Time Series Forecasting

    Prathamesh Patil, Amit Varshney, Manoj Cherukumalli, Harsh Deshpande, Leonard Eun, Dushyant Sahoo, Naren Chittar · PDF
  41. Multivariate Calibration is Performative: A Perspective on Pitfalls and Progress

    Sofian Zalouk, Charles Marx, Syrine Belakaria, Christopher De Sa, Stefano Ermon · PDF
  42. Multivariate de Bruijn Graphs: A Symbolic Graph Framework for Time Series Forecasting

    Mert Onur Cakiroglu, Idil Bilge Altun, Mehmet Dalkilic, Elham khorasani buxton, HASAN KURBAN · PDF
  43. One-Run Privacy Auditing for Structured Generative and Foundation Models

    Rishav Chourasia, Zilong Zhao, Uzair Javaid · PDF
  44. Photoplethysmography, Foundation Models, Hypertension and Diabetes

    George Searle · PDF
  45. Query, Don’t Train: Privacy-Preserving Tabular Prediction from EHR Data via SQL Queries

    Josefa Lia Stoisser, Marc Boubnovski Martell, Kaspar Märtens, Lawrence Phillips, Stephen Michael Town, Rory Donovan-Maiye, Julien Fauqueur · PDF
  46. Random Initialization Can’t Catch Up: The Advantage of Language Model Transfer for Time Series Forecasting

    Roland Riachi, Kashif Rasul, Arjun Ashok, Prateek Humane, Alexis Roger, Andrew Robert Williams, Yuriy Nevmyvaka, Irina Rish · PDF
  47. Real-TabPFN: Improving Tabular Foundation Models via Continued Pre-training With Real-World Data

    Anurag Garg, Muhammad Ali, Noah Hollmann, Lennart Purucker, Samuel Müller, Frank Hutter · PDF
  48. RECoRD: A Multi-Agent LLM Framework for Reverse Engineering Codebase to Relational Diagram

    Yuan Xue, Xiaoyu Lu, Yunfei Bai, Hoiyi Ng, Yunan Liu · PDF
  49. Rethinking Description Length: A TabPFN-Based Approximation of Bayesian Mixture Codes

    Afiq Abdillah Effiezal Aswadi, Susan Wei, Ria Jeffrey · PDF
  50. Self-Imputation and Cross-Variable Learning Improve Water Quality Prediction with Sparse Data

    Xiaofeng Liu, Xiaobo Xia, Xuechen Zhang, Mohna Chakraborty, Xiyuan Chang, Kuai Fang, William S. Currie, Samet Oymak · PDF
  51. Simulation-Based Pretraining and Domain Adaptation for Astronomical Time Series Tasks with Minimal Labeled Data

    Rithwik Gupta, Daniel Muthukrishna · PDF
  52. Soft Contrastive Learning for Irregular Multivariate Time Series

    Junghoon Lim, Seunghan Lee, Taeyoung Park · PDF
  53. Speaking Numbers to LLMs: Multi-Wavelet Number Embeddings for Time Series Forecasting

    Defu Cao, Zijie Lei, Jiao Sun, Yan Liu · PDF
  54. State-Space Models for Tabular Prior-Data Fitted Networks

    Felix Koch, Marcel Wever, Fabian Raisch, Benjamin Tischler · PDF
  55. TabPFN Unleashed: A Scalable and Effective Solution to Tabular Classification Problems

    Si-Yang Liu, Han-Jia Ye · PDF
  56. TabReason: A Reinforcement Learning-Enhanced Reasoning LLM for Explainable Tabular Data Prediction

    Tommy Miao Xu, Zhitian Zhang, Xiangyu Sun, Lauren Kelly Zung, Hossein Hajimirsadeghi, Greg Mori · PDF
  57. TabRep: Training Tabular Diffusion Models with a Simple and Effective Continuous Representation

    Jacob Si, Zijing Ou, Mike Qu, Zhengrui Xiang, Yingzhen Li · PDF
  58. TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning

    Ron Shapira Weber, Shahar Benishay, Andrey Lavrinenko, Shahaf E. Finder, Oren Freifeld · PDF
  59. TiRex: Zero-Shot Forecasting Across Long and Short Horizons

    Andreas Auer, Patrick Podest, Daniel Klotz, Sebastian Böck, Günter Klambauer, Sepp Hochreiter · PDF
  60. Toto: An Open Time Series Foundation Model Optimized for Observability

    Ben Cohen, Emaad Khwaja, Youssef Doubli, Salahidine Lemaachi, Chris Lettieri, Charles Masson, Hugo Miccinilli, Elise Ramé, Qiqi Ren, Afshin Rostamizadeh, Jean Ogier du Terrail, Anna-Monica Toon, Kan Wang, Stephan Xie, Zongzhe Xu, Viktoriya Zhukova, David Asker, Ameet Talwalkar, Othmane Abou-Amal · PDF
  61. Toward Scientific Foundation Models for Aquatic Ecosystems

    Abhilash Neog, Medha Sawhney, Kazi Sajeed Mehrab, Sepideh Fatemi, Mary E. Lofton, Amartya Dutta, Aanish Pradhan, Bennett J. McAfee, Emma Marchisin, Robert Ladwig, Arka Daw, Cayelan C. Carey, Paul Hanson, Anuj Karpatne · PDF
  62. Towards a Multi-Modal Foundation Model for Inertial Confinement Fusion: Combining Structured Data and Diagnostic Images

    Michael Jones, Bogdan Kustowski · PDF
  63. Towards Benchmarking Foundation Models for Tabular Data With Text

    Martin Mráz, Breenda Das, Anshul Gupta, Lennart Purucker, Frank Hutter · PDF
  64. Towards Fair In-Context Learning with Tabular Foundation Models

    Patrik Kenfack, Samira Ebrahimi Kahou, Ulrich Aïvodji · PDF
  65. Towards Generalizable Multimodal ECG Representation Learning with LLM-extracted Clinical Entities

    Mingsheng Cai, Jiuming Jiang, Wenhao Huang, Che Liu, Rossella Arcucci · PDF
  66. Towards Interpretable Time Series Foundation Models

    Matthieu Boileau, Philippe Helluy, Jérémy Pawlus, Svitlana Vyetrenko · PDF
  67. Towards Synthetic Data for Fine-tuning Tabular Foundation Models

    Magnus Bühler, Lennart Purucker, Frank Hutter · PDF
  68. Two-Stage Contrastive Language Electrocardiogram Pre-training for Fine-Grained Waveform Features

    HaitaoLi, Che Liu, Zhengyao Ding, Ziyi Liu, Zhengxing Huang · PDF
  69. W-LSTMix: A Hybrid Modular Forecasting Framework for Trend and Pattern Learning in Short-Term Load Forecasting

    SHIVAM DWIVEDI, Anuj Kumar, Harish Kumar Saravanan, Pandarasamy Arjunan · PDF
  70. When and How Unlabeled Data Provably Improve In-Context Learning

    Yingcong Li, Xiangyu Chang, Muti Kara, Xiaofeng Liu, Amit Roy-Chowdhury, Samet Oymak · PDF