ICML 2024PastLarge language models

ICML 2024 Workshop on Foundation Models in the Wild

ICML 2024 FM-Wild Workshop

Official website ↗OpenReview venue ↗See all ICML workshops →✎ Edit this entry

Submission deadline: Jun 8, 2024, 12:29 UTC
imported from OpenReview — check the website for extensions
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (95)

Fetched from OpenReview (v2) on 2026-06-10.

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs
Vlad Sobal, Mark Ibrahim, Randall Balestriero, Vivien Cabannes, Diane Bouchacourt, Pietro Astolfi, Kyunghyun Cho, Yann LeCun · PDF
A Critical Look At Tokenwise Reward-Guided Text Generation
Ahmad Rashid, Ruotian Wu, Julia Grosse, Agustinus Kristiadi, Pascal Poupart · PDF
Adapting LLM Agents with Universal Feedback in Communication
Kuan Wang, Yadong Lu, Michael Santacroce, Yeyun Gong, Chao Zhang, yelong shen · PDF
Adaptive Concept Bottleneck for Foundation Models
Jihye Choi, Jayaram Raghuram, Yixuan Li, Suman Banerjee, Somesh Jha · PDF
AdaptiveBackdoor: Backdoored Language Model Agents that Detect Human Overseers
Heng Wang, Ruiqi Zhong, Jiaxin Wen, Jacob Steinhardt · PDF
Adversarially Robust CLIP Models Induce Better (Robust) Perceptual Metrics
Francesco Croce, Christian Schlarmann, Naman Deep Singh, Matthias Hein · PDF
An Auditing Test to Detect Behavioral Shift in Language Models
Leo Richter, Nitin Agrawal, Xuanli He, Pasquale Minervini, Matt Kusner · PDF
An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Foundation Models
Scott C. Lowe, Joakim Bruslund Haurum, Sageev Oore, Thomas B. Moeslund, Graham W. Taylor · PDF
Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks
Antoni Kowalczuk, Jan Dubiński, Atiyeh Ashari Ghomi, Yi Sui, George Stein, Jiapeng Wu, Jesse C. Cresswell, Franziska Boenisch, Adam Dziedzic · PDF
Bilingual Adaptation of Monolingual Foundation Models
Gurpreet Gosal, Yishi Xu, Gokulakrishnan Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming Chen, Biswajit Mishra, Sunil Kumar Sahu, Neha Sengupta, Natalia Vassilieva, Joel Hestness, Samujjwal Ghosh, Bokang Jia, Onkar Arun Pandit, Satheesh Katipomu, Samta Kamboj, Rahul Pal, Parvez Mullah, Soundar Balaji Doraiswamy, Karim Chami, Preslav Nakov · PDF
Black-Box Detection of Language Model Watermarks
Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev · PDF
BUILD: Buffer-free Incremental Learning with OOD Detection for the Wild
Srishti Gupta, Daniele Angioni, Lea Schönherr, Ambra Demontis, Battista Biggio · PDF
Calibrated Self-Rewarding Vision Language Models
Yiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao · PDF
CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Peng Xia, Ze Chen, Juanxi Tian, Gong Yangrui, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, Junzhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun, Zongyuan Ge, Gang Li, James Zou, Huaxiu Yao · PDF
CharED: Character-wise Ensemble Decoding for Large Language Models
Kevin Gu, Eva Tuecke, Dmitriy A Katz, Raya Horesh, David Alvarez-Melis, Mikhail Yurochkin · PDF
Code Agents are State of The Art Software Testers
Niels Mündler, Mark Niklas Mueller, Jingxuan He, Martin Vechev · PDF
Combining Pre-trained LoRA Modules Improves Few-shot Adaptation of Foundation Models to New Tasks
Nader Asadi, Mahdi Beitollahi, Yasser H. Khalil, Yinchuan Li, Guojun Zhang, Xi Chen · PDF
ContextCite: Attributing Model Generation to Context
Benjamin Cohen-Wang, Harshay Shah, Kristian Georgiev, Aleksander Madry · PDF
Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?
Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith · PDF
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar · PDF
DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection
Yewon Lim, Changyeon Lee, Aerin Kim, Oren Etzioni · PDF
Domain-Aware Fine-Tuning of Foundation Models
Uğur Ali Kaplan, Yumeng Li, Margret Keuper, Anna Khoreva, Dan Zhang · PDF
Dual Risk Minimization for Robust Fine-tuning of Zero-Shot Models
Kaican Li, Weiyan Xie, Ricardo Silva, Nevin L. Zhang · PDF
Efficient Evolutionary Search over Chemical Space with Large Language Models
Haorui Wang, Marta Skreta, Yuanqi Du, Wenhao Gao, Lingkai Kong, Cher Tian Ser, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Alan Aspuru-Guzik, Kirill Neklyudov, Chao Zhang · PDF
End-To-End Causal Effect Estimation from Unstructured Natural Language Data
Nikita Dhawan, Leonardo Cotta, Karen Ullrich, Rahul Krishnan, Chris J. Maddison · PDF
Estimating Probability Densities of Tabular Data using a Transformer Model combined with Denoising Diffusion
Henry W. Leung, Jo Bovy, Joshua S. Speagle · PDF
Evaluating Self-Supervised Foundation Models in Holographic Imaging
Silas Dietler, Yanick Zeder, Elias Graf, Kilian Koch, Andreas Schwendimann, Tommaso Bendinelli · PDF
Evaluation of RAG Metrics for Question Answering in the Telecom Domain
Sujoy Roychowdhury, Sumit Soman, H. G. Ranjani, Neeraj Gunda, Vansh Chhabra, SAI KRISHNA BALA · PDF
ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts
Samar Khanna, Medhanie Irgau, David B. Lobell, Stefano Ermon · PDF
Extracting Training Data from Document-Based VQA Models
Francesco Pinto, Nathalie Rauschmayr, Florian Tramèr, Philip Torr, Federico Tombari · PDF
Extrapolative Protein Design through Triplet-based Preference Learning
Mostafa Karimi, Sharmi Banerjee, Tommi Jaakkola, Bella Dubrov, Shang Shang, Ron Benson · PDF
Federated Fine-Tuning of Vision Foundation Models via Probabilistic Masking
Vasileios Tsouvalas, Yuki M Asano, Aaqib Saeed · PDF
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
Lukas Struppek, Dominik Hintersdorf, Kristian Kersting, Adam Dziedzic, Franziska Boenisch · PDF
FoMu-SSL: Foundation Model-Guided Multi-Sensor Self-Supervised Learning for Remote Sensing
Dabin Seo, Haeji Jung, Jinkyu Kim · PDF
Generalization vs. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data
Antonis Antoniades, Xinyi Wang, Yanai Elazar, Alfonso Amayuelas, Alon Albalak, Kexun Zhang, William Yang Wang · PDF
Geometric Median Matching for Robust Data Pruning
Anish Acharya, Inderjit S Dhillon, Sujay Sanghavi · PDF
GROD: Enhancing Generalization of Transformer with Out-of-Distribution Detection
Yijin Zhou, Yu Guang Wang · PDF
Improving GFlowNets for Text-to-Image Diffusion Alignment
Dinghuai Zhang, Yizhe Zhang, Jiatao Gu, Ruixiang ZHANG, Joshua M. Susskind, Navdeep Jaitly, Shuangfei Zhai · PDF
Improving Graph-Language Alignment with Hierarchical Graph Tokenization
Yongqiang Chen, Quanming Yao, Juzheng Zhang, James Cheng, Yatao Bian · PDF
In Search of Forgotten Domain Generalization
Prasanna Mayilvahanan, Roland S. Zimmermann, Thaddäus Wiedemer, Evgenia Rusak, Attila Juhos, Matthias Bethge, Wieland Brendel · PDF
In-Context Learning Improves Compositional Understanding of Vision-Language Models
Matteo Nulli, Anesa Ibrahimi, Avik Pal, Hoshe Lee, Ivona Najdenkoska · PDF
Inference Performance Optimization for Large Language Models on CPUs
Pujiang He, Shan Zhou, Wenhuan Huang, Changqing Li, Duyi Wang, Bin Guo, Chen Meng, Sheng Gui, Weifei Yu, Yi Xie · PDF
InstructBooth: Instruction-following Personalized Text-to-Image Generation
Daewon Chae, Nokyung Park, Jinkyu Kim, Kimin Lee · PDF
Instruction Tuning With Loss Over Instructions
Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani · PDF
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data
Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Tomasz Korbak, Henry Sleight, Rajashree Agrawal, John Hughes, Dhruv Bhandarkar Pai, Andrey Gromov, Dan Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo · PDF
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF
TaiMing Lu, Lingfeng Shen, Xinyu Yang, Weiting Tan, Beidi Chen, Huaxiu Yao · PDF
Jogging the Memory of Unlearned Models Through Targeted Relearning Attacks
Shengyuan Hu, Yiwei Fu, Steven Wu, Virginia Smith · PDF
Language Model-In-The-Loop: Data Optimal Approach to Recommend Actions in Text Games
Arjun V Sudhakar, Prasanna Parthasarathi, Janarthanan Rajendran, Sarath Chandar · PDF
Leveraging Generative Foundation Models for Domain Generalization
Sobhan Hemati, Mahdi Beitollahi, Amir Hossein Estiri, Bassel Al Omari, Xi Chen, Guojun Zhang · PDF
LIFTED: Multimodal Mixture-of-Experts for Clinical Trial Outcome Prediction
Wenhao Zheng, Dongshen Peng, Hongxia Xu, Yun Li, Hongtu Zhu, Tianfan Fu, Huaxiu Yao · PDF
LLM Task Interference: Impact of Task-Switch in Conversational History
Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz · PDF
LoRD: Low-Rank Decomposition of Monolingual Code LLMs for One-Shot Compression
Ayush Kaushal, Tejas Vaidhya, Irina Rish · PDF
Merging Improves Self-Critique Against Jailbreak Attacks
Victor Gallego · PDF
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge?
Zhaorun Chen, Yichao Du, Zichen Wen, Yiyang Zhou, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Leria HUANG, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao · PDF
Model Breadcrumbs: Scalable Upcycling of Finetuned Foundation Models via Sparse Task Vectors Merging
MohammadReza Davari, Eugene Belilovsky · PDF
MoRe Fine-Tuning with 10x Fewer Parameters
Wenxuan Tan, Nicholas Roberts, Tzu-Heng Huang, Jitian Zhao, John Cooper, Samuel Guo, Chengyu Duan, Frederic Sala · PDF
Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators
Jianhao Yuan, Francesco Pinto, Adam Davies, Philip Torr · PDF
On the Discrepancy and Connection between Memorization and Generation in Diffusion Models
Hanyu Wang, Yujin Han, Difan Zou · PDF
On the Privacy Risks of Post-Hoc Explanations of Foundation Models
Catherine Huang, Martin Pawelczyk, Himabindu Lakkaraju · PDF
Open LLMs are Necessary for Private Adaptations and Outperform their Closed Alternatives
Vincent Hanke, Tom Blanchard, Franziska Boenisch, Iyiola Emmanuel Olatunji, Michael Backes, Adam Dziedzic · PDF
OTTER: Effortless Label Distribution Adaptation of Zero-shot Models
Changho Shin, Jitian Zhao, Sonia Cromp, Harit Vishwakarma, Frederic Sala · PDF
Out-Of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions
Leonardo Cotta, Chris J. Maddison · PDF
PanSAM: Zero-Shot, Prompt-Free Pancreas Segmentation in CT Imaging
Abolfazl Malekahmadi, Mohammad Taha Teimuri Jervakani, Armin Behnamnia, Zahra Dehghanian, Amir Shamloo, Hamid R. Rabiee · PDF
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis
Sagar Srinivas Sakhinana, Sannidhi Gowri Naga Krishna Geethan, Chidaksh Ravuru, Venkataramana Runkana · PDF
PLUTO: Pathology-Universal Transformer
Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma L Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, Jennifer A. Hipp, Darren Fahy, Benjamin Glass, Eric E. Walk, John Abel, Harsha Vardhan pokkalla, Andrew H. Beck, Sean Grullon · PDF
POST: A Framework for Privacy of Soft-prompt Transfer
Xun Wang, Jing Xu, Franziska Boenisch, Michael Backes, Adam Dziedzic · PDF
Pretrained Hybrids with MAD Skills
Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi GNVV, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala · PDF
Privacy Auditing of Large Language Models
Ashwinee Panda, Xinyu Tang, Milad Nasr, Christopher A. Choquette-Choo, Prateek Mittal · PDF
Private Fine-tuning of Large Language Models with Zeroth-order Optimization
Xinyu Tang, Ashwinee Panda, Milad Nasr, Saeed Mahloujifar, Prateek Mittal · PDF
Projected Language Models: A Large Model Pre-Segmented Into Smaller Ones
David Grangier, Angelos Katharopoulos, Pierre Ablin, Awni Hannun · PDF
Quantum 3D Visual Grounding: A Step Towards Quantum-inspired AI-Visualization
Adib Bazgir, Rama chandra Praneeth Madugula, Yuwen Zhang · PDF
Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters
Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Viswanath Ganapathy, Rafael Esteves, Shreya Kadambi, Shubhankar Borse, Paul Whatmough, Risheek Garrepalli, Mart Van Baalen, Harris Teague, Markus Nagel · PDF
Recursive Introspection: Teaching LLM Agents How to Self-Improve
Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar · PDF
RNR: Teaching Large Language Models to Follow Roles and Rules
Kuan Wang, Alexander Bukharin, Haoming Jiang, Qingyu Yin, Zhengyang Wang, Tuo Zhao, Jingbo Shang, Chao Zhang, Bing Yin, Xian Li, Jianshu Chen, Shiyang Li · PDF
RouteFinder: Towards Foundation Models for Vehicle Routing Problems
Federico Berto, Chuanbo Hua, Nayeli Gast Zepeda, André Hottung, Niels Wouda, Leon Lan, Kevin Tierney, Jinkyoo Park · PDF
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
Rishit Dagli, Shivesh Prakash, Robert Wu, Houman Khosravani · PDF
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
Min Cai, Yuchen Zhang, Shichang Zhang, Fan Yin, Difan Zou, Yisong Yue, Ziniu Hu · PDF
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
Jiatong Han, Jannik Kossen, Muhammed Razzak, Lisa Schut, Shreshth A Malik, Yarin Gal · PDF
Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
Swanand Kadhe, Farhan Ahmed, Dennis Wei, Nathalie Baracaldo, Inkit Padhi · PDF
Strong Copyright Protection for Language Models via Adaptive Model Fusion
Javier Abad, Konstantin Donhauser, Francesco Pinto, Fanny Yang · PDF
Test-Time Prototype Evolution for Generalizable Vision-Language Models
Ce Zhang, Simon Stepputtis, Katia P. Sycara, Yaqi Xie · PDF
The Effect of Data Corruption on Multimodal Long Form Responses
Daniel Z Kaplan, Alexis Roger, Mohamed Osman, Irina Rish · PDF
TimeDiT: General-purpose Diffusion Transformers for Time Series Foundation Model
Defu Cao, Wen Ye, Yan Liu · PDF
Towards Safe Large Language Models for Medicine
Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju · PDF
TriLM vs FloatLM: Ternary LLMs are more Performant than Quantized FP16 LLMs
Ayush Kaushal, Tejas Vaidhya, Tejas Pandey, Aaryan Bhagat, Irina Rish · PDF
Two-Level Test-Time Adaptation in Multimodal Learning
Jixiang Lei, Franz Pernkopf · PDF
Understanding the Role of Functional Diversity in Weight-Ensembling with Ingredient Selection and Multidimensional Scaling
Alex Rojas, David Alvarez-Melis · PDF
Unsupervised Feature Extraction from a Foundation Model Zoo for Cell Similarity Search in Oncological Microscopy Across Devices
Gabriel Kalweit, Anusha Klett, Mehdi Naouar, Jens Rahnfeld, Yannick Vogt, Diana Laura Infante Ramirez, Rebecca Berger, Jesus Duque Afonso, Tanja Nicole Hartmann, Marie Follo, Michael Luebbert, Roland Mertelsmann, Evelyn Ullrich, Joschka Boedecker, Maria Kalweit · PDF
Unveiling CLIP Dynamics: Linear Mode Connectivity and Generalization
Alireza Abdollahpourrostam, Amartya Sanyal, Seyed-Mohsen Moosavi-Dezfooli · PDF
USCILab3D: A Large-scale, Long-term, Semantically Annotated Outdoor Dataset
Kiran Lekkala, Henghui Bao, Peixu Cai, Wei Zer Lim, Chen Liu, Laurent Itti · PDF
VFA: Vision Frequency Analysis of Foundation Models and Human
Mohammad Javad Darvishi Bayazi, Md Rifat Arefin, Jocelyn Faubert, Irina Rish · PDF
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen, Oier Mees, Aviral Kumar, Sergey Levine · PDF
Waterfall: Framework for Robust and Scalable Text Watermarking
Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low · PDF
When Do Language Models Need to Be Large?
Zhixun Chen, Yali Du, David Henry Mguni · PDF
Zero-Shot Generalization of GNNs over Distinct Attribute Domains
Yangyi Shen, Beatrice Bevilacqua, Joshua Robinson, Charilaos Kanatsoulis, Jure Leskovec, Bruno Ribeiro · PDF

Accepted papers (95)

☆$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs

☆A Critical Look At Tokenwise Reward-Guided Text Generation

☆Adapting LLM Agents with Universal Feedback in Communication

☆Adaptive Concept Bottleneck for Foundation Models

☆AdaptiveBackdoor: Backdoored Language Model Agents that Detect Human Overseers

☆Adversarially Robust CLIP Models Induce Better (Robust) Perceptual Metrics

☆An Auditing Test to Detect Behavioral Shift in Language Models

☆An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Foundation Models

☆Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks

☆Bilingual Adaptation of Monolingual Foundation Models

☆Black-Box Detection of Language Model Watermarks

☆BUILD: Buffer-free Incremental Learning with OOD Detection for the Wild

☆Calibrated Self-Rewarding Vision Language Models

☆CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

☆CharED: Character-wise Ensemble Decoding for Large Language Models

☆Code Agents are State of The Art Software Testers

☆Combining Pre-trained LoRA Modules Improves Few-shot Adaptation of Foundation Models to New Tasks

☆ContextCite: Attributing Model Generation to Context

☆Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

☆DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

☆DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

☆Domain-Aware Fine-Tuning of Foundation Models

☆Dual Risk Minimization for Robust Fine-tuning of Zero-Shot Models

☆Efficient Evolutionary Search over Chemical Space with Large Language Models

☆End-To-End Causal Effect Estimation from Unstructured Natural Language Data

☆Estimating Probability Densities of Tabular Data using a Transformer Model combined with Denoising Diffusion

☆Evaluating Self-Supervised Foundation Models in Holographic Imaging

☆Evaluation of RAG Metrics for Question Answering in the Telecom Domain

☆ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

☆Extracting Training Data from Document-Based VQA Models

☆Extrapolative Protein Design through Triplet-based Preference Learning

☆Federated Fine-Tuning of Vision Foundation Models via Probabilistic Masking

☆Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models

☆FoMu-SSL: Foundation Model-Guided Multi-Sensor Self-Supervised Learning for Remote Sensing

☆Generalization vs. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

☆Geometric Median Matching for Robust Data Pruning

☆GROD: Enhancing Generalization of Transformer with Out-of-Distribution Detection

☆Improving GFlowNets for Text-to-Image Diffusion Alignment

☆Improving Graph-Language Alignment with Hierarchical Graph Tokenization

☆In Search of Forgotten Domain Generalization

☆In-Context Learning Improves Compositional Understanding of Vision-Language Models

☆Inference Performance Optimization for Large Language Models on CPUs

☆InstructBooth: Instruction-following Personalized Text-to-Image Generation

☆Instruction Tuning With Loss Over Instructions

☆Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

☆It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

☆Jogging the Memory of Unlearned Models Through Targeted Relearning Attacks

☆Language Model-In-The-Loop: Data Optimal Approach to Recommend Actions in Text Games

☆Leveraging Generative Foundation Models for Domain Generalization

☆LIFTED: Multimodal Mixture-of-Experts for Clinical Trial Outcome Prediction

☆LLM Task Interference: Impact of Task-Switch in Conversational History

☆LoRD: Low-Rank Decomposition of Monolingual Code LLMs for One-Shot Compression

☆Merging Improves Self-Critique Against Jailbreak Attacks

☆MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge?

☆Model Breadcrumbs: Scalable Upcycling of Finetuned Foundation Models via Sparse Task Vectors Merging

☆MoRe Fine-Tuning with 10x Fewer Parameters

☆Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators

☆On the Discrepancy and Connection between Memorization and Generation in Diffusion Models

☆On the Privacy Risks of Post-Hoc Explanations of Foundation Models

☆Open LLMs are Necessary for Private Adaptations and Outperform their Closed Alternatives

☆OTTER: Effortless Label Distribution Adaptation of Zero-shot Models

☆Out-Of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions

☆PanSAM: Zero-Shot, Prompt-Free Pancreas Segmentation in CT Imaging

☆Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis

☆PLUTO: Pathology-Universal Transformer

☆POST: A Framework for Privacy of Soft-prompt Transfer

☆Pretrained Hybrids with MAD Skills

☆Privacy Auditing of Large Language Models

☆Private Fine-tuning of Large Language Models with Zeroth-order Optimization

☆Projected Language Models: A Large Model Pre-Segmented Into Smaller Ones

☆Quantum 3D Visual Grounding: A Step Towards Quantum-inspired AI-Visualization

☆Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters

☆Recursive Introspection: Teaching LLM Agents How to Self-Improve

☆RNR: Teaching Large Language Models to Follow Roles and Rules

☆RouteFinder: Towards Foundation Models for Vehicle Routing Problems

☆SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

☆Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller

☆Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs

☆Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs

A Critical Look At Tokenwise Reward-Guided Text Generation

Adapting LLM Agents with Universal Feedback in Communication

Adaptive Concept Bottleneck for Foundation Models

AdaptiveBackdoor: Backdoored Language Model Agents that Detect Human Overseers

Adversarially Robust CLIP Models Induce Better (Robust) Perceptual Metrics

An Auditing Test to Detect Behavioral Shift in Language Models

An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Foundation Models

Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks

Bilingual Adaptation of Monolingual Foundation Models

Black-Box Detection of Language Model Watermarks

BUILD: Buffer-free Incremental Learning with OOD Detection for the Wild

Calibrated Self-Rewarding Vision Language Models

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

CharED: Character-wise Ensemble Decoding for Large Language Models

Code Agents are State of The Art Software Testers

Combining Pre-trained LoRA Modules Improves Few-shot Adaptation of Foundation Models to New Tasks

ContextCite: Attributing Model Generation to Context

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Domain-Aware Fine-Tuning of Foundation Models

Dual Risk Minimization for Robust Fine-tuning of Zero-Shot Models

Efficient Evolutionary Search over Chemical Space with Large Language Models

End-To-End Causal Effect Estimation from Unstructured Natural Language Data

Estimating Probability Densities of Tabular Data using a Transformer Model combined with Denoising Diffusion

Evaluating Self-Supervised Foundation Models in Holographic Imaging

Evaluation of RAG Metrics for Question Answering in the Telecom Domain

ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

Extracting Training Data from Document-Based VQA Models

Extrapolative Protein Design through Triplet-based Preference Learning

Federated Fine-Tuning of Vision Foundation Models via Probabilistic Masking

Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models

FoMu-SSL: Foundation Model-Guided Multi-Sensor Self-Supervised Learning for Remote Sensing

Generalization vs. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

Geometric Median Matching for Robust Data Pruning

GROD: Enhancing Generalization of Transformer with Out-of-Distribution Detection

Improving GFlowNets for Text-to-Image Diffusion Alignment

Improving Graph-Language Alignment with Hierarchical Graph Tokenization

In Search of Forgotten Domain Generalization

In-Context Learning Improves Compositional Understanding of Vision-Language Models

Inference Performance Optimization for Large Language Models on CPUs

InstructBooth: Instruction-following Personalized Text-to-Image Generation

Instruction Tuning With Loss Over Instructions

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

Jogging the Memory of Unlearned Models Through Targeted Relearning Attacks

Language Model-In-The-Loop: Data Optimal Approach to Recommend Actions in Text Games

Leveraging Generative Foundation Models for Domain Generalization

LIFTED: Multimodal Mixture-of-Experts for Clinical Trial Outcome Prediction

LLM Task Interference: Impact of Task-Switch in Conversational History

LoRD: Low-Rank Decomposition of Monolingual Code LLMs for One-Shot Compression

Merging Improves Self-Critique Against Jailbreak Attacks

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge?

Model Breadcrumbs: Scalable Upcycling of Finetuned Foundation Models via Sparse Task Vectors Merging

MoRe Fine-Tuning with 10x Fewer Parameters

Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators

On the Discrepancy and Connection between Memorization and Generation in Diffusion Models

On the Privacy Risks of Post-Hoc Explanations of Foundation Models

Open LLMs are Necessary for Private Adaptations and Outperform their Closed Alternatives

OTTER: Effortless Label Distribution Adaptation of Zero-shot Models

Out-Of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions

PanSAM: Zero-Shot, Prompt-Free Pancreas Segmentation in CT Imaging

Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis

PLUTO: Pathology-Universal Transformer

POST: A Framework for Privacy of Soft-prompt Transfer

Pretrained Hybrids with MAD Skills

Privacy Auditing of Large Language Models

Private Fine-tuning of Large Language Models with Zeroth-order Optimization

Projected Language Models: A Large Model Pre-Segmented Into Smaller Ones

Quantum 3D Visual Grounding: A Step Towards Quantum-inspired AI-Visualization

Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters

Recursive Introspection: Teaching LLM Agents How to Self-Improve

RNR: Teaching Large Language Models to Follow Roles and Rules

RouteFinder: Towards Foundation Models for Vehicle Routing Problems

SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs

Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

Strong Copyright Protection for Language Models via Adaptive Model Fusion