NeurIPS 2024 Past Large language modelsEfficiency

Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning

AFM 2024

Submission deadline
Oct 5, 2024, 12:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (128)

Fetched from OpenReview (v2) on 2026-06-10.

  1. $\text{Transformer}^2$: Self-adaptive LLMs

    Qi Sun, Edoardo Cetin, Yujin Tang · PDF
  2. A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

    Hui Yuan, Yifan Zeng, Yue Wu, Huazheng Wang, Mengdi Wang, Liu Leqi · PDF
  3. Accelerated Preference Optimization for Large Language Model Alignment

    Jiafan He, Huizhuo Yuan, Quanquan Gu · PDF
  4. AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations

    Gaurav Verma, Rachneet Kaur, Nishan Srishankar, Zhen Zeng, Tucker Balch, Manuela Veloso · PDF
  5. Adapting Foundation Models via Training-free Dynamic Weight Interpolation

    Changdae Oh, Yixuan Li, Kyungwoo Song, Sangdoo Yun, Dongyoon Han · PDF
  6. Adapting Language Models via Token Translation

    Zhili Feng, Tanya Marwah, Nicolo Fusi, David Alvarez-Melis, Lester Mackey · PDF
  7. Adaptive LoRA Merging for Efficient Domain Incremental Learning

    Eric Nuertey Coleman, Luigi Quarantiello, Julio Hurtado, Vincenzo Lomonaco · PDF
  8. Adaptive World Models: Learning Behaviors by Latent Imagination Under Non-Stationarity

    Emiliyan Gospodinov, Vaisakh Shaj, Philipp Becker, Stefan Geyer, Gerhard Neumann · PDF
  9. Agent Skill Acquisition for LLMs via CycleQD

    So Kuroki, Taishi Nakamura, Takuya Akiba, Yujin Tang · PDF
  10. AgentMerge: Enhancing Generalization in Fine-Tuned LLM Agents

    Megh Thakkar, Léo Boisvert, Thibault Le Sellier de Chezelles, Alexandre Piché, Maxime Gasse, Alexandre Lacoste, Massimo Caccia · PDF
  11. AoP-SAM: Automation of Prompts for Efficient Segmentation

    Yi Chen, Muyoung Son, Chuanbo Hua, Joo-Young Kim · PDF
  12. APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding

    Xinyu Yang, Tianqi Chen, Beidi Chen · PDF
  13. Approximate Top-k for Increased Parallelism

    Oscar Key, Luka Ribar, Alberto Cattaneo, Luke Hudlass-Galley, Douglas Orr · PDF
  14. Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle

    Hui Dai, Ryan Teehan, Mengye Ren · PDF
  15. Assisted Few-Shot Learning for Vision-Language Models in Agricultural Stress Phenotype Identification

    Muhammad Arbab Arshad, Talukder Zaki Jubery, Asheesh K Singh, ARTI SINGH, Chinmay Hegde, Baskar Ganapathysubramanian, Aditya Balu, Adarsh Krishnamurthy, Soumik Sarkar · PDF
  16. Automated Design of Agentic Systems

    Shengran Hu, Cong Lu, Jeff Clune · PDF
  17. Automatically Generating Custom Context-Driven SFT Data for LLMs with Multi-Granularity

    Shanghaoran Quan · PDF
  18. Better Prompt Compression Without Multi-Layer Perceptrons

    Edouardo Honig, Andrew Lizarraga, Zijun Frank Zhang, Ying Nian Wu · PDF
  19. Can the Spectrum of the Neural Tangent Kernel Anticipate Fine-Tuning Performance?

    Zahra Rahimi Afzal, Tara Esmaeilbeig, Mojtaba Soltanalian, Mesrob I Ohannessian · PDF
  20. Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning?

    Bowen Zhao, Leo Parker Dirac, Paulina Varshavskaya · PDF
  21. CAT Pruning: Cluster-Aware Token Pruning For Text-to-Image Diffusion Models

    Xinle Cheng, Zhuoming Chen, Zhihao Jia · PDF
  22. CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing

    Wenhao Zheng, Yixiao Chen, Weitong Zhang, Souvik Kundu, Yun Li, Zhengzhong Liu, Eric P. Xing, Hongyi Wang, Huaxiu Yao · PDF
  23. Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs

    Megh Thakkar, Yash More, Quentin Fournier, Matthew Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das, Sarath Chandar · PDF
  24. Continuous Language Model Interpolation for Dynamic and Controllable Text Generation

    Sara Kangaslahti, David Alvarez-Melis · PDF
  25. Controlling Forgetting with Test-Time Data in Continual Learning

    Vaibhav Singh, Rahaf Aljundi, Eugene Belilovsky · PDF
  26. Controlling Multimodal LLMs via Reward-guided Decoding

    Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal · PDF
  27. COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

    Yuxi Xie, Anirudh Goyal, Xiaobao Wu, Xunjian Yin, Xiao Xu, Min-Yen Kan, Liangming Pan, William Yang Wang · PDF
  28. CTRL-O: Language-Controllable Object-Centric Visual Representation Learning

    Aniket Rajiv Didolkar, Andrii Zadaianchuk, Rabiul Awal, Maximilian Seitzer, Efstratios Gavves, Aishwarya Agrawal · PDF
  29. Data-Efficient Training by Evolved Sampling

    Ziheng Cheng, Zhong Li, Jiang Bian · PDF
  30. Deliberate Practice with Synthetic Data

    Reyhane Askari-Hemmat, Mohammad Pezeshki, Pietro Astolfi, Melissa Hall, Florian Bordes, Jakob Verbeek, Michal Drozdzal, Adriana Romero-Soriano · PDF
  31. Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models

    Ognjen Rudovic, Pranay Dighe, Yi Su, Vineet Garg, Sameer Dharur, Xiaochuan Niu, Ahmed Hussen Abdelaziz, Saurabh Adya, Ahmed Tewfik · PDF
  32. Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting

    Mudit Verma, Siddhant Bhambri, Subbarao Kambhampati · PDF
  33. Domain Adaptation for Robust Model Routing

    Christoph Dann, Yishay Mansour, Teodor Vanislavov Marinov, Mehryar Mohri · PDF
  34. DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach

    Daniel Gallo Fernández, Răzvan-Andrei Matișan, Alejandro Monroy Muñoz, Ana Maria Vasilcoiu, Janusz Partyka, Tin Hadži Veljković, Metod Jazbec · PDF
  35. Dynamic Subset Tuning: Expanding the Operational Range of Parameter-Efficient Training for Large Language Models

    Felix Stahlberg, Jared Lichtarge, Shankar Kumar · PDF
  36. Dynamically Managing a Prompt Pool via Self-Enhancement in Continual Learning

    Hayun Lee, Kiseong Hong, Hwanhee Lee, Sungho Suh, Eunwoo Kim · PDF
  37. Effective Text-to-Image Alignment with Quality Aware Pair Ranking

    Kunal Singh, Mukund Khanna, Pradeep Moturi · PDF
  38. Efficient Domain Adaptation of Robotic Foundation Models via Hypernetwork-Generated LoRA

    Zheng Xiong, Siddhant Sharma, Kang Li, Risto Vuorio, Shimon Whiteson · PDF
  39. Efficient Fine-Tuning of Image-Conditional Diffusion Models for Depth and Surface Normal Estimation

    Gonzalo Martin Garcia, Karim Abou Zeid, Christian Schmidt, Daan de Geus, Alexander Hermans, Bastian Leibe · PDF
  40. Efficient Transfer Learning driven by Layer-wise Features Aggregation

    Chanwoo Kim, Jeyoon Yeom, JOOWANG KIM, Suho Kang, Kyungwoo Song · PDF
  41. Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs

    Jonas Hübotter, Sascha Bongni, Ido Hakimi, Andreas Krause · PDF
  42. Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation

    Quanting Xie, So Yeon Min, Tianyi Zhang, Kedi Xu, Aarav Bajaj, Russ Salakhutdinov, Matthew Johnson-Roberson, Yonatan Bisk · PDF
  43. Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning

    Jiajun Chai, Sicheng Li, Yuqian Fu, Dongbin Zhao, Yuanheng Zhu · PDF
  44. Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation

    Manish Bhattarai, Minh N. Vu, Javier E. Santos, Ismael Boureima, Daniel O'Malley · PDF
  45. Enhancing Fine-Tuning Efficiency of LLMs Through Gradient Subspace Tracking

    Sahar Rajabi, Sirisha Rambhatla · PDF
  46. Enhancing Long Context Performance in LLMs Through Inner Loop Query Mechanism

    Yimin Tang, Yurong Xu, Ning Yan, Masood S. Mortazavi · PDF
  47. Enhancing Multi-Agent Multi-Modal Collaboration with Fine-Grained Reward Modeling

    Qian Yang, Weixiang Yan, Aishwarya Agrawal · PDF
  48. Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications

    Bo Wen, Xin Zhang · PDF
  49. Ensemble-based Offline Reinforcement Learning with Adaptive Behavior Cloning

    Danyang Wang, Lingsong Zhang · PDF
  50. Evaluating RAG System Performance: The Impact of Knowledge Cut-off and Fine-Tuning

    Omkar Dige, John Willes, D. B. Emerson · PDF
  51. Exploring Visual Prompt Tuning for Demographic Adaptation in Foundation Models for Medical Imaging

    Artur Parkhimchyk, Amirreza Naziri, Laleh Seyyed-Kalantari · PDF
  52. Extracting Parallelism from Large Language Model Queries

    Steven Kolawole, Keshav Santhanam, Virginia Smith, Pratiksha Thaker · PDF
  53. Fast and Accurate Language Model Decoding via Parallel Token Processing

    Zhepei Wei, Wei-Lin Chen, Xinyu Zhu, Yu Meng · PDF
  54. Fine-Grained Visual Recognition in the Age of Multimodal LLMs

    Hari Chandana Kuchibhotla, Abbavaram Gowtham Reddy, Sai Srinivas Kancheti, Vineeth N. Balasubramanian · PDF
  55. Fine-tuning LLM Agents with Retrospective In-Context Online Learning

    Wentse Chen, Jiayu Chen, Fahim Tajwar, Hao Zhu, Xintong Duan, Russ Salakhutdinov, Jeff Schneider · PDF
  56. FlashDP: Memory-Efficient and High-Throughput DP-SGD Training for Large Language Models

    Liangyu Wang, Junxiao Wang, Jie Ren, Zihang Xiang, David E. Keyes, Di Wang · PDF
  57. From One to Zero: RAG-IM Adapts Language Models for Interpretable Zero-Shot Clinical Predictions

    Sazan Mahbub, Caleb Ellington, Sina Alinejad, Kevin Wen, Yingtao Luo, Ben Lengerich, Eric P. Xing · PDF
  58. Fully-inductive Node Classification on Arbitrary Graphs

    Jianan Zhao, Mikhail Galkin, Hesham Mostafa, Michael M. Bronstein, Zhaocheng Zhu, Jian Tang · PDF
  59. Generating Diverse Negations from Affirmative Sentences

    Darian Marlis Rodriguez Vasquez, Afroditi Papadaki · PDF
  60. Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass

    Tong Chen, Hao Fang, Patrick Xia, Xiaodong Liu, Benjamin Van Durme, Luke Zettlemoyer, Jianfeng Gao, Hao Cheng · PDF
  61. GraphText: Graph Reasoning in Text Space

    Jianan Zhao, Le Zhuo, Yikang Shen, Meng Qu, Kai Liu, Michael M. Bronstein, Zhaocheng Zhu, Jian Tang · PDF
  62. IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning

    Soeun Lee, Si-Woo Kim, Taewhan Kim, Dong-Jin Kim · PDF
  63. Imbalance-Regularized LoRA: A Plug-and-Play Method for Improving Fine-Tuning of Foundation Models

    Zhenyu Zhu, Yongtao Wu, Quanquan Gu, Volkan Cevher · PDF
  64. Improving In-Context Learning with Small Language Model Ensembles

    M. Mehdi Mojarradi, Lingyi Yang, Robert McCraith, Adam Mahdi · PDF
  65. Improving Model Merging with Natural Niches

    João Abrantes, Robert Tjarko Lange, Yujin Tang · PDF
  66. In-Context Learning behaves as a greedy layer-wise gradient descent algorithm

    Brian K Chen, Tianyang Hu, Hui Jin, Hwee Kuan Lee, Kenji Kawaguchi · PDF
  67. Informed Tree of Thought: Cost-efficient Problem Solving with Large Language Models

    Sajad Mousavi, Desik Rengarajan, Ashwin Ramesh Babu, Sahand Ghorbanpour, Vineet Gundecha, Avisek Naug, Soumyendu Sarkar · PDF
  68. Instant Transformer Adaption via HyperLoRA

    Rujikorn Charakorn, Edoardo Cetin, Yujin Tang, Robert Tjarko Lange · PDF
  69. InstructRAG: Instructing Retrieval Augmented Generation via Self-Synthesized Rationales

    Zhepei Wei, Wei-Lin Chen, Yu Meng · PDF
  70. InvestAlign: Align LLMs with Investor Decision-Making under Herd Behavior

    Huisheng Wang, Zhuoshi Pan, Hangjing Zhang, Mingxiao Liu, Yiqing Lin, H. Vicky Zhao · PDF
  71. Is In-Context Learning Sufficient for Instruction Following in LLMs?

    Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion · PDF
  72. LangDA: Language-guided Domain Adaptive Semantic Segmentation

    Chang Liu, Saad Hossain, C Thomas, Kwei-Herng Lai, Raviteja Vemulapalli, Sirisha Rambhatla, Alexander Wong · PDF
  73. Leveraging Self Weak-supervision for Improved VLM Performance

    Shuvendu Roy, Ali Etemad · PDF
  74. LinkGPT: Teaching Large Language Models To Predict Missing Links

    Zhongmou He, Jing Zhu, Shengyi Qian, Joyce Chai, Danai Koutra · PDF
  75. Long Context RAG Performance of Large Language Models

    Quinn Leng, Jacob Portes, Sam Havens, Matei Zaharia, Michael Carbin · PDF
  76. LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

    Di Wu, Hongwei Wang, Wenhao Yu, Yuwei Zhang, Kai-Wei Chang, Dong Yu · PDF
  77. MagicPIG: LSH Sampling for Efficient LLM Generation

    Zhuoming Chen, Ranajoy Sadhukhan, Zihao Ye, Yang Zhou, Jianyu Zhang, Niklas Nolte, Yuandong Tian, Matthijs Douze, Leon Bottou, Zhihao Jia, Beidi Chen · PDF
  78. MD-DiT: Step-aware Mixture-of-Depths for Efficient Diffusion Transformers

    Mingzhu Shen, Pengtao Chen, Peng Ye, Guoxuan Xia, Tao Chen, Christos-Savvas Bouganis, Yiren Zhao · PDF
  79. Memory Efficient Continual Learning with CLIP Models

    Ryan King, Gang Li, Bobak J Mortazavi, Tianbao Yang · PDF
  80. MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees

    Ryan Zhang, Herbert Woisetschläger, Shiqiang Wang, Hans Arno Jacobsen · PDF
  81. metaTextGrad: Learning to learn with language models as optimizers

    Guowei Xu, Mert Yuksekgonul, Carlos Guestrin, James Zou · PDF
  82. Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention

    Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu · PDF
  83. MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

    Peng Xia, Kangyu Zhu, Haoran Li, Tianze Wang, Weijia Shi, Linjun Zhang, James Zou, Huaxiu Yao · PDF
  84. MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

    Peng Xia, Siwei Han, Shi Qiu, Yiyang Zhou, Zhaoyang Wang, Wenhao Zheng, Zhaorun Chen, Chenhang Cui, Mingyu Ding, Linjie Li, Lijuan Wang, Huaxiu Yao · PDF
  85. Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models

    Gang Li, Wendi Yu, Yao Yao, Wei Tong, Yingbin Liang, Qihang Lin, Tianbao Yang · PDF
  86. N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs

    Ilya Zisman, Alexander Nikulin, Andrei Polubarov, Lyubaykin Nikita, Vladislav Kurenkov · PDF
  87. Narrow Transformer: Mono-lingual Code SLM for Desktop

    Kamalkumar Rathinasamy, Balaji A J, Ankush Kumar, Gagan Gayari, Harshini K, Rajab Ali Mondal, Sreenivasa Raghavan K S, Swayam Singh, Mohammed Rafee Tarafdar · PDF
  88. NegMerge: Consensual Weight Negation for Strong Machine Unlearning

    Hyoseo Kim, Dongyoon Han, Junsuk Choe · PDF
  89. Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts

    Nikolas Gritsch, Qizhen Zhang, Acyr Locatelli, Sara Hooker, Ahmet Üstün · PDF
  90. OmniPredict: GPT-4o Enhanced Multi-modal Pedestrian Crossing Intention Prediction

    Je-Seok Ham, Jia Huang, Peng Jiang, Jinyoung Moon, Yongjin Kwon, Srikanth Saripalli, Changick Kim · PDF
  91. On Pre-training of Multimodal Language Models Customized for Chart Understanding

    Wan-Cyuan Fan, Yen-Chun Chen, Mengchen Liu, Lu Yuan, Leonid Sigal · PDF
  92. One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation

    Fabian Paischer, Lukas Hauzenberger, Thomas Schmied, Benedikt Alkin, Marc Peter Deisenroth, Sepp Hochreiter · PDF
  93. P3O: Pessimistic Preference-based Policy Optimization for Robust Alignment from Preferences

    Dhawal Gupta, Christoph Dann, Alekh Agarwal · PDF
  94. PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences

    Daiwei Chen, Yi Chen, Aniket Rege, Ramya Korlakai Vinayak · PDF
  95. Personalized Adaptation via In-Context Preference Learning

    Allison Lau, Younwoo Choi, Vahid Balazadeh, Keertana Chidambaram, Vasilis Syrgkanis, Rahul Krishnan · PDF
  96. Personalized Language Modeling from Personalized Human Feedback

    Xinyu Li, Ruiyang Zhou, Zachary Chase Lipton, Liu Leqi · PDF
  97. Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

    Joel Jang, Seungone Kim, Bill Yuchen Lin, Yizhong Wang, Jack Hessel, Luke Zettlemoyer, Hannaneh Hajishirzi, Yejin Choi, Prithviraj Ammanabrolu · PDF
  98. Personas within Parameters: Fine-Tuning Small Language Models with Low-Rank Adapters to Mimic User Behaviors

    Himanshu Thakur, Eshani Agrawal, Smruthi Mukund · PDF
  99. Pick Your Influencer: Being Selective is Good for Personalization

    Ashutosh Ranjan, Vivek Srivastava, Shirish Karande · PDF
  100. PM-Jewelry: Personalized Multimodal Adaptation for Virtual Jewelry Try-On with Latent Diffusion

    Yangfan He, Yinghui Xia, Jinfeng Wei, TIANYU SHI, Yang Jingsong · PDF
  101. Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer

    Yu Yang, Pan Xu · PDF
  102. Prompt Learning Based Adaptor for Enhanced Video Editing with Pretrained Text-to-Image Diffusion Models

    Yangfan He, Sida Li, Jianhui Wang · PDF
  103. RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems

    Jennifer Hsia, Afreen Shaikh, Zhiruo Wang, Graham Neubig · PDF
  104. REGENT: A Retrieval-Augmented Generalist Agent That Can Act in-Context In New Environments

    Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman, Insup Lee · PDF
  105. Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks

    Minju Seo, Jinheon Baek, James Thorne, Sung Ju Hwang · PDF
  106. SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents

    Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Xufang Luo, Hao Cheng, Dongsheng Li, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Jianfeng Gao · PDF
  107. Self-Play Preference Optimization for Language Model Alignment

    Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu · PDF
  108. Sirius: Contextual Sparsity with Correction for Efficient LLM

    Yang Zhou, Zhuoming Chen, Zhaozhuo Xu, Xi Victoria Lin, Beidi Chen · PDF
  109. Situated Instruction Following Under Ambiguous Human Intent

    So Yeon Min, Xavier Puig, Devendra Singh Chaplot, Tsung-Yen Yang, Akshara Rai, Priyam Parashar, Russ Salakhutdinov, Yonatan Bisk, Roozbeh Mottaghi · PDF
  110. Slaying the HyDRA: Parameter-Efficient Hyper Networks with Low-Displacement Rank Adaptation

    Xiangyu Chen, Ye Wang, Matthew Brand, Pu Perry Wang, Jing Liu, Toshiaki Koike-Akino · PDF
  111. SpikingVTG: Saliency Feedback Gating Enabled Spiking Video Temporal Grounding

    Malyaban Bal, Brian Matejek, Susmit Jha, Adam D. Cobb · PDF
  112. Synergistic Weak-Strong Collaboration by Aligning Preferences

    Yizhu Jiao, Xuchao Zhang, Zhaoyang Wang, Yubo Ma, Zhun Deng, Rujia Wang, Chetan Bansal, Saravan Rajmohan, Jiawei Han, Huaxiu Yao · PDF
  113. Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers

    Yingyu Liang, Zhenmei Shi, Zhao Song, Yufa Zhou · PDF
  114. Text as Images: Can Multimodal Large Language Models Follow Printed Instructions in Pixels?

    Xiujun Li, Yujie Lu, William Yang Wang, Yejin Choi · PDF
  115. Towards Conversational AI for Spina Bifida Care

    Asfandyar Azhar, Shaurjya Mandal, Nidhish Shah · PDF
  116. Towards Federated Low-Rank Adaptation with Rank Heterogeneity

    Yuji Byun, Jaeho Lee · PDF
  117. Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning

    Song Jiang, Da JU, Andrew Cohen, Sasha Mitts, Aaron Foss, Justine T Kao, Xian Li, Yuandong Tian · PDF
  118. Towards Personalized Language Models via Inference-time Human Preference Optimization

    Nikki Lijing Kuang, Wei Sun, Scott McFaddin, Yian Ma, Markus Ettl · PDF
  119. Towards Robust and Cost-Efficient Knowledge Unlearning for Large Language Models

    Sungmin Cha, Sungjun Cho, Dasol Hwang, Moontae Lee · PDF
  120. Transfer Learning for Finetuning Large Language Models

    Tobias Strangmann, Lennart Purucker, Jörg K.H. Franke, Ivo Rapant, Fabio Ferreira, Frank Hutter · PDF
  121. Uncertainty-Penalized Direct Preference Optimization

    Sam Houliston, Alizée Pace, Alexander Immer, Gunnar Ratsch · PDF
  122. Understanding Visual Concepts Across Models

    Brandon Trabucco, Max A Gurinas, Kyle Doherty, Russ Salakhutdinov · PDF
  123. Uniform Text-Motion Generation and Editing via Diffusion Model

    Ruoyu Wang, Xiang Li, Tengjiao Sun, Yangfan He, TIANYU SHI, yitingxie · PDF
  124. ViPCap: Retrieval Text-based Visual Prompts for Lightweight Image Captioning

    Taewhan Kim, Soeun Lee, Si-Woo Kim, Dong-Jin Kim · PDF
  125. Visual Language Alignment Tuning

    Le Zhang, Qian Yang, Aishwarya Agrawal · PDF
  126. Warmstarting for Scaling Language Models

    Neeratyoy Mallik, Maciej Janowski, Johannes Hog, Herilalaina Rakotoarison, Aaron Klein, Josif Grabocka, Frank Hutter · PDF
  127. XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

    Alexander Nikulin, Ilya Zisman, Alexey Zemtsov, Vladislav Kurenkov · PDF
  128. ZO-Offloading: Fine-Tuning LLMs with 100 Billion Parameters on a Single GPU

    Liangyu Wang, Jie Ren, Hang Xu, Junxiao Wang, David E. Keyes, Di Wang · PDF