NeurIPS 2024 Past Other

NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning: Principles and Scalability

FITML 2024

Submission deadline
Oct 1, 2024, 23:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (88)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Layer Selection Approach to Test Time Adaptation

    Sabyasachi Sahoo, Mostafa ElAraby, Jonas Ngnawe, Yann Batiste Pequignot, Frederic Precioso, Christian Gagné · PDF
  2. A Tensor-based Convolutional Neural Network for Small Dataset Classification

    Zhenhua Chen, David J. Crandall · PDF
  3. Accelerating Direct Preference Optimization with Prefix Sharing

    Franklin Wang, Sumanth Hegde · PDF
  4. ActNAS : Generating Efficient YOLO Models using Activation NAS

    Sudhakar Sah, Ravish Kumar, Darshan C Ganji, Ehsan Saboori · PDF
  5. Adapting Language Models via Token Translation

    Zhili Feng, Tanya Marwah, Nicolo Fusi, David Alvarez-Melis, Lester Mackey · PDF
  6. Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models

    Zeman Li, Xinwei Zhang, Peilin Zhong, Yuan Deng, Meisam Razaviyayn, Vahab Mirrokni · PDF
  7. An empirical study of CLIP fine-tuning with similarity clusters

    Shixuan Liu, Yiwei Lyu, Honglak Lee, Todd C Hollon · PDF
  8. Analysing Softmax Entropy Minimization for Adaptating Multitask Models at Test-time

    Soumyajit Chatterjee, Abhirup Ghosh, Fahim Kawsar, Mohammad Malekzadeh · PDF
  9. Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs

    Yung-Chieh Chan, George Pu, Apaar Shanker, Parth Suresh, Penn Jenks, John Heyer, Samuel Marc Denton · PDF
  10. Best Unpacking DPO and PPO: Disentangling Practices for Learning from Preference Feedback

    Hamish Ivison, Yizhong Wang, Jiacheng Liu, Zeqiu Wu, Valentina Pyatkin, Nathan Lambert, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi · PDF
  11. Characterizing the Training Dynamics of Private Fine-tuning with Langevin diffusion

    Shuqi Ke, Charlie Hou, Sewoong Oh, Giulia Fanti · PDF
  12. COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences

    Yixin Liu, Argyris Oikonomou, Weiqiang Zheng, Yang Cai, Arman Cohan · PDF
  13. Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization

    Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, Aditya Grover · PDF
  14. CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation

    Ingo Ziegler, Abdullatif Köksal, Desmond Elliott, Hinrich Schuetze · PDF
  15. Deep Reinforcement Learning Without Experience Replay, Target Networks, or Batch Updates

    Mohamed Elsayed, Gautham Vasan, A. Rupam Mahmood · PDF
  16. Discrepancy-Guided Parameter Suppression for Robust Fine-tuning

    Chang Liu, Jingyu Ma · PDF
  17. DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent

    Taiyi Wang, Zhihao Wu, Jianheng Liu, Derek Yuen, Jianye HAO, Jun Wang, Kun Shao · PDF
  18. E-Tamba: Efficient Transformer-Mamba Layer Transplantation

    DAZHI PENG, Hangrui Cao · PDF
  19. Early Exiting in Deep Neural Networks via Dirichlet-based Uncertainty Quantification

    Feng Xia, Jake Snell, Thomas L. Griffiths · PDF
  20. Effective Text-to-Image Alignment with Quality Aware Pair Ranking

    Kunal Singh, Mukund Khanna, Pradeep Moturi · PDF
  21. Efficient Fine-Tuning of Behavior Cloned Policies with Reinforcement Learning from Limited Demonstrations

    Samyeul Noh, Seonghyun Kim, Ingook Jang · PDF
  22. Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs

    Jonas Hübotter, Sascha Bongni, Ido Hakimi, Andreas Krause · PDF
  23. Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation

    Manish Bhattarai, Javier E. Santos, Ismael Boureima, Daniel O'Malley · PDF
  24. Ensembling Finetuned Language Models for Text Classification

    Sebastian Pineda Arango, Maciej Janowski, Lennart Purucker, Arber Zela, Frank Hutter, Josif Grabocka · PDF
  25. Entropic Distribution Matching for Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

    Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Ruoyu Sun, Zhi-Quan Luo · PDF
  26. Evaluating Fine-Tuning Efficiency of Human-Inspired Learning Strategies in Medical Question Answering

    Yushi Yang, Andrew Michael Bean, Robert McCraith, Adam Mahdi · PDF
  27. Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model

    Divyanshu Aggarwal, Sankarshan Damle, Navin Goyal, Satya Lokam, Sunayana Sitaram · PDF
  28. Faster, More Efficient RLHF through Off-Policy Asynchronous Learning

    Michael Noukhovitch, Shengyi Huang, Sophie Xhonneux, Arian Hosseini, Rishabh Agarwal, Aaron Courville · PDF
  29. FedEx-LoRA: Exact Aggregation for Federated Parameter-Efficient Fine-Tuning of Foundation Models

    Raghav Singhal, Kaustubh Ponkshe, Praneeth Vepakomma · PDF
  30. Fine tuning language models to align fidelity and efficiency of generative retrieval in multi-turn dialogues

    Jeremy Curuksu · PDF
  31. Fine-tuning Vision Classifiers On A Budget

    Sunil Kumar, Ted Sandler, Paulina Varshavskaya · PDF
  32. Fitness Aware Human Motion Generation with Fine-Tuning

    Kiril Bikov, Shiye Su, Deepro Choudhury, Zhilin Guo, Weihao Xia, Mehmet Salih Çeliktenyıldız, Chenliang Zhou, Param Hanji, Cengiz Oztireli · PDF
  33. Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape

    Tao Li, Zhengbao He, Yujun Li, Yasheng Wang, Lifeng Shang, Xiaolin Huang · PDF
  34. Flexora: Flexible Low-Rank Adaptation for Large Language Models

    Chenxing Wei, Yao Shu, Ying Tiffany He, Fei Yu · PDF
  35. FourierKAN outperforms MLP on Text Classification Head Fine-tuning

    Abdullah Al Imran, Md Farhan Ishmam · PDF
  36. FRACTAL: Fine-Grained Scoring from Aggregate Text Labels

    Yukti Makhija, Priyanka Agrawal, Rishi Saket, Aravindan Raghuveer · PDF
  37. GaLore-mini: Low Rank Gradient Learning with Fewer Learning Rates

    Weihao Huang, Zhenyu Zhang, Yushun Zhang, Zhi-Quan Luo, Ruoyu Sun, Zhangyang Wang · PDF
  38. Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization

    Haoyuan Sun, Bo Xia, Yongzhe Chang, Xueqian Wang · PDF
  39. Hierarchical Unlearning Framework for Multi-Class Classification

    Abraham Chan, Arpan Gujarati, Karthik Pattabiraman, Sathish Gopalakrishnan · PDF
  40. HyperDPO: Conditioned One-Shot Multi-Objective Fine-Tuning Framework

    Yinuo Ren, Tesi Xiao, Michael Shavlovsky, Lexing Ying, Holakou Rahmanian · PDF
  41. ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning

    Jaedong Hwang, Brian Cheung, Zhang-Wei Hong, Akhilan Boopathy, Pulkit Agrawal, Ila R Fiete · PDF
  42. Improving Fine-Tuning with Latent Cluster Correction

    Cédric Ho Thanh · PDF
  43. Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization

    Hao Sun, Thomas Pouplin, Nicolás Astorga, Tennison Liu, Mihaela van der Schaar · PDF
  44. Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples

    Noël Vouitsis, Rasa Hosseinzadeh, Brendan Leigh Ross, Valentin Villecroze, Satya Krishna Gorti, Jesse C. Cresswell, Gabriel Loaiza-Ganem · PDF
  45. Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional Networks

    David Danhofer · PDF
  46. Instant Transformer Adaption via HyperLoRA

    Rujikorn Charakorn, Edoardo Cetin, Yujin Tang, Robert Tjarko Lange · PDF
  47. Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning

    Simran Kaur, Simon Park, Anirudh Goyal, Sanjeev Arora · PDF
  48. Investigating the Role of Fine-Tuning in Addressing the Gap Between Synthetic and Real Data in Generative Foundation Models

    Leonhard Hennicke, Christian Medeiros Adriano, Holger Giese, Lukas Schott, Jan Mathias Koehler · PDF
  49. Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment

    Chenliang Li, Siliang Zeng, Zeyi Liao, Jiaxiang Li, Dongyeop Kang, Alfredo Garcia, Mingyi Hong · PDF
  50. Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective

    Ethan Harvey, Mikhail Petrov, Michael C Hughes · PDF
  51. LLM Alignment Through Successive Policy Re-weighting (SPR)

    Xinnan Zhang, Siliang Zeng, Jiaxiang Li, Kaixiang Lin, Mingyi Hong · PDF
  52. Mastering Task Arithmetic: $\tau$Jp as a Key Indicator for Weight Disentanglement

    Kotaro Yoshida, Yuji Naraki, Takafumi Horie, Ryosuke Yamaki, Ryotaro Shimizu, Yuki Saito, Julian McAuley, Hiroki Naganuma · PDF
  53. Memory retaining finetuning via distillation

    Zitong Yang, Aonan Zhang, Sam Wiseman, Xiang Kong, Ke Ye, Dong Yin · PDF
  54. Model Soup for Better RLHF: Weight Space Averaging to Improve Alignment in LLMs

    Atoosa Chegini, Hamid Kazemi, Seyed Iman Mirzadeh, Dong Yin, Maxwell Horton, Moin Nabi, Mehrdad Farajtabar, Keivan Alizadeh · PDF
  55. MPLoRA: Orthogonal Multi-Path Low-Rank Adaptation for Parameter Efficient Fine-Tuning

    Junhan Shi, Fulin Wang, Qing Li, Yong Jiang · PDF
  56. Navigating Parameter Space with Geodesic Interpolation: A New Approach to Efficient Fine-Tuning

    Sophia Abraham · PDF
  57. Noise Stability Optimization for Finding Flat Minima: A Hessian-based Regularization Approach

    Hongyang R. Zhang, Dongyue Li, Zhenshuo Zhang · PDF
  58. On Efficient Distillation from LLMs to SLMs

    Metod Jazbec, Menglin Xia, Ankur Mallick, Daniel Madrigal, Dongge Han, Samuel Kessler, Victor Rühle · PDF
  59. On the Transferability of Parameter-Efficient Continual Learning for Vision Transformers

    Leon Ackermann, Van-Linh Nguyen · PDF
  60. One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation

    Fabian Paischer, Lukas Hauzenberger, Thomas Schmied, Benedikt Alkin, Marc Peter Deisenroth, Sepp Hochreiter · PDF
  61. Online Fine-Tuning with Uncertainty Quantification for Offline Pre-Trained Agents

    Ingook Jang, Seonghyun Kim, Samyeul Noh · PDF
  62. Optimizing Small Language Models for In-Vehicle Function-Calling

    YAHYA SOWTI KHIABANI, Farris Atif, Chieh Hsu, Sven Stahlmann, Tobias Michels, Sebastian Kramer, Benedikt Heidrich, M. Saquib Sarfraz, Julian Merten, Faezeh Tafazzoli · PDF
  63. PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences

    Daiwei Chen, Yi Chen, Aniket Rege, Ramya Korlakai Vinayak · PDF
  64. Parameter-Efficient Fine-Tuning of State Space Models

    Kevin Galim, Wonjun Kang, Yuchen Zeng, Hyung Il Koo, Kangwook Lee · PDF
  65. Parasite Networks: Transfer Learning in Resource-Constrained Domains

    Andrew Alini, Douglas E. Sturim, Kevin Brady, Pooya Khorrami · PDF
  66. REACT: Residual-Adaptive Contextual Tuning for Fast Model Adaptation in Cybersecurity

    Jiayun Zhang, Junshen Xu, Yi Fan · PDF
  67. RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates

    Md Kowsher, Tara Esmaeilbeig, Chun-Nam Yu, Mojtaba Soltanalian, Niloofar Yousefi · PDF
  68. Scalability of memorization-based machine unlearning

    Kairan Zhao, Peter Triantafillou · PDF
  69. Self-Stitching: Widely Applicable and Efficient Transfer Learning Using Stitching Layer

    Tanachai Anakewat, YUSUKE Mukuta, Thomas Westfechtel, Tatsuya Harada · PDF
  70. Semi-Supervised Fine-Tuning of Vision Foundation Models with Content-Style Decomposition

    Mariia Drozdova, Vitaliy Kinakh, Yury Belousov, Erica Lastufka, Slava Voloshynovskiy · PDF
  71. Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

    Heyang Zhao, Chenlu Ye, Quanquan Gu, Tong Zhang · PDF
  72. Simultaneous Weight and Architecture Optimization for Neural Networks

    Zitong Huang, Mansooreh Montazerin, Ajitesh Srivastava · PDF
  73. Skip Transformers: Efficient Inference through Skip-Routing

    Matthew Peroni, Dimitris Bertsimas · PDF
  74. SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors

    Vijay Lingam, Atula Tejaswi, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Alex Dimakis, Eunsol Choi, Aleksandar Bojchevski, Sujay Sanghavi · PDF
  75. Teaching LLMs How To Learn with Contextual Fine-Tuning

    Younwoo Choi, Muhammad Adil Asif, Ziwen Han, John Willes, Rahul Krishnan · PDF
  76. Token Pruning using a Lightweight Background Aware Vision Transformer

    Sudhakar Sah, Ravish Kumar, Honnesh Rohmetra, Ehsan Saboori · PDF
  77. TOU: Truncated-factorized reduction for an efficient-parameter model fine-tuning

    Phuong Thi-Mai Nguyen, Minh-Son Dao, Koji Zettsu · PDF
  78. Towards Long-Context Time Series Foundation Models With A Handful Of Additional Parameters

    Nina Żukowska, Mononito Goswami, Michał Wiliński, Willa Potosnak, Artur Dubrawski · PDF
  79. Towards Natural Machine Unlearning

    Zhengbao He, Tao Li, Xinwen Cheng, Zhehao Huang, Xiaolin Huang · PDF
  80. TreeTop: Topology-Aware Fine-Tuning for LLM Conversation Tree Understanding

    Jashn Arora, Rahul Madhavan, Karthikeyan Shanmugam, John Palowitch, Manish Jain · PDF
  81. Uncertainty-Penalized Direct Preference Optimization

    Sam Houliston, Alizée Pace, Alexander Immer, Gunnar Ratsch · PDF
  82. Understanding Visual Concepts Across Models

    Brandon Trabucco, Max A Gurinas, Kyle Doherty, Russ Salakhutdinov · PDF
  83. Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

    Noam Razin, Sadhika Malladi, Adithya Bhaskar, Danqi Chen, Sanjeev Arora, Boris Hanin · PDF
  84. UnoLoRA: Single Low-Rank Adaptation for Efficient Multitask Fine-tuning

    Akash Kamalesh, Anirudh Lakhotia, Nischal H S, Prerana Sanjay Kulkarni, Gowri Srinivasa · PDF
  85. Variational Best-of-N Alignment

    Afra Amini, Tim Vieira, Elliott Ash, Ryan Cotterell · PDF
  86. Variational Low-Rank Adaptation Using IVON

    Bai Cong, Nico Daheim, Yuesong Shen, Daniel Cremers, Rio Yokota, Mohammad Emtiyaz Khan, Thomas Möllenhoff · PDF
  87. What Causes a Disparate Impact in a Quantized Model?

    Abhimanyu Bellam, Jung-Eun Kim · PDF
  88. XoRA: Expander Adapted LoRA Finetuning

    Amaljith EV, Arindam Biswas, Suryam Arnav Kalra, Pabitra Mitra, BISWAJIT BASU · PDF