ICML 2024 Past EfficiencyOptimization

2nd Workshop on Advancing Neural Network Training: Computational Efficiency, Scalability, and Resource Optimization (WANT@ICML 2024)

WANT@ICML 2024

Submission deadline
Jun 3, 2024, 12:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (42)

Fetched from OpenReview (v2) on 2026-06-10.

  1. Accelerating Best-of-N via Speculative Rejection

    Ruiqi Zhang, Momin Haider, Ming Yin, Jiahao Qiu, Mengdi Wang, Peter Bartlett, Andrea Zanette · PDF
  2. AdaMeM: Memory Efficient Momentum for Adafactor

    Nikhil Vyas, Depen Morwani, Sham M. Kakade · PDF
  3. Adaptive Model Pruning in Federated Learning through Loss Exploration

    Christian Internò, Elena Raponi, Niki van Stein, Thomas Bäck, Markus Olhofer, Yaochu Jin, Barbara Hammer · PDF
  4. Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies

    Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura · PDF
  5. An Analytical Approach to Enhancing DNN Efficiency and Accuracy Using Approximate Multiplication

    Salar Shakibhamedan, Anice Jahanjoo, Amin Aminifar, Nima Amirafshar, Nima TaheriNejad, Axel Jantsch · PDF
  6. Asynchronous Local-SGD Training for Language Modeling

    Bo Liu, Rachita Chhaparia, Arthur Douillard, Satyen Kale, Andrei Alex Rusu, Jiajun Shen, Arthur Szlam, MarcAurelio Ranzato · PDF
  7. Bayesian-LoRA: LoRA based Parameter Efficient Fine-Tuning using Optimal Quantization levels and Rank Values trough Differentiable Bayesian Gates

    Cristian Meo, Ksenia Sycheva, Anirudh Goyal, Justin Dauwels · PDF
  8. Boolean Logic for Low-Energy Deep Learning

    Van Minh Nguyen, Cristian Ocampo, Aymen Askri, Ba-Hien Tran · PDF
  9. Can LLMs Enhance Performance Prediction for Deep Learning Models?

    Karthick Panner Selvam, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Bryan Perozzi, Mats Brorsson · PDF
  10. Class-aware Initialization of Early Exits for Pre-training Large Language Models

    Alperen Gormez, Erdem Koyuncu · PDF
  11. Coarse-to-Fine Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition

    Hichem Sahbi · PDF
  12. Communication Efficient Federated Learning with Differentiated Aggregation

    Peyman Gholami, Hulya Seferoglu · PDF
  13. DASH: Warm-Starting Neural Network Training Without Loss of Plasticity Under Stationarity

    Baekrok Shin, Junsoo Oh, Hanseul Cho, Chulhee Yun · PDF
  14. DiLoCo: Distributed Low-Communication Training of Language Models

    Arthur Douillard, Qixuan Feng, Andrei Alex Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, MarcAurelio Ranzato, Arthur Szlam, Jiajun Shen · PDF
  15. DrJAX: Scalable and Differentiable MapReduce Primitives in JAX

    J Keith Rush, Zachary Charles, Zachary Garrett, Sean Augenstein, Nicole Elyse Mitchell · PDF
  16. ECO: Efficient Computational Optimization for Exact Machine Unlearning in Deep Neural Networks

    Yu-Ting Huang, Pei-Yuan Wu, Chuan-Ju Wang · PDF
  17. Effective Layer Pruning Through Similarity Metric Perspective

    Ian Pons, Bruno Yamamoto, Anna Helena Reali Costa, Artur Jordao · PDF
  18. Efficient Adaptive Federated Optimization

    Su Hyeong Lee, Sidharth Sharma, Manzil Zaheer, Tian Li · PDF
  19. Efficient Document Ranking with Learnable Late Interactions

    Himanshu Jain, Ziwei Ji, Ankit Singh Rawat, Andreas Veit, Sadeep Jayasumana, Sashank J. Reddi, Aditya Krishna Menon, Felix Yu · PDF
  20. Enhancing Fine-grained Multi-modal Alignment via Adapters: A Parameter-Efficient Training Framework for Referring Image Segmentation

    Zunnan Xu, Jiaqi Huang, Ting Liu, Yong Liu, Haonan Han, Kehong Yuan, Xiu Li · PDF
  21. Fisher-aware Quantization for DETR Detectors with Critical-category Objectives

    Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang · PDF
  22. Language Adaptation on a Tight Academic Compute Budget: Tokenizer Swapping Works and Pure bfloat16 Is Enough

    Konstantin Dobler, Gerard de Melo · PDF
  23. Liouna: Biologically Plausible Learning for Efficient Pre-Training of Transferrable Deep Models

    Fady Rezk, Antreas Antoniou, Henry Gouk, Timothy Hospedales · PDF
  24. LoQT: Low Rank Adapters for Quantized Training

    Sebastian Bugge Loeschcke, Mads Toftrup, Michael Kastoryano, Serge Belongie, Vésteinn Snæbjarnarson · PDF
  25. Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs

    Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal · PDF
  26. Lowering PyTorch's Memory Consumption for Selective Differentiation

    Samarth Bhatia, Felix Dangel · PDF
  27. Memory and Bandwidth are All You Need for Fully Sharded Data Parallel

    Jiangtao Wang, Jan Ebert, Oleg Filatov, Stefan Kesselheim · PDF
  28. Model-Agnostic Graph Dataset Compression with the Tree Mover’s Distance

    Mika Sarkin Jain, Stefanie Jegelka, Ishani Karmarkar, Luana Ruiz, Ellen Vitercik · PDF
  29. MoReDrop: Dropout without Dropping

    Li Jiang, Duo Li, Yichuan Ding, Xue Liu, Victor Wai Kin Chan · PDF
  30. Multi-objective Differentiable Neural Architecture Search

    Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler, Samuel Dooley, Josif Grabocka, Frank Hutter · PDF
  31. Optimistic Asynchrony Control: Achieving Synchronous Convergence With Asynchronous Throughput for Embedding Model Training

    Roger Waleffe, Jason Mohoney · PDF
  32. Resolving Discrepancies in Compute-Optimal Scaling of Language Models

    Tomer Porian, Mitchell Wortsman, Jenia Jitsev, Ludwig Schmidt, Yair Carmon · PDF
  33. Resource-constrained Neural Architecture Search on Language Models: A Case Study

    Andreas Paraskeva, Joao Pedro Reis, Suzan Verberne, Jan N. van Rijn · PDF
  34. SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models

    Zhaoxu Luo, Bowen Song, Liyue Shen · PDF
  35. Scalify: scale propagation for efficient low-precision LLM training

    Paul Balanca, Samuel Hosegood, Carlo Luschi, Andrew W Fitzgibbon · PDF
  36. Single Train Multi Deploy on Topology Search Spaces using Kshot-Hypernet

    Jingyue Zhuge, Christian Mayr, Anand Subramoney, David Kappel · PDF
  37. SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors

    Vijay Lingam, Atula Tejaswi Neerkaje, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Eunsol Choi, Alex Dimakis, Aleksandar Bojchevski, sujay sanghavi · PDF
  38. TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

    Zhengqing Yuan, Zhaoxu Li, Weiran Huang, Yanfang Ye, Lichao Sun · PDF
  39. Towards Efficient and Scalable Training of Differentially Private Deep Learning

    Sebastian Rodriguez Beltran, Marlon Tobaben, Niki Andreas Loppi, Antti Honkela · PDF
  40. u-μP: The Unit-Scaled Maximal Update Parametrization

    Charlie Blake, Constantin Eichenberg, Josef Dean, Lukas Balles, Luke Yuri Prince, Björn Deiseroth, Andres Felipe Cruz-Salinas, Carlo Luschi, Samuel Weinbach, Douglas Orr · PDF
  41. Variational Stochastic Gradient Descent for Deep Neural Networks

    Haotian Chen, Anna Kuzina, Babak Esmaeili, Jakub M. Tomczak · PDF
  42. Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity

    Wentao Guo, Jikai Long, Yimeng Zeng, Zirui Liu, Xinyu Yang, Yide Ran, Jacob R. Gardner, Osbert Bastani, Christopher De Sa, Xiaodong Yu, Beidi Chen, Zhaozhuo Xu · PDF