NeurIPS 2024 Past Efficiency

Workshop on Machine Learning and Compression, NeurIPS 2024

Compression Workshop @ NeurIPS 2024

Submission deadline
Oct 1, 2024, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (95)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Theory for Compressibility of Graph Transformers for Transductive Learning

    Hamed Shirzad, Honghao Lin, Ameya Velingker, Balaji Venkatachalam, David Woodruff, Danica J. Sutherland · PDF
  2. A Tighter Complexity Analysis of SparseGPT

    Xiaoyu Li, Yingyu Liang, Zhenmei Shi, Zhao Song · PDF
  3. Accelerating Memory-Efficient LLM Training and Fine-Tuning via Tracking the Gradient Subspace

    Sahar Rajabi, Sirisha Rambhatla · PDF
  4. Adapting Language Models via Token Translation

    Zhili Feng, Tanya Marwah, Lester Mackey, David Alvarez-Melis, Nicolo Fusi · PDF
  5. Adaptive Quantization and Pruning of Deep Neural Networks via Layer Importance Estimation

    Tushar Shinde · PDF
  6. AdaQuantLM: LLM Quantization with Adaptive Bit-Widths

    Shuangyi Chen, Ashish J Khisti · PDF
  7. An image to tailor: I-Frame Domain Adaptation in Neural Video Compression

    Alberto Presta, Gabriele Spadaro, Attilio Fiandrotti, Marco Grangetto · PDF
  8. An Information Theory of Compute-Optimal Size Scaling, Emergence, and Plateaus in Language Models

    Anuj K. Nayak, Lav R. Varshney · PDF
  9. Benchmarking neural lossless compression algorithms on multi-purpose astronomical image data

    Tuan Truong, Rithwik Sudharsan, Yibo Yang, Peter Xiangyuan Ma, Ruihan Yang, Stephan Mandt, Joshua S. Bloom · PDF
  10. BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models

    Xingyu Zheng, Xianglong Liu, Haotong Qin, Xudong Ma, Mingyuan Zhang, Haojie Hao, Jiakai Wang, Zixiang Zhao, Jinyang Guo, Michele Magno · PDF
  11. Breaking Smoothness: The Struggles of Neural Compressors with Discontinuous Mappings

    Ezgi Ozyilkan, Jona Ballé, Sourbh Bhadane, Aaron B. Wagner, Elza Erkip · PDF
  12. Bridging the Gap between Diffusion Models and Universal Quantization for Image Compression

    Lucas Relic, Roberto Azevedo, Yang Zhang, Markus Gross, Christopher Schroers · PDF
  13. CDQuant: Greedy Coordinate Descent for Accurate LLM Quantization

    Pranav Ajit Nair, Arun Suggala · PDF
  14. Communication Compression for Tensor Parallel LLM Inference

    Jan Hansen-Palmus, Michael Truong Le, Oliver Hausdörfer, Alok Verma · PDF
  15. Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging

    Ismail Erbas, Vikas Pandey, Aporva Amarnath, Naigang Wang, Karthik Swaminathan, Stefan T. Radev, Xavier Intes · PDF
  16. Conditional Hallucinations for Image Compression

    Till Aczel, Roger Wattenhofer · PDF
  17. Copula-based Estimation of Continuous Sources for a Class of Constrained Rate-Distortion Functions

    Giuseppe Serra, Photios A. Stavrou, Marios Kountouris · PDF
  18. Deep Clustering with Associative Memories

    Bishwajit Saha, Dmitry Krotov, Mohammed J Zaki, Parikshit Ram · PDF
  19. Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts

    Ashwinee Panda, Vatsal Baherwani, Zain Sarwar, Benjamin Thérien, Stephen Rawls, Sambit Sahu, Supriyo Chakraborty, Tom Goldstein · PDF
  20. Differentiable Attention

    Yancheng Wang, Dongfang Sun, Yingzhen Yang · PDF
  21. Diffusion Models With Learned Adaptive Noise

    Subham Sekhar Sahoo, Aaron Gokaslan, Christopher De Sa, Volodymyr Kuleshov · PDF
  22. Distillation of Discrete Diffusion through Dimensional Correlations

    Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi, Hiromi Wakaki, Yuki Mitsufuji · PDF
  23. Does Representation Matter? Exploring Intermediate Layers in Large Language Models

    Oscar Skean, Md Rifat Arefin, Ravid Shwartz-Ziv · PDF
  24. EAMQ: Environment-based Adaptive Model Quantization on Federated Reinforcement Learning

    YU CHENYUE · PDF
  25. Efficient and Robust Spike Ensemble Coding of Signals

    Anik Chattopadhyay, Arunava Banerjee · PDF
  26. Efficient Compression of Sparse Accelerator Data Using Implicit Neural Representations and Importance Sampling

    Xihaier Luo, Samuel Lurvey, Yi Huang, Yihui Ren, Jin Huang, Byung-Jun Yoon · PDF
  27. Efficient Model Compression Techniques with FishLeg

    Jamie McGowan, Wei Sheng Lai, Weibin Chen, Henry Aldridge, Jools Clarke, Jezabel R Garcia, Rui Xia, Yilei Liang, Guillaume Hennequin, Alberto Bernacchia · PDF
  28. Empirical Upper Bounds for Unstructured Sparsity in Compute-Efficient Language Modeling

    Esha Singh, Shane Bergsma, Nolan Simran Dey, Joel Hestness, Gavia Gray · PDF
  29. EXAQ: Exponent Aware Quantization For LLMs Acceleration

    Moran Shkolnik, Maxim Fishman, Brian Chmiel, Hilla Ben-Yaacov, Ron Banner, Kfir Yehuda Levy · PDF
  30. Exploiting Temporal Priors for Efficient Real-time Compression and Feedback of Wireless Channels

    Akshay Malhotra, Mohamed Salah Ibrahim, Keya Patani · PDF
  31. FinerCut: Finer-grained Interpretable Layer Pruning for Large Language Models

    Yang Zhang, Yawei Li, Xinpeng Wang, Qianli Shen, Barbara Plank, Bernd Bischl, Mina Rezaei, Kenji Kawaguchi · PDF
  32. Flexible image decoding in learned image compression

    Hossein Motamednia, Azadeh Mansouri, Fariba Saadati Monem, Ahmad Mahmoudi-Aznaveh · PDF
  33. Formalizing Limits of Knowledge Distillation Using Partial Information Decomposition

    Pasan Dissanayake, Faisal Hamman, Barproda Halder, Ilia Sucholutsky, Qiuyi Zhang, Sanghamitra Dutta · PDF
  34. Fused-Layer CNNs for Memory-Efficient Inference on Microcontrollers

    Mark Deutel, Frank Hannig, Christopher Mutschler, Jürgen Teich · PDF
  35. FV-NeRV: Neural Compression for Free Viewpoint Videos

    Takuya Fujihashi, Sorachi Kato, Toshiaki Koike-Akino · PDF
  36. Getting free Bits Back from Rotational Symmetries in LLMs

    Jiajun He, Gergely Flamich, José Miguel Hernández-Lobato · PDF
  37. Graph Transformation Augmentation for Contrastive Learning of Graph-Level Representation: An Initial Exploration

    Tianchao Li, Yulong Pei · PDF
  38. Grow to Compress? Efficient Training of Robust Networks on the Edge

    Vignesh Sundaresha, Naresh Shanbhag · PDF
  39. How Many Does It Take to Prune a Network: Comparing One-Shot vs. Iterative Pruning Regimes

    Tomasz Wojnar, Mikołaj Janusz, Luca Benini, Yawei Li, Kamil Adamczewski · PDF
  40. Improving Knowledge Distillation with Teacher's Explanation

    Sayantan Chowdhury, Ben Liang, Ali Tizghadam, Ilijc Albanese · PDF
  41. Information-theoretic Generalization Analysis for Vector-Quantized VAEs

    Futoshi Futami, Masahiro Fujisawa · PDF
  42. Integration of Large Vision Models in Driver Monitoring Systems: Compressing and Distilling for Real-Time Automotive Applications

    Georgios Markos Chatziloizos, Andrea Ancora, Andrew I. Comport, Barat Christian · PDF
  43. Interactions Across Blocks in Post-Training Quantization of Large Language Models

    Khasmamad Shabanovi, Lukas Wiest, Vladimir Golkov, Daniel Cremers, Thomas Pfeil · PDF
  44. Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations

    Kola Ayonrinde, Michael T Pearce, Lee Sharkey · PDF
  45. Large Language Model Compression with Neural Architecture Search

    Rhea Sanjay Sukthanker, Benedikt Staffler, Frank Hutter, Aaron Klein · PDF
  46. Latent Probabilistic Dataset Distillation with Theoretical Guarantees

    Progyan Das, Shrutimoy Das, Anirban Dasgupta · PDF
  47. Layer-Importance guided Adaptive Quantization for Efficient Speech Emotion Recognition

    Tushar Shinde, RITIKA JAIN, Avinash Kumar Sharma · PDF
  48. Layer-wise Quantization for Distributed Variational Inequalities

    Anh Duc Nguyen, Ilia Markov, Ali Ramezani-Kebrya, Kimon Antonakopoulos, Dan Alistarh, Volkan Cevher · PDF
  49. Learnable Fourier-based Activations for Implicit Signal Representations

    Parsa Mojarad Adi, Ali Mehrabian · PDF
  50. Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

    Niket Nikul Patel, Ravid Shwartz-Ziv · PDF
  51. LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization

    Rui Xie, Tianchen Zhao, Zhihang Yuan, Rui Wan, Wenxi Gao, Zhenhua Zhu, Xuefei Ning, Yu Wang · PDF
  52. LLM Vocabulary Compression for Low-Compute Environments

    Sreeram Vennam, Anish R Joishy, Ponnurangam Kumaraguru · PDF
  53. LORC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy

    Rongzhi Zhang, Kuan Wang, Liyuan Liu, Shuohang Wang, Hao Cheng, Chao Zhang, yelong shen · PDF
  54. Losslessly Compressible Neural Network Parameters

    Matthew Farrugia-Roberts · PDF
  55. LSH-E Tells You What To Discard: An Adaptive Locality-Sensitive Strategy for KV Cache Compression

    Tahseen Rabbani, Minghui Liu, Tony O'Halloran, Ananth Sankaralingam, Mary-Anne Hartley, Furong Huang · PDF
  56. M2M-TAG: Training-Free Many-to-Many Token Aggregation for Vision Transformer Acceleration

    Fanhu Zeng, Deli Yu · PDF
  57. Majority Kernels: An Approach to Leverage Big Model Dynamics for Efficient Small Model Training

    Hanna Mazzawi, Pranjal Awasthi, Javier Gonzalvo, Srikumar Ramalingam · PDF
  58. MAPLE: Memory-Aware Predict and Load for Efficient LLM Inference

    Zhenyu Liu, Zhemin Zhang, Zirui Zhang, Yanyuan Qin, Jiayi Luo, Zhenyu Gu, Liu Liu · PDF
  59. MCUCoder: Adaptive Bitrate Learned Video Compression for IoT Devices

    Ali Hojjat, Janek Haberer, Olaf Landsiedel · PDF
  60. Mind the Gap Between Synthetic and Real: Probing Transfer Capabilities of Stable Diffusion Images

    Leonhard Hennicke, Christian Medeiros Adriano, Holger Giese, Jan Mathias Koehler, Lukas Schott · PDF
  61. Neural Compression for Multispectral Satellite Images

    Woojin Cho, Steve Andreas Immanuel, Junhyuk Heo, Darongsae Kwon · PDF
  62. Neural Normalized Compression Distance and the Disconnect Between Compression and Classification

    John Hurwitz, Charles K. Nicholas, Edward Raff · PDF
  63. Non-interactive Remote Coordination

    Yassine Hamdi, Xueyan Niu, Bo Bai, Deniz Gunduz · PDF
  64. On the Relationship Between Model Training Dynamics and Early Pruning Periods

    Elvis Nunez, Stefano Soatto · PDF
  65. P-SpikeSSM: Harnessing Probabilistic Spiking State Space Models for Long-Range Dependency Tasks

    Malyaban Bal, Abhronil Sengupta · PDF
  66. Partially Frozen Random Networks Contain Compact Strong Lottery Tickets

    Hikari Otsuka, Daiki Chijiwa, Ángel López García-Arias, Yasuyuki Okoshi, Kazushi Kawamura, Thiem Van Chu, Daichi Fujiki, Susumu Takeuchi, Masato Motomura · PDF
  67. Perception Loss Function Adaptive to Rate for Learned Video Compression

    Sadaf Salehkalaibar, Buu Phan, João Atz Dick, Ashish J Khisti, Jun Chen, Wei Yu · PDF
  68. PerCo (SD): Open Perceptual Compression

    Nikolai Körber · PDF
  69. Polar Codes for Channel Simulation

    Sharang M. Sriramu, Rochelle Barsz, Elizabeth Polito, Aaron B. Wagner · PDF
  70. Prechastic Coding: An Alternative Approach to Neural Network Description Lengths

    Paris Dominic Louis Flood, Pietro Lio · PDF
  71. QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models

    Zhumazhan Balapanov, Vanessa Matvei, Olivia Holmberg, Edward Magongo, Kevin Zhu, Jonathan Pei · PDF
  72. Randomly Pivoted V-optimal Design: Fast Data Selection under Low Intrinsic Dimension

    Yijun Dong, Xiang Pan, Hoang Phan, Qi Lei · PDF
  73. Sample Compression Hypernetworks: From Generalization Bounds to Meta-Learning

    Benjamin Leblanc, Mathieu Bazinet, Nathaniel D'Amours, Alexandre Drouin, Pascal Germain · PDF
  74. Sample compression unleashed : New generalization bounds for real valued losses

    Mathieu Bazinet, Valentina Zantedeschi, Pascal Germain · PDF
  75. SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding

    Zhenglin Wang, Jialong Wu, Yilong Lai, Congzhi Zhang, Deyu Zhou · PDF
  76. Self-Data Distillation for Recovering Quality in Pruned Large Language Models

    Vithursan Thangarasa, Ganesh Venkatesh, Nish Sinnadurai, Sean Lie · PDF
  77. Shrinking the Size of Deep Extreme Multi-Label Classification

    Marco Bornstein, Tahseen Rabbani, Brian Joseph Gravelle, Furong Huang · PDF
  78. Simple LLM Compression Recovery Using Dynamic Prompting with Theoretical Analysis

    Duc N.M Hoang, Minsik Cho, Thomas Merth, Mohammad Rastegari, Zhangyang Wang · PDF
  79. SNeRV: Scalable Neural Representations for Video Coding

    Yiying Wei, Hadi Amirpour, Christian Timmerer · PDF
  80. SpikingVTG: Saliency Feedback Gating Enabled Spiking Video Temporal Grounding

    Malyaban Bal, Brian Matejek, Susmit Jha, Adam D. Cobb · PDF
  81. Sustainable AI: Efficient Pruning of Large Language Models in Resource-Limited Environments

    Ashhadul Islam, SAMIR BELHAOUARI, Amine Bermak · PDF
  82. TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

    Makoto Shing, Kou Misaki, Han Bao, Sho Yokoi, Takuya Akiba · PDF
  83. The Rate-Distortion-Perception Trade-Off with Algorithmic Realism

    Yassine Hamdi, Aaron B. Wagner, Deniz Gunduz · PDF
  84. The Trichromatic Strong Lottery Ticket Hypothesis: Neural Compression With Three Primary Supermasks

    Ángel López García-Arias, Yasuyuki Okoshi, Hikari Otsuka, Daiki Chijiwa, Yasuhiro Fujiwara, Susumu Takeuchi, Masato Motomura · PDF
  85. Towards Scalable Compression with Universally Quantized Diffusion Models

    Yibo Yang, Justus Will, Stephan Mandt · PDF
  86. Training Block-wise Sparse Models Using Kronecker Product Decomposition

    Ding Zhu, Zhiqun Zuo, Mohammad Mahdi Khalili · PDF
  87. Training-Free Visual Token Compression via Delayed Spatial Merging

    Jung Hwan Heo, Seyedarmin Azizi, Arash Fayyazi, Massoud Pedram · PDF
  88. Transformers Learn to Compress Variable-order Markov Chains in-Context

    Ruida Zhou, Chao Tian, Suhas Diggavi · PDF
  89. Unified Lookup Tables: Privacy-Preserving Foundation Models

    Nikita Janakarajan, Irina Espejo Morales, Marvin Alberts, Andrea Giovannini, Matteo Manica, Antonio Foncubierta-Rodríguez · PDF
  90. Unifying Subsampling Pattern Variations for Compressed Sensing MRI with Neural Operators

    Armeet Singh Jatyani, Jiayun Wang, Zihui Wu, Miguel Liu-Schiaffini, Bahareh Tolooshams, Anima Anandkumar · PDF
  91. Vector Quantization with Sorting Transformation

    Hongzhi Wang, Tanveer Syeda-mahmood · PDF
  92. VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression

    Yunkee Chae, Woosung Choi, Yuhta Takida, Junghyun Koo, Yukara Ikemiya, Zhi Zhong, Kin Wai Cheuk, Marco A. Martínez-Ramírez, Kyogu Lee, Wei-Hsiang Liao, Yuki Mitsufuji · PDF
  93. Wasserstein Distortion with Intrinsic $\sigma$-Maps

    Yang Qiu, Ziyuan Lin, Aaron B. Wagner · PDF
  94. Weight-Sharing Method for Upsampling Layer from Feature Embedding Recursive Block

    Jinwoo Hyun, YunKyong Hyon, Mira Lee, Sunju Lee, Taeyoung Ha, Young Rock Kim · PDF
  95. What Makes for Good Image Captions?

    Delong Chen, Samuel Cahyawijaya, Etsuko Ishii, Ho Shu Chan, Yejin Bang, Pascale Fung · PDF