NeurIPS 2025 Past ML systems

Machine Learning for Systems 2025

MLForSys2025

Submission deadline
Aug 30, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (41)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Data-driven ML Approach for Maximizing Performance in LLM-Adapter Serving

    Ferran Agullo, Joan Oliveras Torra, Chen Wang, Alberto Gutierrez-Torre, Olivier Tardieu, Alaa Youssef, Jordi Torres, Josep Lluis Berral · PDF
  2. A Joint Learning Approach to Hardware Caching and Prefetching

    · PDF
  3. Advancing Routing-Awareness in Analog ICs Floorplanning

    · PDF
  4. Adversarial Query Synthesis via Bayesian Optimization

    · PDF
  5. Agentic Bridge Framework: Closing the Gap Between Agentic Capability and Performance Benchmarks

    · PDF
  6. An Early Exploration of Deep-Learning-Driven Prefetching for Far Memory

    · PDF
  7. An Expert in Residence: LLM Agents for Always-On Operating System Tuning

    · PDF
  8. APCE: Adaptive Progressive Context Expansion for Long Context Processing

    · PDF
  9. ASAP: an Agentic Solution to Auto-optimize Performance of Large-Scale LLM Training

    · PDF
  10. Attention-Informed Surrogates for Navigating Power-Performance Trade-offs in HPC

    · PDF
  11. Automated Multi-Agent Workflows for RTL Design

    · PDF
  12. Carbon-Aware RL-LLM Control for Energy-Efficient Liquid-Cooled HPC Data Centers

    · PDF
  13. DataSwift: Smart Choices for Safe Query Optimization

    · PDF
  14. Forecasting machine degradation of GPU Clusters

    Shengnan Cai, Shuxin Nie, Zhehui Chen, Nupur Gulalkari, George Vanica, Chetna Jain, Sethuraman Sankaran · PDF
  15. GraphFaaS: Serverless GNN Inference for Burst-Resilient, Real-Time Intrusion Detection

    · PDF
  16. How Should We Evaluate Data Deletion in Graph-Based ANN Indexes?

    · PDF
  17. InfraGym: Empowering LLM Agents for Real-World Computer System Optimization

    · PDF
  18. Learning to Shard: RL for Co-optimizing the Parallelism Degrees and Per-operator Sharding Dimensions in Distributed LLM Inference

    · PDF
  19. Leveraging Large Language Models to Enhance Machine-Learning-Driven HPC Job Scheduling

    · PDF
  20. LLM-Box : An Agentic Framework for Guided Black-Box Optimization in Mapping LLMs onto Specialized Hardware Accelerators

    · PDF
  21. LLM-Guided Autoscheduling for Large-Scale Sparse Machine Learning

    · PDF
  22. LLMVisor: A Real-Time Latency Attribution Model for Multi-Tenant LLM Serving

    · PDF
  23. Mind the Gap: Time-of-Check to Time-of-Use Vulnerabilities in LLM-Enabled Agents

    Derek Lilienthal, Sanghyun Hong · PDF
  24. ML-Guided Cold Plate Design and Thermal Analysis for Liquid-Cooled HPC Servers

    · PDF
  25. MoE-GPS: Guidlines for Prediction Strategy with Expert Duplication in MoE Load Balancing

    · PDF
  26. MXNorm: Reusing block scales for efficient tensor normalisation

    · PDF
  27. NetGent : Agent-Based Automation of Network Application Workflows

    · PDF
  28. NeuSym-HLS: Learning-Driven Symbolic Distillation in High-Level Synthesis of Hardware Accelerators

    Chung-Mou Pan, Salma Elmalaki, Yasser Shoukry, Sitao Huang · PDF
  29. Optimized Learned Count-Min Sketch

    · PDF
  30. OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization

    Advait Gadhikar, Riccardo Grazzi, James Hensman · PDF
  31. PORT: Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving

    Fangzhou Wu, Sandeep Silwal · PDF
  32. QAQ: Query-adaptive Mixed-precision Quantization for Large Language Models

    · PDF
  33. Retrieval on Verilog Repositories: A Knowledge-Graph Based Solution

    · PDF
  34. Small Language Models as Compiler Experts: Auto-Parallelization for Heterogeneous Systems

    Prathamesh Devadiga · PDF
  35. Small, Fast, and Certain: Developing a Specialized Verilog Code Completion Solution for the Enterprise

    · PDF
  36. Sustainable Control of Geo-Distributed Datacenters by Distilling Numerical Experts into Adaptive LLM Agents

    · PDF
  37. SwizzlePerf: Hardware-Aware LLMs for GPU Kernel Performance Optimization

    · PDF
  38. Towards Agentic OS: An LLM Agent Framework for Linux Schedulers

    YUSHENG ZHENG, YanPeng Hu, Wei Zhang, Andi Quinn · PDF
  39. Towards Automatically Optimizing Retrieval Augmented AI Systems

    · PDF
  40. Ultra-Efficient Decoding for End-to-End Neural Compression and Reconstruction

    · PDF
  41. When to Reason: Semantic Router for vLLM

    Chen Wang, Xunzhuo Liu, Yuhan Liu, Yue Zhu, Xiangxi Mo, Junchen Jiang, Huamin Chen · PDF