NeurIPS 2025 Past Math & reasoning

NeurIPS 2025 Workshop MLxOR: Mathematical Foundations and Operational Integration of Machine Learning for Uncertainty-Aware Decision-Making

NeurIPS 2025 Workshop MLxOR

Submission deadline
Sep 6, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (147)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Behavioral Model for Exploration vs. Exploitation: Theoretical Framework and Experimental Evidence

    Jingying Ding, Yifan Feng, Ying Rong · PDF
  2. A Covering Framework for Offline POMDPs Learning using Belief Space Metric

    Youheng Zhu, Yiping Lu · PDF
  3. A Deep Proactive Exploration Policy Based on Asymptotic Statistics for Asynchronous Q-Learning

    Xinbo Shi, Jinyang Jiang, Ruihan Zhou, Yijie Peng, Jing Dong · PDF
  4. A Dual Perspective on Decision Focused Learning

    Paula Rodriguez-Diaz, Kirk Bansak, Elisabeth Paulson · PDF
  5. A Near-Optimal Control Policy for Data-driven Assemble-to-order Systems

    Lun Yu, Zhixuan Cai, Zhaoran Wang, Tianhu Deng · PDF
  6. A Sharp Comparison of Prescriptive Analytic Frameworks for The Big Data Newsvendor Problem

    Zhen Qiao, Karthyek Murthy · PDF
  7. A Theoretical Framework for Auxiliary-Loss-Free Load-Balancing of Sparse Mixture-of-Experts in Large-Scale AI Models

    X.Y. Han, Yuan Zhong · PDF
  8. A Variance-Adaptive Lower Bound for Simulation Optimization in Continuous Space

    Jianzhong Du, L. Jeff Hong · PDF
  9. Accelerating Diffusion via Compressed Sensing: Applications to Imaging and Finance

    Zhengyi Guo, Jiatu Li, Wenpin Tang, David Yao · PDF
  10. Achieving $\widetilde{\mathcal O}(1/N)$ Optimality Gap in Weakly-Coupled Markov Decision Processes through Gaussian Approximation

    Chen YAN, Weina Wang, Lei Ying · PDF
  11. Achieving Exponential Asymptotic Optimality in Average-Reward Restless Bandits without Global Attractor Assumption

    Yige Hong, Qiaomin Xie, Yudong Chen, Weina Wang · PDF
  12. Achieving First-Order Statistical Improvements in Data-Driven Optimization

    Henry Lam, Tianyu Wang · PDF
  13. Active Learning for Stochastic Contextual Linear Bandits

    Emma Brunskill, Ishani Karmarkar, Zhaoqi Li · PDF
  14. Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing

    Davin Choo, Yuqi Pan, Tonghan Wang, Milind Tambe, Alastair van Heerden, Cheryl Johnson · PDF
  15. Adaptive Resolving Methods for Reinforcement Learning with Function Approximations

    Jiashuo Jiang, Yiming Zong, Yinyu Ye · PDF
  16. Admissibility of Completely Randomized Trials: A Large-Deviation Approach

    Guido Imbens, Chao Qin, Stefan Wager · PDF
  17. Algorithmic Aspects of Strategic Trading

    Michael Kearns, Mirah Shi · PDF
  18. Almost Sure Convergence of Nonlinear Stochastic Approximation Under General Moment Conditions

    Anh Duc Nguyen, Quang Nguyen, Hoang H Nguyen, Siva Theja Maguluri · PDF
  19. Autoregressive Learning under Joint KL Analysis: Horizon-Free Approximation and Computational-Statistical Tradeoffs

    Yunbei Xu, Yuzhe Yuan, Ruohan Zhan · PDF
  20. Batch-Adaptive Annotations for Causal Inference with Text-Based Outcomes

    Ezinne Nwankwo, Lauri Goldkind, Angela Zhou · PDF
  21. Bayesian Optimization using Partially Observable Gaussian Process Network

    Saksham Kiroriwal, Julius Pfrommer, Jürgen Beyerer · PDF
  22. Bayesian Surrogates for Risk-Aware Pre-Assessment of Aging Bridge Portfolios

    Sophia V. Kuhn, Rafael Bischof, Marius Weber, Antoine Binggeli, Michael Kraus, Walter Kaufmann, Fernando Perez-Cruz · PDF
  23. Belief-Aware Inventory Control with Deep Mixture Models

    Moritz Beck, Anh-Duy Pham · PDF
  24. Bellman Optimality of Average-Reward Robust Markov Decision Processes with a Constant Gain

    Shengbo Wang, Nian Si · PDF
  25. Beyond First-Order: Training LLMs with Stochastic Conjugate Subgradient and AdamW

    Di Zhang, Yihang Zhang, Suvrajeet Sen · PDF
  26. Blessings of many good arms in multi-objective linear bandits

    Heesang Ann, Min-hwan Oh · PDF
  27. Can Linear Probes Measure LLM Uncertainty ?

    Ramzi Dakhmouche, Adrien Letellier, Hossein Gorji · PDF
  28. Chance-constrained Flow Matching for High-Fidelity Constraint-aware Generation

    Jinhao Liang, Sandeep Madireddy, Ferdinando Fioretto · PDF
  29. Compound Poisson Limits in Weighted Bernoulli Congestion Games: Theory Meets Experiments

    Ian Mac Kenney, Javiera Barrera, Roberto Cominetti · PDF
  30. Conformal Tail Risk Control for Large Language Model Alignment

    Catherine Chen, Jingyan Shen, Zhun Deng, Lihua Lei · PDF
  31. Conformalized Decision Risk Assessment

    Wenbin Zhou, Agni Orfanoudaki, Shixiang Zhu · PDF
  32. Confounding-Robust Fitted-Q-Iteration under Observed Markovian Marginals

    David Bruns-Smith, Angela Zhou · PDF
  33. Contextual Bandits for Large-Scale Structured Discrete Constrained Optimization Problems

    Pavithra Harsha, Chitra K Subramanian, Naoki Abe, Shivaram Subramanian, Amadou Ba, Kevin Arturo Fernandez Roman, Mauricio Longinos Garrido, Miao Liu, Aurelie C. Lozano, Chandrasekhar Narayanaswami · PDF
  34. Contextual Budget Bandit for Food Rescue Volunteer Engagement

    Ariana Tang, Naveen Janaki Raman, Fei Fang, Zheyuan Ryan Shi · PDF
  35. Contextual Optimization Under Model Misspecification: A Tractable and Generalizable Approach

    Omar Bennouna, Jiawei Zhang, Saurabh Amin, Asuman E. Ozdaglar · PDF
  36. Contextual Pricing with Heterogeneous Buyers

    Thodoris Lykouris, Sloan Nietert, Princewill Okoroafor, Chara Podimata, Julian Zimmert · PDF
  37. Contextual Value Iteration and Deep Approximation for Bayesian Contextual Bandits

    Kevin Duijndam, Ger Koole, Rob van der Mei · PDF
  38. DAOpt: Modeling and Evaluation of Data-Driven Optimization under Uncertainty with LLMs

    Wenzhuo Zhu, Zheng Cui, Wenhan Lu, Sheng Liu, Yue Zhao · PDF
  39. Data to Dose: Efficient Synthetic Data Generation with Expert Guidance for Personalized Dosing

    H. Satyam Verma, Holly Wiberg, Shixiang Zhu, Sridhar Tayur · PDF
  40. Data-driven generative simulation of SDEs using diffusion models

    Xuefeng Gao, Jiale Zha, XUNYU ZHOU · PDF
  41. Data-Driven Sequential Search

    David Brown, Cagin Uru · PDF
  42. Data-Driven Stochastic Modeling Using Autoregressive Sequence Models: Translating Event Tables to Queueing Dynamics

    Daksh Mittal, Shunri Zheng, Jing Dong, Hongseok Namkoong · PDF
  43. Decision Focused Scenario Generation for Contextual Two-Stage Stochastic Linear Programming

    Jonathan Hornewall, Solène Delannoy-Pavy, Vincent Leclère, Tito Homem-De-Mello · PDF
  44. Decision-Focused Sequential Experimental Design: A Directional Uncertainty-Guided Approach

    Beichen Wan, Mo Liu, Paul Grigas, Zuo-Jun Shen · PDF
  45. Deep Learning for Solving Linear Integral Equations Associated with Markov Chains

    Yanlin Qu, Jose Blanchet, Peter Glynn · PDF
  46. Deep Learning-Driven Contextual Stochastic Optimization for Real-Time Order Fulfillment

    Tinghan Ye, Shuaicheng Tong, Changkun Guan, Beste Basciftci, Pascal Van Hentenryck · PDF
  47. DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management

    Yaqi Xie, Xinru Hao, Jiaxi Liu, Will Ma, Linwei Xin, Lei Cao, Yidong Zhang · PDF
  48. Differentiable Optimization for Deep Learning-Enhanced DC Approximation of AC Optimal Power Flow

    Andrew W. Rosemberg, Michael Klamkin, Pascal Van Hentenryck · PDF
  49. Diffusion Generative Models meet Differential Privacy: A Theoretical Insight

    Ziyu Huang, Wenpin Tang · PDF
  50. Diffusion Models for Adapted Sequential Data Generation

    Haoyang Cao, Minshuo Chen, Yinbin Han, Renyuan Xu · PDF
  51. Distributionally Robust Multimodal Machine Learning

    Peilin Yang, Yu Ma · PDF
  52. Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces

    Linglingzhi Zhu, Yunqin Zhu, Yao Xie · PDF
  53. Distributionally Robust Regularization of Sparse Integer Programming Trained Learning Models

    Sanjeeb Dash, Soumyadip Ghosh, Joao Goncalves, Mark S. Squillante · PDF
  54. Dynamically Augmented CVaR for MDPs and Uncertainty Quantifications for Robust MDPs Characterizing Risk

    Eugene A. Feinberg, Rui Ding · PDF
  55. Efficient Rashomon Set Approximation for Decision Trees

    Zakk Heile, Varun Babbar, Hayden McTavish, Cynthia Rudin · PDF
  56. End-to-End Learning for Information Gathering

    Rares C Cristian, Pavithra Harsha, Georgia Perakis, Brian Quanz · PDF
  57. Ensuring Fairness in Priority-Based Admissions with Uncertain Scores

    Zhiqiang Zhang, Pengyi Shi, Amy R Ward · PDF
  58. Estimate to Decide: Matrix Completion driven Smoothed Online Quadratic Optimization

    Neelkamal Bhuyan, Debankur Mukherjee, Adam Wierman · PDF
  59. Estimation of Treatment Effects under Nonstationarity via the Truncated Policy Gradient Estimator

    Ramesh Johari, Tianyi Peng, Wenqian Xing · PDF
  60. Everyone Contributes! Incentivizing Strategic Cooperation in Multi-LLM Systems via Sequential Public Goods Games

    Yunhao Liang, Yuan Qu, Jingyuan Yang, Shaochong Lin, Zuo-Jun Shen · PDF
  61. Exploration via Feature Perturbation in Contextual Bandits

    Seouh-won Yi, Min-hwan Oh · PDF
  62. Fairness Is More Than Algorithms: Racial Disparities in Time-to-Recidivism

    Jessy Xinyi Han, Kristjan Greenewald, Devavrat Shah · PDF
  63. FairSVM: A Mixed-Integer Programming Framework for Fairness-Constrained Support Vector Machines

    Gabriele Marchesi, Francesca Maggioni, Bismark Singh · PDF
  64. Fast Variability Approximation: Speeding up Divergence-Based Distributionally Robust Optimization via Directed Perturbation

    Henry Lam, Mohamed Lakhnichi · PDF
  65. Federated Calculation of the Transportation Barycenter by a Dual Subgradient Method

    Zhengqi Lin, Andrzej Ruszczynski · PDF
  66. Fine-Grained Prototype-Based Interpretability for Operational Text Classification

    Bowen Wei, Jinhao Pan, Ziwei Zhu · PDF
  67. Finite-Time Minimax Bounds in Queueing Control

    Yujie Liu, Vincent Y. F. Tan, Yunbei Xu · PDF
  68. Flow-based Conformal Prediction for Multi-dimensional Time Series

    Junghwan Lee, Chen Xu, Yao Xie · PDF
  69. FlowGINO: Continuous Reconstruction from Sparse Observations along with Aleatoric and Epistemic Uncertainty Estimation

    Alif Bin Abdul Qayyum, Byung-Jun Yoon · PDF
  70. Follow-the-Perturbed-Leader for Decoupled Bandits: Best-of-Both-Worlds and Practicality

    Chaiwon Kim, Jongyeong Lee, Min-hwan Oh · PDF
  71. From Stacked Predictions to Decisions: A Contextual Optimization Approach

    Yanru Guo, Ruiwei Jiang, Siqian Shen · PDF
  72. Gala: Global LLM Agents for Text-to-Model Translation

    Junyang Cai, Serdar Kadioglu, Bistra Dilkina · PDF
  73. Geometric Data Valuation via Leverage Scores

    Rodrigo Mendoza Smith · PDF
  74. Heterogeneous Treatment Effects in Panel Data

    Retsef Levi, Elisabeth Paulson, Georgia Perakis, Emily Yi Zhang · PDF
  75. Hierarchical Implicit/Explicit Feedback Recommender System

    Kody J. H. Law · PDF
  76. Human-AI Interaction in Product Recommendation

    Jing Dong, Prakirt Jhunjhunwala, Yash Kanoria · PDF
  77. Human-Centric Perishable Inventory Management with AI-Assistance

    Yu Nu, Meng Qi, Karan Girotra, Elena Belavina · PDF
  78. Instance-dependent Sample Complexity for Bilinear Saddle-Point Optimization with Noisy Feedback: An LP-Based Approach

    Jiashuo Jiang · PDF
  79. Integrating qualitative data into transit service design: a stochastic estimate-then-optimize approach

    Alexandre Jacquillat, Shriya Karam · PDF
  80. Joint Pricing and Resource Allocation: An Optimal Online-Learning Approach

    Jianyu Xu, Xuan Wang, Yu-Xiang Wang, Jiashuo Jiang · PDF
  81. k-SVD with Gradient Descent

    Yassir Jedra, Devavrat Shah · PDF
  82. Landmark-Based Node Representations for Shortest Path Distance Approximations in Random Graphs

    My Le, Luana Ruiz, Souvik Dhara · PDF
  83. Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action

    Xin Chen, Yifan Hu, Minda Zhao · PDF
  84. Learning Fair And Effective Points-Based Rewards Programs

    Chamsi Hssaine, Yichun Hu, Ciara Pike-Burke · PDF
  85. Learning from a Biased Sample

    Roshni Sahoo, Lihua Lei, Stefan Wager · PDF
  86. Learning to Handle Constraints in Routing Problems via a Construct-and-Refine Framework

    Jieyi Bi, Zhiguang Cao, Jianan Zhou, Wen Song, Yaoxin Wu, Jie Zhang, Yining Ma, Cathy Wu · PDF
  87. Learning to Optimize at Scale: A Benders Decomposition-TransfORmers Framework for Stochastic Combinatorial Optimization

    Seung Jin Choi, Kimiya Jozani, Joshua F. Cooper, I Esra Buyuktahtakin · PDF
  88. Learning to Select and Rank from Choice-Based Feedback: A Simple Nested Approach

    Junwen Yang, Yifan Feng · PDF
  89. LISTEN to Your Preferences: An LLM Framework for Multi-Objective Selection

    Adam Jovine, Tinghan Ye, David Shmoys, Peter I. Frazier · PDF
  90. Lyapunov-Based Sample Complexity Analysis for Weakly-Coupled MDPs

    Tianhao Wu, Matthew Zurek, Yudong Chen, Weina Wang, Qiaomin Xie · PDF
  91. Measuring Informativeness Gap of (Mis)Calibrated Predictors

    Yiding Feng, Wei Tang · PDF
  92. Mechanistic Interpretability for Neural TSP Solvers

    Reuben Narad, Léonard Boussioux, Michael Wagner · PDF
  93. Mechanistic Modeling of Social Conditions in Disease-Prediction Simulations via Copula-Informed Probabilistic Graphical Models: HIV Case Study

    Amir Khosheghbal, Peter Haas, Chaitra Gopalappa · PDF
  94. MINTS: Minimalist Thompson Sampling

    Kaizheng Wang · PDF
  95. Mixed Integer Programming for Change-point Detection

    Apoorva Narula, Yao Xie, Santanu Dey · PDF
  96. Model-Free Assessment of Simulator Fidelity via Quantile Curves

    Yu-Shiou Willy Lin, Garud Iyengar, Kaizheng Wang · PDF
  97. Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards

    Wenlong Ji, Yihan Pan, Ruihao Zhu, Lihua Lei · PDF
  98. Near-Optimal Real-Time Personalization with Simple Transformers

    Lin An, Andrew A Li, Vaisnavi Nemala, Gabriel Visotsky · PDF
  99. Neural Decision Rule for Constrained Contextual Stochastic Optimization

    Zhangyi Liu, Zhongling Xu, Feng Liu, Rui Gao, Shuang Li · PDF
  100. Non‑Asymptotic Guarantees for Average‑Reward Q‑Learning with Adaptive Stepsizes

    Zaiwei Chen · PDF
  101. Offline Contextual Bandits with Covariate Shift

    Yingying Fan, Yuxuan Han, Jinchi Lv, Xiaocong Xu, Zhengyuan Zhou · PDF
  102. Offline Dynamic Pricing under Covariate Shift and Local Differential Privacy via Twofold Pessimism

    Jongmin Mun, Xiaocong Xu, Yingying Fan · PDF
  103. Online Decision Making with Generative Action Sets

    Jianyu Xu, Vidhi Jain, Bryan Wilder, Aarti Singh · PDF
  104. Online Learning for Dynamic Service Mode Control

    Wenqian Xing, Yue Hu, Anand Kalvit, Vahid Sarhangian · PDF
  105. Online Statistical Inference of Constrained Stochastic Optimization via Random Scaling

    Xinchen Du, Wanrong Zhu, Wei Biao Wu, Sen Na · PDF
  106. Optimality of Linear Policies in Distributionally Robust Linear Quadratic Control

    Bahar Taskesen, Dan Andrei Iancu, Çağıl Koçyiğit, Daniel Kuhn · PDF
  107. Optimization-Driven XGBoost-PINN Framework for Building Temperature Prediction

    Tushar Shinde, Rohan Saha · PDF
  108. Optimizing LLM Inference: Fluid-Based Online Scheduling under Memory Constraints

    Ruicheng Ao, Gan Luo, David Simchi-Levi, Xinshang Wang · PDF
  109. Overfitting in Adaptive Robust Optimization

    Karl Zhu, Dimitris Bertsimas · PDF
  110. Perishable Online Inventory Control with Context-Aware Demand Distributions

    Yuxiao Wen, Jingkai Huang, Weihua ZHOU, Zhengyuan Zhou · PDF
  111. Plan for the Worst With Advice: Advice-Augmented Robust Markov Decision Processes

    Tinashe Handina, Kishan Panaganti, Eric Mazumdar, Adam Wierman · PDF
  112. Policy Gradient Optimization for Markov Decision Processes with Epistemic Uncertainty and General Loss Functions

    Xiaoshuang Wang, Yifan Lin, Enlu Zhou · PDF
  113. Post-Estimation Adjustments in Data-Driven Decision-Making with Applications in Pricing

    Michael Albert, Max Biggs, Ningyuan Chen, Guan Wang · PDF
  114. Prediction-Driven Staffing for Emergency Departments: What to Predict and How to Predict

    Lin Feng, Jing Dong · PDF
  115. Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options

    Joongkyu Lee, Seouh-won Yi, Min-hwan Oh · PDF
  116. Probabilistic Soundness Guarantees in LLM Reasoning Chains

    Weiqiu You, Anton Xue, Shreya Havaldar, Delip Rao, Helen Jin, Chris Callison-Burch, Eric Wong · PDF
  117. Provable Reinforcement Learning from Human Feedback with an Unknown Link Function

    Qining Zhang, Lei Ying · PDF
  118. Pure Exploration via Frank--Wolfe Self-Play

    Xinyu Liu, Chao Qin, Wei You · PDF
  119. Q-learning with Posterior Sampling

    Priyank Agrawal, Shipra Agrawal, Azmat Azati · PDF
  120. Quantifying policy uncertainty in generative flow networks with uncertain rewards

    Ramón Dineth Nartallo-Kaluarachchi, Robert Manson-Sawko, Shashanka Ubaru, Dongsung Huh, Malgorzata J. Zimon, Lior Horesh · PDF
  121. Rebalancing and Clearance Pricing of Near-Expiry Inventory in Online Grocery Retail

    Ziyuan Zhou, LONG HE, Zhengling Qi, Guangrui Ma, Yuanbo Peng, Yushu Yao, Zhongyi Zha, Mingyong Zhao, Zhenyu Zhuo · PDF
  122. Reducing Contextual Stochastic Bilevel Optimization via Structured Function Approximation

    Maxime Bouscary, Jiawei Zhang, Saurabh Amin · PDF
  123. Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management

    Huiling Meng, Ningyuan Chen, Xuefeng Gao · PDF
  124. Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate

    Yifan Lin, Yuhao Wang, Enlu Zhou · PDF
  125. Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems

    Jongyeong Lee, Junya Honda, Shinji Ito, Min-hwan Oh · PDF
  126. RiskPO: Risk-based Policy Optimization with Verifiable Reward for LLM Post-Training

    Tao Ren, Jinyang Jiang, Hui Yang, Wan Tian, Yijie Peng · PDF
  127. Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization

    Cheng Tang, Zhishuai Liu, Pan Xu · PDF
  128. Robust Strategic Classification under Decision-Dependent Cost Uncertainty

    Sura Alhanouti, Guzin Bayraksan, Parinaz Naghizadeh · PDF
  129. Safe Start: Configuring Optimization Algorithms for Decision-Making under Extreme Risks

    Wasin Meesena, Henry Lam · PDF
  130. Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction

    Yiting He, Zhishuai Liu, Weixin Wang, Pan Xu · PDF
  131. Scalable First-order Method for Certifying Optimal k-Sparse GLMs

    Jiachang Liu, Soroosh Shafiee, Andrea Lodi · PDF
  132. Selective Cost-Aware Random Forests for Unreliable Data

    Sarwesh Rauniyar · PDF
  133. Self-Normalized Resets for Plasticity in Continual Learning

    Adam Daniel Jozefiak, Vivek Farias · PDF
  134. SOCRATES: Simulation Optimization with Correlated Replicas and Adaptive Trajectory Evaluations

    Haoting Zhang, Haoxian Chen, Donglin Zhan, Hanyang Zhao, Henry Lam, Wenpin Tang, David Yao, Zeyu Zheng · PDF
  135. SOLID: a Framework of Synergizing Optimization and LLMs for Intelligent Decision-Making

    Yinsheng Wang, Tario G You, Léonard Boussioux, Shan Liu · PDF
  136. Statistical Properties of Robust Optimization under Distribution Shifts

    Zhiyi Li, Xiaojie Mao, Yunbei Xu, Ruohan Zhan · PDF
  137. Structure-Informed Deep Reinforcement Learning for Inventory Management

    Alvaro Maggiar, Sohrab Andaz, Akhil Bagaria, Carson Eisenach, Dean Foster, Omer Gottesman, Dominique Perrault-Joncas · PDF
  138. Structured Difference-of-Q via Orthogonal Learning

    Defu Cao, Angela Zhou · PDF
  139. Tail-Optimized Caching for LLM Inference

    Wenxin Zhang, Yueying Li, Ciamac C. Moallemi, Tianyi Peng · PDF
  140. The Oversight Game: Learning AI Control and Corrigibility in Markov Games

    William Overman, Mohsen Bayati · PDF
  141. Towards Efficient Foundation Model: A Novel Time Series Embedding

    Jessy Xinyi Han, Arth Dharaskar, Nathaniel Lanier, Abdullah Omar Alomar, Aditya Agrawal, Angela Yuan, Jocelyn Hsieh, Ishan Shah, Muhammad Jehangir Amjad, Devavrat Shah · PDF
  142. Training Deep-Parametric Policies Using Lagrangian Duality

    Andrew W. Rosemberg, Alexandre Street, Davi Michel Valladao, Pascal Van Hentenryck · PDF
  143. Transformer-Based Next-Step Prediction for Queue Length Distribution

    Jieqi Di, Jiecheng Lu, Runhua Wu, Yuwei Zhou · PDF
  144. Uncertainty Estimation using Variance-Gated Distributions

    Martin Gillis, Isaac Xu, Thomas Trappenberg · PDF
  145. Understanding Scaling Laws via Neural Feature Learning Dynamics

    Zihan Yao, Ruoyu Wu, Tianxiang Gao · PDF
  146. Variational Generative Modeling of Stochastic Point Processes

    Xinlong Du, Harsha Honnappa, Vinayak Rao · PDF
  147. Who Should Do What? Adaptive Delegation in Human-AI Collaboration

    Wei Gu, Michael Lingzhi Li, Shixiang Zhu · PDF