ICLR 2026 Past Math & reasoning

Workshop on Latent & Implicit Thinking – Going Beyond CoT Reasoning

LIT Workshop @ ICLR 2026

Submission deadline
Feb 9, 2026, 12:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (78)

Fetched from OpenReview (v2) on 2026-06-10.

  1. ActivationReasoning: Logical Reasoning in Latent Activation Spaces

    Lukas Helff, Ruben Härle, Wolfgang Stammer, Felix Friedrich, Manuel Brack, Antonia Wüst, Hikaru Shindo, Patrick Schramowski, Kristian Kersting · PDF
  2. Adaptive Loops and Memory in Transformers: Think Harder or Know More?

    Markus Frey, Behzad Shomali, Ali Hamza Bashir, David Berghaus, Joachim Koehler, Mehdi Ali · PDF
  3. All Roads Lead to Rome: Distilling Verifiable Reasoning via Shared Decision Pivots

    Dongkyu Cho, Amy B.Z. Zhang, Bilel Fehri, Sheng Wang, Rumi Chunara, Hengrui Cai, Rui Song · PDF
  4. Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory

    Usman Anwar, Tim Bakker, Dana Kianfar, Cristina Pinneri, Christos Louizos · PDF
  5. Are Latent Reasoning Models Easily Interpretable?

    Connor Dilgren, Sarah Wiegreffe · PDF
  6. Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling

    Ivan Rodkin, Daniil Orel, Konstantin Smirnov, Arman Bolatov, Bilal Elbouardi, Besher Hassan, Yuri Kuratov, Aydar Bulatov, Preslav Nakov, Timothy Baldwin, Artem Shelmanov, Mikhail Burtsev · PDF
  7. Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge

    Xutao Ma, Yixiao Huang, Hanlin Zhu, Somayeh Sojoudi · PDF
  8. Bypassing the Rationale: Causal Auditing of Implicit Reasoning in Language Models

    Anish Sathyanarayanan, Aditya Nagarsekar, Aarush Rathore · PDF
  9. Can the Future Inform the Present? Investigating Latent Lookahead Refinement via Multi-Token Prediction

    Somesh Mehra, Alejandro Hernández-Cano, Martin Jaggi · PDF
  10. ConFu: Contemplate the Future for Better Speculative Sampling

    Zongyue Qin, Raghavv Goel, Risheek Garrepalli, Mukul Gagrani, Mingu Lee, Yizhou Sun · PDF
  11. Cross-Layer Clustering for Stochastic Parameter Decomposition

    Saman Seshadri, Jack Digilov, Sean Esla, Nathan Zixia Hu, Michael Ivanitskiy, Pablo Bernabeu-Perez · PDF
  12. Debugging code world models

    Babak Rahmani · PDF
  13. Denoising is not the End: Discrete Diffusion Language Models with Self-Correction

    Jinwei Zhang, Dimitri von Rütte, Yuhui Ding, Thomas Hofmann · PDF
  14. Discovering Interpretable Algorithms by Decompiling Transformers to RASP

    Xinting Huang, Aleksandra Bakalova, Satwik Bhattamishra, William Merrill, Michael Hahn · PDF
  15. Do Depth-Grown Models Overcome the Curse of Depth? An In-Depth Analysis

    Ferdinand Kapl, Emmanouil Angelis, Tobias Höppe, Kaitlin Maile, Johannes von Oswald, Nino Scherrer, Stefan Bauer · PDF
  16. Dual-Channel Steering: Combining Explicit Prompting and Implicit Parameter Modulation for Reasoning Diversity

    Takahito Tanimura, Kotaro Furuya · PDF
  17. Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

    Xingwei Qu, Shaowen Wang, Zihao Huang, Kai Hua, Fan Yin, Jundong Zhou, Qiyang Min, Zihao Wang, Yizhi LI, Tianyu Zhang, He Xing, Zheng Zhang, Yuxuan Song, Tianyu Zheng, Zhiyuan Zeng, Chenghua Lin, Ge Zhang, Wenhao Huang · PDF
  18. Dynamics Within Latent Chain-of-Thought: An Empirical Study of Causal Structure

    Zirui Li, Xuefeng Bai, Kehai Chen, Yizhi LI, Jian Yang, Chenghua Lin, Min Zhang · PDF
  19. Emergent Analogy in Transformers

    Gouki Minegishi, Jingyuan Feng, Hiroki Furuta, Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo · PDF
  20. Encode, Think, Decode: Scaling test-time reasoning with recursive latent thoughts

    Yeskendir Koishekenov, Aldo Lipani, Nicola Cancedda · PDF
  21. Energy-Conditioned Thinking: A Three-State Framework for Adaptive Depth and Halting

    Ning Coeva · PDF
  22. From Growing to Looping: A Unified View of Iterative Computation in LLMs

    Ferdinand Kapl, Emmanouil Angelis, Kaitlin Maile, Johannes von Oswald, Stefan Bauer · PDF
  23. How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision?

    Yingqian Cui, Zhenwei Dai, Bing He, Zhan Shi, Hui Liu, Rui Sun, Zhiji Liu, Yue Xing, Jiliang Tang, Benoit Dumoulin · PDF
  24. How to Train Your HRM

    Sam Olesker-Taylor, Erika Aranas, Michael Arthur Leopold Pearce, Luke Hudlass-Galley · PDF
  25. Implicit Statistical Inference in Transformers: Approximating Likelihood-Ratio Tests In-Context

    Faris Chaudhry, Siddhant Gadkari · PDF
  26. Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning

    Deqian Kong, Minglu Zhao, Aoyang Qin, Bo Pang, Chenxin Tao, David Hartmann, Edouardo Honig, Dehong Xu, Amit H. Kumar, Matthew Sarte, Chuan Li, Jianwen Xie, Ying Nian Wu · PDF
  27. Is continuous CoT better suited for multilingual reasoning?

    Ali Hamza Bashir, Behzad Shomali, Markus Frey, Mehdi Ali, Rafet Sifa, David Berghaus · PDF
  28. LaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generation

    Gabriele Cesa, Thomas Hehn, Aleix Torres-Camps, Àlex Batlle Casellas, Jordi Ros-Giralt, Arash Behboodi, Tribhuvanesh Orekondy · PDF
  29. LASER: Low-Rank Activation SVD for Efficient Recursion

    Ege Çakar, Ketan Raghu, Lia Zheng · PDF
  30. Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning

    Lina Berrayana, Ahmed Heakl, Abdullah Sohail, Thomas Hofmann, Salman Khan, Wei Chen · PDF
  31. LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

    Xinwu Ye, Yicheng Mao, Jia Zhang, Yimeng Liu, Li Hao, Fang Wu, Zhiwei Li, Yuxuan Liao, Zehong Wang, Zhiyuan Liu, Zhenfei Yin, Li Yuan, Philip Torr, Huan Sun, xiangxiang Zeng, Mengdi Wang, Le Cong, Shenghua Gao, Xiangru Tang · PDF
  32. Learning Efficient Latent Reasoning with Abstract Chain-of-Thought

    Keshav Ramji, Tahira Naseem, Ramón Fernandez Astudillo · PDF
  33. Learning from Partial Chain-of-Thought via Truncated-Reasoning Self-Distillation

    Gianluigi Silvestri, Edoardo Cetin · PDF
  34. Learning Multi-step Reasoning via Persistent Latent State Propagation

    Yinxi Li, Jiaao Chen, Fang Wu, Jiakai Yu, Heli Qi, Weihao Xuan, Haokai Zhao, Pengyu Nie, Di Jin, Xiangru Tang · PDF
  35. Learning State-Tracking from Code: REPL Traces and Probabilistic Automata

    Julien Siems, Riccardo Grazzi, Kirill Kalinin, Hitesh Ballani, Babak Rahmani · PDF
  36. Learning to Execute Graph Algorithms Exactly with Graph Neural Networks

    Muhammad Fetrat Qharabagh, Artur Back de Luca, George Giapitzakis, Kimon Fountoulakis · PDF
  37. Lightweight Latent Reasoning for Narrative Tasks

    Alexander Gurung, Nikolay Malkin, Mirella Lapata · PDF
  38. LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations

    William Lugoloobi, Thomas Foster, William Bankes, Chris Russell · PDF
  39. LOOK BEFORE YOU LEAP: THERMODYNAMIC ARBITRATION OF PARAMETRIC AND NON-PARAMETRIC KNOWLEDGE IN LLM AGENTS VIA SELF-REGULATING MEMORY ARCHITECTURES

    Akash Das, Ishan Roy · PDF
  40. Mechanisms of Introspective Awareness

    Uzay Macar, Li Yang, Atticus Wang, Peter Wallich, Emmanuel Ameisen, Jack Lindsey · PDF
  41. Mechanistic Analysis Of Universality: Numerical Comparison Circuits Across Transformer Architectures

    Arya Bhardia, Julian Ramirez, Siddhanta Verma, Karen Mkrtchyan · PDF
  42. Mechanistic Evidence for Faithfulness Decay in Chain-of-Thought Reasoning

    Donald Ye, Max Loffgren, Om Kotadia, Linus Wong, Jonas Rohweder · PDF
  43. MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning

    Yapeng Mi, Yanpeng Zhao, Hengli Li, Chenxi Li, Huimin Wu, Xiaojian Ma, Song-Chun Zhu, Ying Nian Wu, Qing Li · PDF
  44. Modeling Tool Use in Transformers via Computation Oracles

    Utkarsh Tiwari, Sai Soumya Nalli, Amit Deshpande · PDF
  45. Offline RL with Hierarchical Action Chunking

    Ahad Jawaid · PDF
  46. On the Residual Scaling of Looped Transformers: Stability and Transferability

    Shaowen Wang, Bingrui Li, Ge Zhang, Wenhao Huang, Jian Li · PDF
  47. One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models

    Chris Cameron, Wangzheng Wang, Nikita Ivanov, Ashmita Bhattacharyya, Didier Chételat, Yingxue Zhang · PDF
  48. Parcae: A Dynamical Systems Perspective to Stable Looped LLMs

    Hayden Prairie, Zachary Novack, Taylor Berg-Kirkpatrick, Daniel Y Fu · PDF
  49. Polestar-Cache: Reconciling Parallel Decoding and Accuracy in Diffusion LLMs via Token Drift-Aware KV Cache Recalibration

    Mingyu Lee, Akshat Ramachandran, Souvik Kundu, Tushar Krishna · PDF
  50. Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space

    Chengzhi Liu, Yuzhe YANG, Yue Fan, Qingyue Wei, Sheng Liu, Xin Eric Wang · PDF
  51. RECURRENT-DEPTH VLA: IMPLICIT TEST-TIME COMPUTE SCALING OF VISION–LANGUAGE–ACTION MODELS VIA LATENT ITERATIVE REASONING

    Yalcin Tur, Jalal Naghiyev, Haoquan Fang, Wei-Chuan Tsai, Jiafei Duan, Dieter Fox, Ranjay Krishna · PDF
  52. Recursive Reasoning as Attractor Landscape Search: Mechanistic Dynamics of the Tiny Recursive Model

    Andreas Efstathiou, Aishwarya Balwani · PDF
  53. Rejection Mixing: Fast Semantic Propagation of Mask Tokens for Efficient DLLM Inference

    Yushi Ye, Feng Hong, Huangjie Zheng, Xu Chen, Zhiyong Chen, Yanfeng Wang, Jiangchao Yao · PDF
  54. RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance

    Tianlang Chen, Minkai Xu, Jure Leskovec, Stefano Ermon · PDF
  55. SEMIE: Semantic Entropy-Informed Decoding

    Benjamin Patrick Evans, Sumitra Ganesh, Leo Ardon · PDF
  56. Single-Position Intervention Fails: Distributed Output Templates Drive In-Context Learning

    Bryan Cheng, Jasper Zhang · PDF
  57. T2MLR: Transformer with Temporal Middle-Layer Recurrence

    Ziyang Cai, Xingyu Zhu, Yihe Dong, Yinghui He, Sanjeev Arora · PDF
  58. Task-Specific Knowledge Distillation via Intermediate Probes

    Ryan Brown, Chris Russell · PDF
  59. Test-Time Meta-Adaptation with Self-Synthesis

    Zeyneb N. Kaya, Nick Rui · PDF
  60. The Illusion of Superposition in Latent CoT via Soft Thinking

    Michael Rizvi-Martel, Marius Mosbach · PDF
  61. The Mechanistic Invariance Test: Genomic Language Models Fail To Learn Positional Regulatory Logic

    Bryan Cheng, Jasper Zhang · PDF
  62. The Power of Power Law: Asymmetry Enables Compositional Reasoning

    Zixuan Wang, Xingyu Dang, Jason D. Lee, Kaifeng Lyu · PDF
  63. THINK DEEP, SPEAK ONCE: RELIT, A RECURSIVE LATENT IMPLICIT TRANSFORMER FRAMEWORK

    Abhishek Panwar, Maheep Singh, Saksham Bansal · PDF
  64. Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

    Tianyu Fu, Yichen You, Zekai Chen, Guohao Dai, Huazhong Yang, Yu Wang · PDF
  65. Thinking in Latents: Adaptive Anchor Refinement for Implicit Reasoning in LLMs

    Disha Sheshanarayana, Rajat Subhra Pal, Manjira Sinha, Tirthankar Dasgupta · PDF
  66. Thinking into the Future: Latent Lookahead Training for Transformers

    Lorenzo Noci, Gregor Bachmann, Seyed-Mohsen Moosavi-Dezfooli, Moin Nabi · PDF
  67. Tiny Autoregressive Recursive Models

    Paulius Rauba, Claudio Fanconi, Mihaela van der Schaar · PDF
  68. Tiny Recursive Reasoning with Mamba-2 Attention Hybrid

    Wenlong Wang, Fergal Reid · PDF
  69. Transformers Provably Learn to Internalize Chain-of-Thought

    Yixiao Huang, Hanlin Zhu, Zixuan Wang, Jiantao Jiao, Stuart Russell, Somayeh Sojoudi, Song Mei · PDF
  70. TSLM: Tree-Structured Language Modeling for Divergent Thinking

    Doyoung Kim, JaeHyeok Doo, Minjoon Seo · PDF
  71. Ulterior Motives: Detecting Misaligned Reasoning in Continuous Thought Models

    Sharan Ramjee · PDF
  72. Variational Latent Reasoning Guided by Rendered Chain-of-Thought

    Fanmeng Wang, Haotian Liu, Guojiang Zhao, Hongteng Xu, Zhifeng Gao · PDF
  73. When does Chain-of-Thought Help: A Markovian Perspective

    Zihan Wang, Yijun Dong, Qi Lei · PDF
  74. When Intermediate Supervision Doesn’t Help: Evidence from Recurrent CNNs

    Elisa Klunder, Guillaume Pourcel, Steven Abreu · PDF
  75. When Pruning Breaks Reasoning: Chain-of-Thought Similarity and Faithfulness in Language Models

    AVINASH KUMAR SHARMA, Tushar Shinde · PDF
  76. When Shallow Wins: Silent Failures and the Depth–Accuracy Paradox in Latent Reasoning

    Subramanyam Sahoo, Aman Chadha, Vinija Jain, Divya Chaudhary · PDF
  77. Which Heads Matter for Reasoning? RL-Guided KV Cache Compression

    Wenjie Du, Li Jiang, Keda TAO, Xue Liu, Huan Wang · PDF
  78. ε-Leaf Enumeration: Non-Repeating Self-Consistency via Truncated Tree Search

    Xueyan Li, Johannes Zenn, Ekaterina Fadeeva, Guinan Su, Mrinmaya Sachan, Jonas Geiping · PDF