NeurIPS 2024 Workshop on Scientific Methods for Understanding Deep Learning

SciForDL

Submission deadline: Sep 18, 2024, 12:59 UTC
imported from OpenReview — check the website for extensions
Submission portal: OpenReview

Accepted papers (73)

Fetched from OpenReview (v2) on 2026-06-10.

A Continuous-Time Analysis of Adaptive Optimization and Normalization
Rhys Gould, Hidenori Tanaka · PDF
A Method on Searching Better Activation Functions
Haoyuan Sun, Zihao Wu, Bo Xia, Pu Chang, Zibin Dong, Yifu Yuan, Yongzhe Chang, Xueqian Wang · PDF
Alice in Wonderland: Simple Tasks Reveal Severe Generalization and Basic Reasoning Deficits in State-Of-the-Art Large Language Models
Marianna Nezhurina, Lucia Cipolina-Kun, Mehdi Cherti, Jenia Jitsev · PDF
Amplified Early Stopping Bias: Overestimated Performance with Deep Learning
Nona Rajabi, Antonio H. Ribeiro, Miguel Vasco, Danica Kragic · PDF
Are Capsule Networks Texture or Shape Biased?
Riccardo Renzulli, Dominik Vranay, Marco Grangetto · PDF
BatchTopK Sparse Autoencoders
Bart Bussmann, Patrick Leask, Neel Nanda · PDF
Causation Does Not Imply Correlation: A Study of Circuit Mechanisms and Model Behaviors
Jenny Kaufmann, Victoria R Li, Martin Wattenberg, David Alvarez-Melis, Naomi Saphra · PDF
Characterizing stable regions in the residual stream of LLMs
Jett Janiak, Jacek Karwowski, Chatrik Singh Mangat, Giorgi Giglemiani, Nora Petrova, Stefan Heimersheim · PDF
Comparing Apples and Oranges: is Stitching Similarity a Load of Spheres?
Damian Smith, Antonia Marcu · PDF
Denoising for Manifold Extrapolation
Zeyu Yun, Galen Chuang, Derek Dong, Yubei Chen · PDF
Distributional Scaling Laws for Emergent Capabilities
Rosie Zhao, Naomi Saphra, Sham M. Kakade · PDF
Effectiveness of Sparse Autoencoder for understanding and removing gender bias in LLMs
Praveen Hegde · PDF
Eliminating Position Bias of Language Models: A Mechanistic Approach
Ziqi Wang, Hanlin Zhang, Xiner Li, Kuan-Hao Huang, Chi Han, Shuiwang Ji, Sham M. Kakade, Hao Peng, Heng Ji · PDF
Emergence of Hierarchical Emotion Representations in Large Language Models
Bo Zhao, Maya Okawa, Eric J Bigelow, Rose Yu, Tomer Ullman, Hidenori Tanaka · PDF
Emergent properties with repeated examples
Francois Charton, Julia Kempe · PDF
EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition
Youssef Doulfoukar, Laurent Mertens, Joost Vennekens · PDF
Evaluating Loss Landscapes from a Topology Perspective
Tiankai Xie, Caleb Geniesse, Jiaqing Chen, Yaoqing Yang, Dmitriy Morozov, Michael W. Mahoney, Ross Maciejewski, Gunther H. Weber · PDF
Explicit Regularisation, Sharpness and Calibration
Israel Mason-Williams, Fredrik Ekholm, Ferenc Huszár · PDF
Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks
Alba Carballo-Castro, Sonia Laguna, Moritz Vandenhirtz, Julia E Vogt · PDF
Exploring model depth and data complexity through the lens of cellular automata
Tianyu He, Darshil Doshi, Aritra Das, Andrey Gromov · PDF
Generalization vs Specialization under Concept Shift
Alex Nguyen, David J. Schwab, Vudtiwat Ngampruetikorn · PDF
Hiding in a Plain Sight: Out-of-Distribution Data in the Logit Space Embeddings
Vangjush Kostandin Komini, Sarunas Girdzijauskas · PDF
How Learning Rates Shape Neural Network Focus: Insights from Example Ranking
Ekaterina Lobacheva, Keller Jordan, Aristide Baratin, Nicolas Le Roux · PDF
How rare events shape the learning curves of hierarchical data
Hyunmo Kang, Francesco Cagnetta, Matthieu Wyart · PDF
Illusions as features: the generative side of recognition
Tahereh Toosi, Kenneth D. Miller · PDF
Impact of Label Noise on Learning Complex Features
Rahul Vashisht, P Krishna Kumar, Harsha Vardhan Govind, Harish Guruprasad Ramaswamy · PDF
Improving Deep Learning Speed and Performance through Synaptic Neural Balance
Antonios Alexos, Ian Domingo, Pierre Baldi · PDF
Input Space Mode Connectivity in Deep Neural Networks
Jakub Vrabel, Ori Shem-Ur, Yaron Oz, David Krueger · PDF
Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations
Kola Ayonrinde, Michael T Pearce, Lee Sharkey · PDF
Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs
Daniel J Lee, Stefan Heimersheim · PDF
Is Expressivity Essential for the Predictive Performance of Graph Neural Networks?
Fabian Jogl, Pascal Welke, Thomas Gärtner · PDF
Is network fragmentation a useful complexity measure?
Coenraad Mouton, Randle Rabe, Daniël Gerbrand Haasbroek, Marthinus Wilhelmus Theunissen, Hermanus Lambertus Potgieter, Marelie Hattingh Davel · PDF
Is Saliency Really Captured By Gradient?
Nehal Yasin, Jonathon Hare, Antonia Marcu · PDF
Knowledge Distillation for Teaching Symmetry Invariances
Patrick Odagiu, Nicole Nobili, Fabian Dionys Schrag, Yves Bicker, Yuhui Ding · PDF
Knowledge Distillation: The Functional Perspective
Israel Mason-Williams, Gabryel Mason-Williams, Mark Sandler · PDF
Language model scaling laws and zero-sum learning
Andrei Mircea, Ekaterina Lobacheva, Supriyo Chakraborty, Nima Chitsazan, Irina Rish · PDF
Learnability in the Context of Neural Tangent Kernels
Progyan Das, Dwip Dalal · PDF
Learned Random Label Predictions as a Neural Network Complexity Metric
Marlon Becker, Benjamin Risse · PDF
Learning Stochastic Rainbow Networks
Vivian White, Muawiz Sajjad Chaudhary, Guy Wolf, Guillaume Lajoie, Kameron Decker Harris · PDF
Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference
Anton Xue, Avishree Khare, Rajeev Alur, Surbhi Goel, Eric Wong · PDF
Memorization to Generalization: The Emergence of Diffusion Models from Associative Memory
Bao Pham, Gabriel Raya, Matteo Negri, Mohammed J Zaki, Luca Ambrogioni, Dmitry Krotov · PDF
Model Recycling: Model component reuse to promote in-context learning
Lindsay M. Smith, Chase Goddard, Vudtiwat Ngampruetikorn, David J. Schwab · PDF
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
Yi Zhang, Difan Zou · PDF
Pre-processing and Compression: Understanding Hidden Representation Refinement Across Imaging Domains via Intrinsic Dimension
Nicholas Konz, Maciej A Mazurowski · PDF
Probing the Decision Boundaries of In-context Learning in Large Language Models
Siyan Zhao, Tung Nguyen, Aditya Grover · PDF
Rethinking Knowledge Transfer in Learning Using Privileged Information
Danil Provodin, Bram van den Akker, Christina Katsimerou, Maurits Clemens Kaptein, Mykola Pechenizkiy · PDF
Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics
Charlotte Beylier, Simon M. Hofmann, Nico Scherf · PDF
Robust Learning in Bayesian Parallel Branching Graph Neural Networks: The Narrow Width Limit
Zechen Zhang, Haim Sompolinsky · PDF
softmax is not enough (for sharp out-of-distribution)
Petar Veličković, Christos Perivolaropoulos, Federico Barbero, Razvan Pascanu · PDF
SolidMark: How to Evaluate Memorization in Image Generative Models
Nicky Kriplani, Minh Pham, Malikka Rajshahi, Chinmay Hegde, Niv Cohen · PDF
Sometimes I am a Tree: Data Drives Fragile Hierarchical Generalization
Tian Qin, Naomi Saphra, David Alvarez-Melis · PDF
Sparse autoencoders for dense text embeddings reveal hierarchical feature sub-structure
Christine Ye, Charles O'Neill, John F Wu, Kartheik G. Iyer · PDF
Specialization-generalization transition in exemplar-based in-context learning
Chase Goddard, Lindsay M. Smith, Vudtiwat Ngampruetikorn, David J. Schwab · PDF
Standard adversarial attacks only fool the final layer
Stanislav Fort · PDF
Stitching Sparse Autoencoders of Different Sizes
Patrick Leask, Bart Bussmann, Joseph Isaac Bloom, Curt Tigges, Noura Al Moubayed, Neel Nanda · PDF
Structure Development in List Sorting Transformers
Einar Urdshals, Jasmina Nasufi · PDF
Structured Identity Mapping Learning As a Model for Compositional Generalization in Generative Models
Yongyi Yang, Core Francisco Park, Ekdeep Singh Lubana, Maya Okawa, Wei Hu, Hidenori Tanaka · PDF
Testing knowledge distillation theories with dataset size
Giulia Lanzillotta, Felix Sarnthein, Gil Kur, Thomas Hofmann, Bobby He · PDF
The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains
Ezra Edelman, Nikolaos Tsilivis, Surbhi Goel, Benjamin L. Edelman, Eran Malach · PDF
The Master Key Filters Hypothesis: Deep Filters Are General
Zahra Babaiee, Peyman Kiasari, Daniela Rus, Radu Grosu · PDF
The Pitfalls of Memorization: When Memorization Hinders Generalization
Reza Bayat, Mohammad Pezeshki, Elvis Dohmatob, David Lopez-Paz, Pascal Vincent · PDF
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov, Kushal Tirumala, Hassan Shapourian, Paolo Glorioso, Dan Roberts · PDF
Token-token correlations predict the scaling of the test loss with the number of input tokens
Francesco Cagnetta, Matthieu Wyart · PDF
Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps
Fuxiao Liu · PDF
Training Dynamics of Convolutional Neural Networks for Learning the Derivative Operator
Erik Y. Wang, Yongji Wang, Ching-Yao Lai · PDF
Training Neural Networks for Modularity aids Interpretability
Satvik Golechha, Dylan Cope, Nandi Schoots · PDF
Transformers can reinforcement learn to approximate Gittins Index
Vladimir Petrov, Nikhil Vyas, Lucas Janson · PDF
Twin Studies of Factors in OOD Generalization
Victoria R Li, Jenny Kaufmann, David Alvarez-Melis, Naomi Saphra · PDF
Understanding the Limitations of B-Spline KANs: Convergence Dynamics and Computational Efficiency
Avik Pal, Dipankar Das · PDF
Understanding the Transient Nature of In-Context Learning: The Window of Generalization
Core Francisco Park, Ekdeep Singh Lubana, Hidenori Tanaka · PDF
Understanding Visual Concepts Across Models
Brandon Trabucco, Max A Gurinas, Kyle Doherty, Russ Salakhutdinov · PDF
Unraveling the Latent Hierarchical Structure of Language and Images via Diffusion Models
Antonio Sclocchi, Noam Itzhak Levi, Alessandro Favero, Matthieu Wyart · PDF
We Need Far Fewer Unique Filters Than We Thought
Zahra Babaiee, Peyman Kiasari, Daniela Rus, Radu Grosu · PDF
