ICLR 2025 Past Other
Will Synthetic Data Finally Solve the Data Access Problem?
ICLR 2025 Workshop SynthData
- Submission deadline
- Feb 6, 2025, 23:59 UTC imported from OpenReview — check the website for extensions
- Submission portal
- OpenReview
- Notes
- Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).
Accepted papers (42)
Fetched from OpenReview (v2) on 2026-06-10.
-
[Tiny] Parameterized Synthetic Text Generation with SimpleStories
-
[Tiny] Synthetic-based retrieval of patient medical data
-
[Tiny] Understanding the Impact of Data Domain Extraction on Synthetic Data Privacy
-
Accelerating Differentially Private Federated Learning via Adaptive Extrapolation
-
AN OPTIMAL CRITERION FOR STEERING DATA DISTRIBUTIONS TO ACHIEVE EXACT FAIRNESS
-
Augmented Conditioning Is Enough For Effective Training Image Generation
-
Benchmarking Differentially Private Tabular Data Synthesis Algorithms
-
Breaking Focus: Contextual Distraction Curse in Large Language Models
-
Can LLMs Replace Economic Choice Prediction Labs? The Case of Language-based Persuasion Games
-
Can Transformers Learn Full Bayesian Inference In Context?
-
Compositional World Knowledge leads to High Utility Synthetic data
-
Deconstructing Bias: A Multifaceted Framework for Diagnosing Cultural and Compositional Inequities in Text-to-Image Generative Models
-
Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection
-
DIET-PATE: Knowledge Transfer in PATE without Public Data
-
Differentially Private Synthetic Data via APIs 3: Using Simulators Instead of Foundation Model
-
Efficient Randomized Experiments Using Foundation Models
-
Empowering LLMs in Decision Games through Algorithmic Data Synthesis
-
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions
-
Evaluating Inter-Column Logical Relationships in Synthetic Tabular Data Generation
-
Grounding QA Generation in Knowledge Graphs and Literature: A Scalable LLM Framework for Scientific Discovery
-
How Well Does Your Tabular Generator Learn the Structure of Tabular Data?
-
Human-like compositional learning of visually-grounded concepts using synthetic data
-
Improved Density Ratio Estimation for Evaluating Synthetic Data Quality
-
Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data?
-
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
-
Leveraging Vertical Public-Private Split for Improved Synthetic Data Generation
-
Orchestrating Synthetic Data with Reasoning
-
Out-of-Distribution Detection using Synthetic Data Generation
-
Private Federated Learning using Preference-Optimized Synthetic Data
-
SoftSRV: Learn to generate targeted synthetic data.
-
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
-
Stronger Models are NOT Always Stronger Teachers for Instruction Tuning
-
SyntheRela: A Benchmark For Synthetic Relational Database Generation
-
Synthetic Data for Blood Vessel Network Extraction
-
Synthetic Data Pruning in High Dimensions: A Random Matrix Perspective
-
Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation
-
Text to 3D Object Generation for Scalable Room Assembly
-
TIMER: Temporal Instruction Modeling and Evaluation for Longitudinal Clinical Records
-
Towards Internet-Scale Training For Agents
-
Training-Free Safe Denoisers For Safe Use of Diffusion Models
-
TRIG-Bench: A Benchmark for Text-Rich Image Grounding
-
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data