DataComp

non-profit

https://www.datacomp.ai/dclm/index.html#home

AI & ML interests

None defined yet.

Recent Activity

Lewis-Lau authored a paper 3 days ago

Unlocking Feature Learning in Gated Delta Networks at Scale

Lewis-Lau authored a paper 3 days ago

Self-Distilled Policy Gradient

Lewis-Lau submitted a paper 3 days ago

Unlocking Feature Learning in Gated Delta Networks at Scale

View all activity

authored 2 papers 3 days ago

Unlocking Feature Learning in Gated Delta Networks at Scale

Paper • 2606.04048 • Published 6 days ago • 2

Self-Distilled Policy Gradient

Paper • 2606.04036 • Published 6 days ago • 22

submitted a paper to Daily Papers 3 days ago

Unlocking Feature Learning in Gated Delta Networks at Scale

Paper • 2606.04048 • Published 6 days ago • 2

submitted a paper to Daily Papers 4 days ago

Self-Distilled Policy Gradient

Paper • 2606.04036 • Published 6 days ago • 22

authored 2 papers 6 days ago

EXCEEDS: Extracting Complex Events via Nugget-based Grid Modeling in Scientific Domain

Paper • 2406.14075 • Published Apr 24

Beyond Static Dialogues: Benchmarking Realistic, Heterogeneous, and Evolving Long-Term Memory

Paper • 2605.31086 • Published 10 days ago • 5

submitted a paper to Daily Papers 27 days ago

PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents

Paper • 2605.07039 • Published May 7 • 4

authored 3 papers about 1 month ago

ChartGen: Scaling Chart Understanding Via Code-Guided Synthetic Chart Generation

Paper • 2507.19492 • Published May 31, 2025 • 1

Composition-Grounded Instruction Synthesis for Visual Reasoning

Paper • 2510.15040 • Published Oct 16, 2025

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published Mar 28 • 29

submitted a paper to Daily Papers 2 months ago

Test-Time Scaling Makes Overtraining Compute-Optimal

Paper • 2604.01411 • Published Apr 1 • 28

authored a paper 3 months ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 154

authored 8 papers 3 months ago

Towards Principled Disentanglement for Domain Generalization

Paper • 2111.13839 • Published Nov 27, 2021

Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation

Paper • 2202.01336 • Published Feb 2, 2022

The Impact of Symbolic Representations on In-context Learning for Few-shot Reasoning

Paper • 2212.08686 • Published Dec 16, 2022

How Does Critical Batch Size Scale in Pre-training?

Paper • 2410.21676 • Published Oct 29, 2024

Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning

Paper • 2506.10378 • Published Jun 12, 2025 • 2

EvoLM: In Search of Lost Language Model Training Dynamics

Paper • 2506.16029 • Published Jun 19, 2025

AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?

Paper • 2507.15887 • Published Jul 19, 2025

Weight Decay Improves Language Model Plasticity

Paper • 2602.11137 • Published Feb 11 • 2