Jarrod Barnes's picture

Jarrod Barnes PRO

Jarrodbarnes

·

https://dynamicalsystems.ai

AI & ML interests

Continual Learning, Reinforcement Learning

Recent Activity

updated a dataset about 3 hours ago

Jarrodbarnes/processrl-terminal-environments

published a dataset about 3 hours ago

Jarrodbarnes/processrl-terminal-environments

liked a dataset about 16 hours ago

open-thoughts/OpenThoughts-Agent-v1-RL

View all activity

Organizations

upvoted an article 2 days ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+5

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra

•

3 days ago

• 31

upvoted a paper 3 days ago

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Paper • 2605.26494 • Published 4 days ago • 32

upvoted a paper 9 days ago

Look Before You Leap: Autonomous Exploration for LLM Agents

Paper • 2605.16143 • Published 15 days ago • 9

upvoted a collection 10 days ago

📊 DNA benchmarks

Zero-shot DNA benchmarks for Variant Effect prediction, Sequence Recovery and Perturbation tasks. • 5 items • Updated 11 days ago • 9

upvoted a paper 10 days ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published Apr 16 • 12

upvoted a collection 11 days ago

Laguna XS.2

Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 5 items • Updated 23 days ago • 23

upvoted a collection 15 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 18 items • Updated about 16 hours ago • 298

upvoted 2 papers 22 days ago

ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?

Paper • 2602.19594 • Published Feb 23 • 3

Structured Distillation of Web Agent Capabilities Enables Generalization

Paper • 2604.07776 • Published Apr 9 • 23

upvoted a paper 24 days ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 48

upvoted a collection about 1 month ago

💧 LFM2.5

Collection of post-trained and base LFM2.5 models. • 33 items • Updated 2 days ago • 142

upvoted a paper about 1 month ago

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Paper • 2604.24954 • Published Apr 27 • 24

upvoted 3 collections about 1 month ago

MiMo-V2.5

4 items • Updated Apr 27 • 87

SAM3

6 items • Updated Mar 26 • 263

Qwen3.6

4 items • Updated Apr 22 • 382

upvoted 2 papers about 2 months ago

Crystalite: A Lightweight Transformer for Efficient Crystal Modeling

Paper • 2604.02270 • Published Apr 2 • 1

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published Mar 26 • 53

upvoted an article 2 months ago

Article

SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation

OpenMed

•

Mar 23

• 17

upvoted 2 papers 2 months ago

Self-Improving Pretraining: using post-trained models to pretrain better models

Paper • 2601.21343 • Published Jan 29 • 19

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 185