david dinucu-jianu's picture

10 34

david dinucu-jianu PRO

rd211

·

rd211

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

Salesforce/FARE-20B

updated a dataset 3 months ago

rd211/Big-Math-RL-Verified-Filtered

updated a model 3 months ago

rd211/Qwen3-4B-Confv2-merged

View all activity

Organizations

upvoted a paper 7 months ago

From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning

Paper • 2505.15607 • Published May 21, 2025 • 3

upvoted a collection 9 months ago

Kimina Prover Preview

State-of-the-Art Models for Formal Mathematical Reasoning • 5 items • Updated Apr 28, 2025 • 33

upvoted a paper 9 months ago

MathTutorBench: A Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors

Paper • 2502.18940 • Published Feb 26, 2025 • 2

upvoted a paper 11 months ago

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Paper • 2411.04983 • Published Nov 7, 2024 • 13

upvoted a paper 12 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 287

upvoted a paper about 1 year ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 24

upvoted an article about 1 year ago

Article

Releasing the largest multilingual open pretraining dataset

Nov 13, 2024

•

104

upvoted 3 collections over 1 year ago

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated 3 days ago • 52

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 11 days ago • 162

DeepSeek-Math

DeepSeek Math series • 6 items • Updated Nov 27, 2025 • 44