4 31 216

PeijieDong

pprp

https://pprp.github.io

AI & ML interests

Model Compression; Large Language Model;

Recent Activity

liked a dataset 13 days ago

Gen-Verse/Open-AgentRL-30K

liked a model 16 days ago

nvidia/Nemotron-Flash-1B

liked a dataset about 1 month ago

Idavidrein/gpqa

View all activity

Organizations

None yet

liked a dataset 13 days ago

Gen-Verse/Open-AgentRL-30K

Viewer • Updated Oct 14, 2025 • 30.1k • 72 • 6

liked a model 16 days ago

nvidia/Nemotron-Flash-1B

Text Generation • 1.0B • Updated 12 days ago • 1.11k • 28

liked 2 datasets about 1 month ago

Idavidrein/gpqa

Benchmark • Updated Mar 28, 2024 • 1.25k • 71.3k • 345

nvidia/Llama-Nemotron-VLM-Dataset-v1

Viewer • Updated Oct 22, 2025 • 2.86M • 1.45k • 155

upvoted 2 papers about 2 months ago

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

Paper • 2512.01816 • Published Dec 1, 2025 • 91

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 293

liked 2 datasets 2 months ago

allenai/olmo-mix-1124

Viewer • Updated Aug 19, 2025 • 621M • 20k • 86

OptimalScale/ClimbLab

Viewer • Updated May 4, 2025 • 1.24B • 2.47k • 13

liked a Space 3 months ago

The Smol Training Playbook

📚

2.9k

The secrets to building world-class LLMs

liked a model 3 months ago

inclusionAI/Ring-flash-linear-2.0

Text Generation • 104B • Updated Oct 23, 2025 • 89 • 98

liked a dataset 3 months ago

InternSVG/SArena

Viewer • Updated Nov 4, 2025 • 14k • 210 • 8

liked 2 models 3 months ago

inclusionAI/Ring-lite-linear-preview

Text Generation • 17B • Updated Aug 18, 2025 • 52 • 37

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 1.31k • 362

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated Nov 18, 2025 • 65.1k • • 940

upvoted 2 papers 4 months ago

Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution

Paper • 2509.21072 • Published Sep 25, 2025 • 15

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 102

liked a model 5 months ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • 2B • Updated Nov 21, 2025 • 991 • 235

upvoted a paper 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 261

liked 2 datasets 5 months ago

nvidia/AceReason-1.1-SFT

Viewer • Updated Jun 18, 2025 • 3.96M • 1.42k • 97

nvidia/AceReason-Math

Viewer • Updated Jun 18, 2025 • 49.6k • 1.2k • 46

PeijieDong

AI & ML interests

Recent Activity

Organizations

pprp's activity

The Smol Training Playbook