Masoud Hashemi

masoudhashemi

AI & ML interests

None yet

Recent Activity

liked a Space 14 days ago

aminediroHF/trainer-generator-bf16-mismatch

liked a Space 20 days ago

AdithyaSK/rl-environments-guide

upvoted an article about 2 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

liked a Space 14 days ago

Defeating the trainer-generator precision mismatch in TRL

🎯

liked a Space 20 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

168

Building and scaling RL environments for LLM training

liked a model 5 months ago

LLM360/K2-V2

Updated Jan 26 • 150 • 32

liked a Space 5 months ago

AI Deadlines

⚡

756

Find upcoming AI conference deadlines

liked a dataset 5 months ago

nvidia/Nemotron-Agentic-v1

Preview • Updated Dec 15, 2025 • 3.13k • 166

liked a Space 5 months ago

Apriel Chat

💬

ServiceNow-AI model chat

liked a model 6 months ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

Image-Text-to-Text • Updated Dec 22, 2025 • 545 • 300

liked a dataset 6 months ago

open-thoughts/OpenThoughts-Agent-v1-SFT

Viewer • Updated Jan 27 • 15.2k • 2.47k • 92

liked a Space 8 months ago

DNR-Bench

⚡

DNR-Bench leaderboard for RLM's

liked a model 8 months ago

ServiceNow-AI/Apriel-1.5-15b-Thinker

Image-Text-to-Text • 15B • Updated Oct 6, 2025 • 230 • 468

liked a dataset 8 months ago

GAIR/LIMI

Viewer • Updated Oct 9, 2025 • 78 • 339 • 24

liked 4 datasets 9 months ago

liked a model 9 months ago

AI-MO/Kimina-Prover-RL-1.7B

2B • Updated Aug 14, 2025 • 1.68k • 12

liked 4 datasets 10 months ago

nvidia/OpenScienceReasoning-2

Viewer • Updated Jul 31, 2025 • 803k • 1.5k • 57

nvidia/OpenMathInstruct-1

Viewer • Updated Feb 16, 2024 • 6.08M • 3.99k • 253

nvidia/AceReason-1.1-SFT

Viewer • Updated Jun 18, 2025 • 3.96M • 2.02k • 100

microsoft/rStar-Coder

Viewer • Updated Jul 20, 2025 • 1.86M • 5.33k • 242