
Masoud Hashemi

masoudhashemi

AI & ML interests

None yet

Recent Activity

liked a dataset about 9 hours ago

microsoft/rStar-Coder

Viewer • Updated 1 day ago • 1.86M • 38 • 74

upvoted an article 8 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others • 9 days ago • 541

upvoted a collection 30 days ago

MiniMax-M1

Collection

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 14 days ago • 108

liked a Space 30 days ago

325

MiniMax M1

💬

Generate code snippets and web applications from text descriptions

liked a model about 2 months ago

TIGER-Lab/general-verifier

Question Answering • 2B • Updated Apr 15 • 3.61k • 15

upvoted a collection about 2 months ago

General-Reasoner

Collection

Advancing LLMs' general reasoning capabilities • 9 items • Updated 22 days ago • 4

liked 2 models 2 months ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • 16B • Updated 20 days ago • 81.3k • 427

a-m-team/AM-Thinking-v1

Text Generation • 33B • Updated May 14 • 1.76k • 192

liked 2 datasets 2 months ago

Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24 • 60k • 4.04k • 477

joey00072/seeder_pico_thinking_function_calling

Viewer • Updated Apr 23 • 15 • 18 • 1

upvoted an article 3 months ago

Article

Selective fine-tuning of Language Models with Spectrum

Sep 3, 2024 • 36

liked a dataset 4 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 6.04k • 527

upvoted an article 4 months ago

Article

Open R1: Update #3

and 9 others • Mar 11 • 295

liked a Space 5 months ago

2.82k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 5 months ago

Article

The N Implementation Details of RLHF with PPO

and 2 others • Oct 24, 2023 • 61

liked a Space 6 months ago

Optillm

💬

Chat with different models using various approaches

liked a model 7 months ago

google/Gemma-Embeddings-v0.8

Updated Dec 12, 2024 • 48

liked a Space 7 months ago

574

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked 2 Spaces 8 months ago

105

Judge Arena

💻

Vote on AI responses to rank models

Open Persian LLM Leaderboard

🏅

Open Persian LLM Leaderboard
