Masoud Hashemi

masoudhashemi

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

LLM360/K2-V2

liked a Space 16 days ago

huggingface/ai-deadlines

liked a dataset 21 days ago

nvidia/Nemotron-Agentic-v1

upvoted an article about 1 month ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

29 days ago • 82

upvoted a paper 3 months ago

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 5

upvoted an article 3 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

Sep 23, 2025 • 134

upvoted a collection 3 months ago

Apriel-1.5-15B-Thinker

3 items • Updated Oct 2, 2025 • 76

upvoted an article 4 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

+9

Sep 22, 2025 • 125

upvoted a paper 4 months ago

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

upvoted a paper 5 months ago

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published Jul 21, 2025 • 67

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025 • 743

upvoted a collection 7 months ago

MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Oct 21, 2025 • 120

upvoted a collection 8 months ago

General-Reasoner

Advancing LLMs' general reasoning capabilities • 9 items • Updated Oct 12, 2025 • 6

upvoted an article 8 months ago

Article

Selective fine-tuning of Language Models with Spectrum

Sep 3, 2024 • 36

upvoted an article 10 months ago

Article

Open R1: Update #3

Mar 11, 2025 • 296

upvoted an article 11 months ago

Article

The N Implementation Details of RLHF with PPO

+1

Oct 24, 2023 • 71

upvoted an article over 1 year ago

Article

BigCodeBench: The Next Generation of HumanEval

+7

Jun 18, 2024 • 52

upvoted a collection over 1 year ago

[lecture artifacts] aligning open language models

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 57

upvoted a paper over 1 year ago

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 62

upvoted a collection almost 2 years ago

MoEs papers reading list

60 items • Updated Nov 4, 2024 • 145

upvoted a paper about 2 years ago

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 55