20 54 83

Asaf Yehudai

Asaf-Yehudai

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

new activity 3 days ago

gaia-benchmark/leaderboard:Access to the submission and evaluation data

upvoted a paper 3 days ago

Open Data Synthesis For Deep Research

View all activity

Organizations

upvoted a paper about 23 hours ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published 5 days ago • 160

New activity in gaia-benchmark/leaderboard 3 days ago

Access to the submission and evaluation data

#69 opened 4 days ago by

Asaf-Yehudai

upvoted a paper 3 days ago

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published 9 days ago • 58

liked a model 19 days ago

stepfun-ai/NextStep-1-Large-Edit

Image-to-Image • 15B • Updated 20 days ago • 729 • 47

upvoted a paper 24 days ago

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Paper • 2508.09983 • Published 25 days ago • 67

liked a Space 27 days ago

531

GAIA Leaderboard

🦾

Submit and evaluate models on GAIA benchmark

liked a model about 1 month ago

LGAI-EXAONE/EXAONE-4.0-32B

Text Generation • 32B • Updated Aug 4 • 118k • 250

liked a Space about 1 month ago

6.36k

MTEB Leaderboard

🥇

Embedding Leaderboard

upvoted a paper about 1 month ago

CLEAR: Error Analysis via LLM-as-a-Judge Made Easy

Paper • 2507.18392 • Published Jul 24 • 19

commented a paper about 1 month ago

CLEAR: Error Analysis via LLM-as-a-Judge Made Easy

Paper • 2507.18392 • Published Jul 24 • 19 •

liked a Space about 2 months ago

SWE-Bench Verified Discriminative Subsets Leaderboard

🏆

Display model performance rankings

liked a Space 2 months ago

BlueBench Leaderboard

🥇

An open-source benchmark for enterprise use cases.

upvoted 3 papers 3 months ago

liked a dataset 3 months ago

ibm-research/justrank_judge_scores

Viewer • Updated Jun 8 • 1.51M • 22 • 2

upvoted a paper 3 months ago

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Paper • 2505.17813 • Published May 23 • 57

upvoted a paper 4 months ago

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published May 15 • 23

liked a Space 4 months ago

591

DreamO

🐨

A Unified Framework for Image Customization