Scale AI

company

Verified

https://scale.com/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

utkarsh4430 submitted a paper 26 days ago

Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR

utkarsh4430 authored a paper 26 days ago

VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap

utkarsh4430 authored a paper 26 days ago

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

View all activity

Papers

Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR

HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?

View all Papers

buckets 3

ScaleAI/hil-bench-swe-images

ScaleAI/hil-bench-sql-artifacts

ScaleAI/hil-bench-swe-databases

Collections 4

View 4 collections

models 5

ScaleAI/swe-oec-32b

33B • Updated Oct 20, 2025 • 2

ScaleAI/swe-oec-7b

8B • Updated Oct 20, 2025 • 2 • 1

ScaleAI/vft-llama-3_1-8b-instruct

8B • Updated Jul 28, 2025 • 4

ScaleAI/tps-dependencies

Updated Jun 25, 2025

ScaleAI/mhj-llama3-8b-rmu

8B • Updated Aug 28, 2024 • 6 • 3

datasets 29

ScaleAI/ROK-FORTRESS_public

Viewer • Updated 27 days ago • 791 • 31

ScaleAI/aspi

Viewer • Updated May 15 • 1.46k • 165 • 1

ScaleAI/hil-bench

Viewer • Updated Mar 31 • 200 • 1.14k • 1

ScaleAI/MultiChallenge

Viewer • Updated Mar 31 • 266 • 433 • 1

ScaleAI/SWE-Atlas-QnA

Viewer • Updated Mar 31 • 124 • 922 • 17

ScaleAI/lhaw

Viewer • Updated Mar 20 • 285 • 952 • 6

ScaleAI/RaR-Medicine

Viewer • Updated Feb 24 • 22.4k • 70 • 1

ScaleAI/RaR-Science

Viewer • Updated Feb 24 • 22.9k • 207 • 1

ScaleAI/SWE-bench_Pro

Benchmark • Updated Feb 23 • 731 • 69.9k • 126

ScaleAI/mrt

Updated Feb 23 • 638 • 6

View 29 datasets