Scale AI

company

Verified

https://scale.com/



Recent Activity

tu-trinh-scale authored a paper 4 days ago

A StrongREJECT for Empty Jailbreaks

tu-trinh-scale authored a paper 4 days ago

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents

tu-trinh-scale authored a paper 4 days ago

Learning to Coordinate with Experts



ScaleAI's Papers (5)

HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?
Submitted by Tu Trinh

SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?
Submitted by taesiri

Agentic Rubrics as Contextual Verifiers for SWE Agents
Submitted by taesiri

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
Submitted by taesiri

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
Submitted by Junkai Zhang