Atla

Enterprise
company
Verified
Activity Feed

AI & ML interests

Scalable oversight

Recent Activity

kaikaidaiย  updated a Space 19 days ago
AtlaAI/selene
kaikaidaiย  updated a model 19 days ago
AtlaAI/Selene-1-Mini-Llama-3.1-8B
kaikaidaiย  updated a Space 19 days ago
AtlaAI/judge-arena
View all activity

Articles

AtlaAI's activity

kaikaidaiย 
posted an update 5 months ago
view post
Post
1081
๐Ÿ“ˆ Early results on the 8B evaluation model we've been training...

@NinaCalvi wrote about the progress we've made this quarter towards training the best 'LLM-as-a-judge' evaluator. We've significantly improved against the baseline and are approaching state-of-the-art evaluation performance with an 8B model.

Next up: training Llama-3.1-70B ๐Ÿ‘€

Here's the full article: https://www.atla-ai.com/post/evaluating-the-evaluator
  • 2 replies
ยท
mbartoloย 
authored 10 papers about 1 year ago