kyle PRO

kaikaidai

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Atla

kaikaidai's activity

New activity in AtlaAI/judge-arena 3 days ago
updated a Space 4 days ago
New activity in AtlaAI/judge-arena 4 days ago
New activity in AtlaAI/judge-arena 5 days ago

add-Flow-Judge-v0.1

#8 opened 8 days ago by
bergr7f
New activity in AtlaAI/judge-arena about 1 month ago

Promotion to get more voters

1
#7 opened about 1 month ago by
softclone
posted an update about 1 month ago
1037
📈 Early results on the 8B evaluation model we've been training...

@NinaCalvi wrote about the progress we've made this quarter towards training the best 'LLM-as-a-judge' evaluator. We've significantly improved against the baseline and are approaching state-of-the-art evaluation performance with an 8B model.

Next up: training Llama-3.1-70B 👀

Here's the full article: https://www.atla-ai.com/post/evaluating-the-evaluator
  • 2 replies
New activity in AtlaAI/judge-arena about 2 months ago
updated a Space about 2 months ago
updated a Space about 2 months ago
New activity in AtlaAI/judge-arena 2 months ago

Push main

#1 opened 2 months ago by
kaikaidai