thunder-research-group's Collections
SNU Thunder-LLM Korean Benchmark Suite
Updated Jun 13
thunder-research-group/SNU_Ko-LAMBADA • Updated Jun 13 • 2.26k • 303
thunder-research-group/SNU_Ko-WinoGrande • Updated Jun 13 • 1.27k • 13
thunder-research-group/SNU_Ko-ARC • Updated Jun 13 • 3.54k • 248
thunder-research-group/SNU_Ko-GSM8K • Updated Jun 13 • 1.32k • 17 • 1
thunder-research-group/SNU_Ko-IFEval • Updated Jun 13 • 841 • 17
thunder-research-group/SNU_Ko-EQ-Bench • Updated Jun 13 • 171 • 17
skt/kobest_v1 • Updated Mar 28, 2024 • 23.4k • 4.3k • 50
Note: We use the test set of hellaswag for evaluation.
HAERAE-HUB/KMMLU • Updated Mar 5, 2024 • 244k • 23.6k • 77
HYU-NLP/KR-HumanEval • Updated Jun 3 • 328 • 20
Note: We use v1 for evaluation.
thunder-research-group/korquad_v2_1 • Updated Jun 10 • 93.7k • 10
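The note on skt/kobest_v1 in the list above names a specific subset and split. A minimal sketch of how that selection might be expressed with the Hugging Face `datasets` library follows. It assumes the note refers to the configuration named `hellaswag` and the split named `test`; the `EvalSource` record and the `load_eval_split` helper are hypothetical names introduced only for illustration and are not part of the collection.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EvalSource:
    """Hypothetical record naming one evaluation split on the Hub."""
    repo_id: str
    config: Optional[str]
    split: str


# Assumption: the note means configuration "hellaswag", split "test".
KOBEST_HELLASWAG = EvalSource("skt/kobest_v1", "hellaswag", "test")


def load_eval_split(source: EvalSource):
    """Download and return the split described by `source`."""
    # Imported lazily so the record above is usable without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(source.repo_id, source.config, split=source.split)
```

Calling `load_eval_split(KOBEST_HELLASWAG)` needs network access and the `datasets` package. Keeping the repository, configuration, and split in one record makes the evaluation choice explicit rather than leaving it to a default.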