Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ceval
community
https://cevalbenchmark.com
Activity Feed
Request to join this org
Follow
22
AI & ML interests
We focus on Chinese evaluation of foundation models.
Recent Activity
yuzhen17
updated
a dataset
about 3 hours ago
ceval/ceval-exam
yuzhen17
authored
a paper
about 2 months ago
Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning
jxhe
authored
a paper
about 2 months ago
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
View all activity
Team members
2
models
0
None public yet
datasets
1
ceval/ceval-exam
Viewer
•
Updated
about 3 hours ago
•
13.9k
•
17k
•
274