LLM evals and benchmark datasets - a davidberenstein1957 Collection

davidberenstein1957 's Collections

Smol but mighty

LLM evals and benchmark datasets

Synthetic Data Papers

Dataset Viber annotators

Cool and fun Spaces

Model Leaderboards

Useful datasets

Follow The Money

LLM evals and benchmark datasets

updated Jan 22