Zebra Logic Bench - an allenai Collection

Zebra Logic Bench

updated Apr 30

ZebraLogic Bench: Testing the Limits of LLMs in Logical Reasoning

Running

89

Zebra Logic Bench

🦓

Render a leaderboard for model evaluation
allenai/ZebraLogicBench

Viewer • Updated Jul 11, 2024 • 4.26k • 1.43k • 20

Note: 1k puzzles in grid format and 3k in multiple-choice QA (MCQA) format. Answers are hidden.
allenai/ZebraLogicBench-private

Viewer • Updated Jul 4, 2024 • 4.26k • 1.44k • 9

Note: Answers are included here. To prevent data leakage, please apply for access.
Faith and Fate: Limits of Transformers on Compositionality

Paper • 2305.18654 • Published May 29, 2023 • 7

Note: Our NeurIPS 2023 (spotlight) paper about the data and analysis.