
Zebra Logic Bench

updated 9 days ago

ZebraLogic Bench: Testing the Limits of LLMs in Logical Reasoning

  • Zebra Logic Bench 🦓

    Space • Running • 87

    Render a leaderboard for model evaluation


  • allenai/ZebraLogicBench

    Viewer • Updated Jul 11, 2024 • 4.26k • 475 • 15

    Note: 1k puzzles in grid format and 3k in MCQA (multiple-choice) format. Answers are hidden.


  • allenai/ZebraLogicBench-private

    Viewer • Updated Jul 4, 2024 • 4.26k • 3.1k • 9

    Note: Answers are included here. To prevent data leakage, please apply for access.


  • Faith and Fate: Limits of Transformers on Compositionality

    Paper • 2305.18654 • Published May 29, 2023 • 6

    Note: Our NeurIPS 2023 (spotlight) paper about the data and analysis.
