Open Image Preferences Collection Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9
Maths reasoning Collection Maths reasoning datasets found using https://huggingface.co/spaces/librarian-bots/huggingface-datasets-semantic-search • 14 items • Updated 7 days ago • 2
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 17 days ago • 187
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 10 days ago • 48
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated about 17 hours ago • 43
CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering Paper • 2502.01523 • Published 18 days ago • 1
ScholaWrite: A Dataset of End-to-End Scholarly Writing Process Paper • 2502.02904 • Published 16 days ago • 2
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 12 items • Updated 1 day ago • 76
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation Paper • 2412.15594 • Published Dec 20, 2024 • 1
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published 19 days ago • 23
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction Paper • 2403.02270 • Published Mar 4, 2024 • 3
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published 18 days ago • 15