Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
casey-martin 's Collections
Quality Code Annealing
Agent Trajectories
High Quality Reasoning Datasets
Subject-Matter-Expertise

Subject-Matter-Expertise

updated Apr 24

High quality pretraining and instruction datasets for law, mathematics, and science.

Upvote
-

  • pile-of-law/pile-of-law

    Updated Jan 8, 2023 • 916 • 249

  • EleutherAI/proof-pile-2

    Updated Oct 25, 2023 • 1.79k • 206

  • gabrielaltay/pubtator-central-bigbio-kb-2022-12-18

    Viewer • Updated Jan 7, 2023 • 35.1M • 85.2k

  • bigcode/the-stack-v2-train-smol-ids

    Viewer • Updated Apr 23, 2024 • 40.1M • 1.05k • 42

  • allenai/SciRIFF

    Viewer • Updated Jun 13, 2024 • 433k • 349 • 41

  • zjunlp/Mol-Instructions

    Updated Mar 3, 2024 • 873 • 58

  • AI-MO/NuminaMath-CoT

    Viewer • Updated Nov 25, 2024 • 860k • 5.29k • 490

  • AI-MO/NuminaMath-TIR

    Viewer • Updated Nov 25, 2024 • 72.5k • 2.22k • 138

  • Team-ACE/ToolACE

    Viewer • Updated Sep 4, 2024 • 11.3k • 2.06k • 137

    Note Function calling


  • NousResearch/hermes-function-calling-v1

    Viewer • Updated Aug 30, 2024 • 11.6k • 1.8k • 334

    Note Function calling


  • Salesforce/xlam-function-calling-60k

    Viewer • Updated Jan 24 • 60k • 5.34k • 508

    Note Function calling


  • trendmicro-ailab/Primus-FineWeb

    Viewer • Updated about 1 month ago • 3.39M • 88 • 15
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs