Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

RLHF4MATH

Activity Feed

AI & ML interests

None defined yet.

dasqw's profile picture

models 8

RLHF4MATH/CodeGemma-7B-it-M-DPO

Text Generation • 9B • Updated Jul 26, 2024 • 6

RLHF4MATH/Gemma-7B-it-M-DPO

Text Generation • 9B • Updated Jul 26, 2024 • 14

RLHF4MATH/Gemma-9B-it-SFT3epoch

Text Generation • 9B • Updated Jul 26, 2024 • 6

RLHF4MATH/Mistral-7B-pt-SFT2epoch

Text Generation • 7B • Updated Jul 26, 2024 • 5

RLHF4MATH/Code-Gemma-7B-it-SFT3epoch

Text Generation • 9B • Updated Jul 26, 2024 • 1.04k • 1

RLHF4MATH/Gemma-7B-it-SFT3epoch

Text Generation • 9B • Updated Jul 26, 2024 • 808

RLHF4MATH/Gemma-2-9B-it-M-DPO

Text Generation • 9B • Updated Jul 15, 2024 • 5

RLHF4MATH/Mistral-7B-pt-M-DPO

Text Generation • 7B • Updated Jul 13, 2024 • 5

datasets 6

RLHF4MATH/Gemma-7B-1.1-it-iter1-random-pairs

Viewer • Updated Jul 27, 2024 • 19k • 364 • 1

RLHF4MATH/SFT_510K

Viewer • Updated Jul 25, 2024 • 512k • 33 • 1

RLHF4MATH/prompt_iter4

Viewer • Updated Jul 25, 2024 • 20.8k • 17

RLHF4MATH/prompt_iter3

Viewer • Updated Jul 25, 2024 • 20.8k • 13

RLHF4MATH/prompt_iter2

Viewer • Updated Jul 25, 2024 • 20.8k • 17

RLHF4MATH/prompt_iter1

Viewer • Updated Jul 25, 2024 • 20.8k • 26
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs