Regularized Self-Play

Team members: Xiaohang Tang; SON, SEONG HO; Angela Yuan; Sangwoong Yoon

models 44

RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter3

Text Generation • 3B • Updated Aug 11 • 3

RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter2

Text Generation • 3B • Updated Aug 11 • 3

RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter1

Text Generation • 3B • Updated Aug 11 • 3

RegularizedSelfPlay/Gemma-2-2B-SPPO-It-Iter1

Text Generation • 3B • Updated Aug 11 • 2

RegularizedSelfPlay/Llama-3-8B-Instruct-SPPO-Iter2-gp-8b-gpm-reg0.5-sppo-reversekl-table

Text Generation • 8B • Updated Jul 30 • 3

RegularizedSelfPlay/Llama-3-8B-Instruct-SPPO-Iter2-gp-8b-gpm-reg0.05-sppo-reversekl-table

Text Generation • 8B • Updated Jul 30 • 3

RegularizedSelfPlay/Llama-3-8B-Instruct-SPPO-Iter1-gp-8b-gpm-reg0.05-sppo-reversekl-table

Text Generation • 8B • Updated Jul 30 • 2

RegularizedSelfPlay/Llama-3-8B-Instruct-SPPO-Iter2-gp-8b-gpm-reg0.1-sppo-forwardimportance10-table

Text Generation • 8B • Updated Jul 30 • 3

RegularizedSelfPlay/Llama-3-8B-Instruct-SPPO-Iter3-gp-8b-gpm-reg0.5-sppo-reversekl-table

Text Generation • 8B • Updated Jul 30 • 3

RegularizedSelfPlay/Llama-3-8B-Instruct-SPPO-Iter1-gp-8b-gpm-reg0.5-sppo-reversekl-table

Text Generation • 8B • Updated Jul 30 • 3

datasets 0

None public yet