Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Replicate
Cohere
SambaNova
Cerebras
fal
Together AI
Hyperbolic
Nebius AI Studio
Fireworks
Nscale
Novita
HF Inference API
Misc
Proximal Policy Optimization
Inference Endpoints
text-generation-inference

Misc with no match

Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

2
Full-text search
Active filters: Proximal Policy Optimization

LilHairdy/cleanrl_memory_gym

Reinforcement Learning • Updated Sep 17, 2024

estnafinema0/smolLM-variation-ppo

Text Generation • Updated Mar 30 • 9
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs