Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Fireworks
Hyperbolic
fal
Featherless AI
Novita
Cerebras
Together AI
Replicate
Cohere
SambaNova
Nebius AI Studio
Nscale
HF Inference API
Misc
rlvr
Inference Endpoints

Misc with no match

text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

6
Full-text search
Active filters: rlvr

SultanR/SmolTulu-1.7b-Reinforced-GGUF

Text Generation • Updated Dec 17, 2024 • 7 • 1

thuml/rt1-world-model-multi-step-rlvr

Updated 19 days ago • 14

thuml/rt1-world-model-single-step-rlvr

Updated 19 days ago • 13

thuml/webarena-world-model-rlvr

Updated 19 days ago • 11

thuml/bytesized32-world-model-rlvr-binary-reward

Updated 19 days ago • 10

thuml/bytesized32-world-model-rlvr-task-specific-reward

Updated 19 days ago • 25
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs