Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Replicate
Fireworks
Cerebras
Hyperbolic
Nscale
Novita
SambaNova
fal
Together AI
Nebius AI Studio
Cohere
HF Inference API
Misc
GenerativeRL
Eval Results

Misc with no match

Inference Endpoints
text-generation-inference
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

1
Full-text search
Active filters: GenerativeRL

OpenDILabCommunity/LunarLanderContinuous-v2-QGPO

Reinforcement Learning • Updated Dec 4, 2024
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs