Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Apps with no match
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Inference Providers
Inference Providers with no match
Hyperbolic
Fireworks
Cerebras
Together AI
Groq
SambaNova
Replicate
Featherless AI
Nebius AI
fal
Cohere
Nscale
Novita
HF Inference API
Misc
Reset Misc
deep-rl-class
Eval Results
Inference Endpoints
Misc with no match
text-generation-inference
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
7,251
Full-text search
Edit filters
Sort: Trending
Active filters:
deep-rl-class
Clear all
BoschAI/Reinforce-Cartpole-v1
Reinforcement Learning
•
Updated
Mar 30, 2023
ManarAli/Reinforce-pixelcopter
Reinforcement Learning
•
Updated
Mar 30, 2023
BoschAI/Reinforce-pixelcopter
Reinforcement Learning
•
Updated
Mar 30, 2023
joe-hug/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Mar 29, 2023
cfalholt/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Mar 29, 2023
feratur/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Mar 29, 2023
kenzo4433/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Mar 29, 2023
kenzo4433/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Mar 29, 2023
stelladk/Reinforce-PixelCopter-PLE-v0
Reinforcement Learning
•
Updated
Apr 19, 2023
JamesEJarvis/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Mar 29, 2023
mobiusmatt/Reinforce-CartPole-v1initial
Reinforcement Learning
•
Updated
Mar 29, 2023
JamesEJarvis/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Mar 30, 2023
adavies25/Reinforce-Cartpole-1
Reinforcement Learning
•
Updated
Mar 29, 2023
mobiusmatt/Reinforce-Pixelcopter-PLE-v0initial
Reinforcement Learning
•
Updated
Mar 29, 2023
sofiapecora/Reinforce-cartpole2
Reinforcement Learning
•
Updated
Mar 29, 2023
gf2rl/david1
Reinforcement Learning
•
Updated
Mar 29, 2023
gf2rl/david2
Reinforcement Learning
•
Updated
Mar 29, 2023
gf2rl/david3
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/david4
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/h_size_2
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/h_size_16_standard
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/h_size_100_fail
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/h_size_100_success_with_training_5000_episodes
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/max_t_50_fail
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/lr_1e-1_fail
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/lr_1e-3_not_perfect_but_not_a_complete_fail
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/gamma_0_05_fail
Reinforcement Learning
•
Updated
Mar 30, 2023
OMARS200/Cartpole-v1
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/gamma_0_5_Partial_fail
Reinforcement Learning
•
Updated
Mar 30, 2023
gf2rl/partial_observability_pole_pose_only
Reinforcement Learning
•
Updated
Mar 30, 2023
Previous
1
...
56
57
58
59
60
...
100
Next