Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Docker Model Runner
Lemonade
Inference Providers
Select all
Fireworks
Cerebras
Nebius AI
Novita
Together AI
Groq
Hyperbolic
Cohere
fal
Nscale
Featherless AI
Replicate
SambaNova
HF Inference API
Misc
Reset Misc
custom-implementation
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
25,251
Full-text search
Edit filters
Sort: Trending
Active filters:
custom-implementation
Clear all
scronberg/reinforce-pixel-copter-grad-policy
Reinforcement Learning
•
Updated
Jan 16, 2023
jrnold/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Mar 1, 2023
•
1
TheTeamBuilder/Reinforce-pixelcopter-1
Reinforcement Learning
•
Updated
Jan 16, 2023
wooihen/Reinforce-Pixelcopter-PLE-v0-TEST
Reinforcement Learning
•
Updated
Jan 16, 2023
chenmertens/Reinforce-cartpole
Reinforcement Learning
•
Updated
Jan 16, 2023
chenmertens/Reinforce-pixelcopter
Reinforcement Learning
•
Updated
Jan 16, 2023
Bingsu/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 16, 2023
Bingsu/q-Taxi-v3
Reinforcement Learning
•
Updated
Jan 16, 2023
Bingsu/q-FrozenLake-v1-8x8
Reinforcement Learning
•
Updated
Jan 16, 2023
Liapunov/Reinforce-first
Reinforcement Learning
•
Updated
Apr 7, 2023
pneubauer/basic-q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 16, 2023
pneubauer/basic-q-Taxi-v3
Reinforcement Learning
•
Updated
Jan 16, 2023
RisiPisi/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 16, 2023
babakc/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 17, 2023
RisiPisi/MyTaxi-v3
Reinforcement Learning
•
Updated
Jan 16, 2023
darthrevenge/q-Taxi-v3
Reinforcement Learning
•
Updated
Jan 16, 2023
ARandomFrenchDev/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 16, 2023
RoyDor/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 16, 2023
ARandomFrenchDev/q-Taxi-v3-500x6
Reinforcement Learning
•
Updated
Jan 16, 2023
RoyDor/q-Taxi-v3
Reinforcement Learning
•
Updated
Jan 16, 2023
mkahari/Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Jan 16, 2023
Senura/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 16, 2023
Senura/senura-taxi
Reinforcement Learning
•
Updated
Jan 16, 2023
igorcheb/REINFORCE-LunarLanderContinuous-v2
Reinforcement Learning
•
Updated
Feb 2, 2023
Ashley1902/taxi-v3-v1-16012023
Reinforcement Learning
•
Updated
Jan 16, 2023
iblub/Reinforce-PixelCopter-01
Reinforcement Learning
•
Updated
Jan 16, 2023
Liapunov/Reinforce-second
Reinforcement Learning
•
Updated
Jan 16, 2023
johnhudzinatr/Reforce-CartPole-v1
Reinforcement Learning
•
Updated
Jan 16, 2023
atorre/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Jan 16, 2023
Honza/Reinforce-PixelCopter1
Reinforcement Learning
•
Updated
Jan 16, 2023
Previous
1
...
82
83
84
85
86
...
100
Next