Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nebius AI Studio
Cohere
Fireworks
Nscale
Cerebras
Replicate
Hyperbolic
SambaNova
Together AI
Novita
fal
HF Inference API
Misc
Reset Misc
custom-implementation
Eval Results
Inference Endpoints
Misc with no match
text-generation-inference
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
23,598
Full-text search
Edit filters
Sort: Trending
Active filters:
custom-implementation
Clear all
TUMxudashuai/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Dec 8, 2022
avisubedi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Dec 8, 2022
ThePianist/u8-ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Mar 23, 2023
•
3
urechandro/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Dec 9, 2022
urechandro/q-Taxi-v3
Reinforcement Learning
•
Updated
Dec 9, 2022
hanq0212/RL_course_unit1_part1
Reinforcement Learning
•
Updated
Dec 9, 2022
hanq0212/RL_course_unit1_part2
Reinforcement Learning
•
Updated
Dec 9, 2022
alicjak/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Dec 9, 2022
alicjak/q-Taxi-v3
Reinforcement Learning
•
Updated
Dec 9, 2022
Jasmaur/FrozenLake-v1
Reinforcement Learning
•
Updated
Dec 10, 2022
314anist/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Dec 10, 2022
314anist/q-Taxi-v3
Reinforcement Learning
•
Updated
Dec 10, 2022
lithomas1/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Dec 10, 2022
lithomas1/q-Taxi-v3
Reinforcement Learning
•
Updated
Dec 10, 2022
Clawoo/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Feb 15, 2023
•
1
osanseviero/q-FrozenLake-v1-4x4-noSlippery-test
Reinforcement Learning
•
Updated
Dec 10, 2022
osanseviero/q-FrozenLake-v1-4x4-noSlippery-test2
Reinforcement Learning
•
Updated
Dec 10, 2022
osanseviero/q-FrozenLake-v1-4x4-noSlippery-test3
Reinforcement Learning
•
Updated
Dec 10, 2022
osanseviero/q-FrozenLake-v1-4x4-noSlippery-test4
Reinforcement Learning
•
Updated
Dec 10, 2022
osanseviero/q-Taxi-v3-nice
Reinforcement Learning
•
Updated
Dec 10, 2022
Artachtron/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Mar 1, 2023
JabrilJacobs/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Aug 18, 2023
•
3
kebei/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jun 27, 2023
•
1
Honza/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Mar 7, 2023
Isaacp/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Mar 3, 2023
•
2
ThomasSimonini/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Dec 12, 2022
TomBrains/Reinforce-PolicyGradient
Reinforcement Learning
•
Updated
Dec 12, 2022
osanseviero/q-FrozenLake-v1-4x4-noSlippery-wohoo
Reinforcement Learning
•
Updated
Dec 12, 2022
osanseviero/super_taxi
Reinforcement Learning
•
Updated
Dec 12, 2022
ChechkovEugene/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Apr 14, 2023
•
2
Previous
1
...
32
33
34
35
36
...
100
Next