Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Srgreen
/
ppo-LunarLander-v2-complete
like
1
Reinforcement Learning
LunarLander-v3
ppo
deep-reinforcement-learning
custom-implementation
deep-rl-course
Eval Results (legacy)
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
PPO Agent โ LunarLander-v3
Resultado
PPO Agent โ LunarLander-v3
trained by zero with PPO in CleanRL. No optmized parameters
Resultado
medium reward (10 ep):
195.99 +/- 72.84
Downloads last month
-
Downloads are not tracked for this model.
How to track
Video Preview
Reinforcement Learning
loading
Evaluation results
mean_reward
on LunarLander-v3
self-reported
195.99 +/- 72.84