PPO Agent โ€” LunarLander-v3

trained by zero with PPO in CleanRL. No optmized parameters

Resultado

  • medium reward (10 ep): 195.99 +/- 72.84
Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Evaluation results