| --- |
| library_name: stable-baselines3 |
| tags: |
| - AntBulletEnv-v0 |
| - deep-reinforcement-learning |
| - reinforcement-learning |
| - stable-baselines3 |
| model-index: |
| - name: A2C |
| results: |
| - metrics: |
| - type: mean_reward |
| value: 1218.38 +/- 203.74 |
| name: mean_reward |
| task: |
| type: reinforcement-learning |
| name: reinforcement-learning |
| dataset: |
| name: AntBulletEnv-v0 |
| type: AntBulletEnv-v0 |
| --- |
| |
| # **A2C** Agent playing **AntBulletEnv-v0** |
| This is a trained model of a **A2C** agent playing **AntBulletEnv-v0** |
| using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3). |
|
|
| ## Usage (with Stable-baselines3) |
| ## parameters |
|
|
| ```python |
| model = A2C(policy = "MlpPolicy", |
| env = env, |
| gae_lambda = 0.9, |
| gamma = 0.99, |
| learning_rate = 0.00096, |
| max_grad_norm = 0.5, |
| n_steps = 8, |
| vf_coef = 0.4, |
| ent_coef = 0.0, |
| tensorboard_log = "./tensorboard", |
| policy_kwargs=dict( |
| log_std_init=-2, ortho_init=False), |
| normalize_advantage=False, |
| use_rms_prop= True, |
| use_sde= True, |
| verbose=1) |
| ... |
| ``` |
|
|