The Wayback Machine - https://web.archive.org/web/20220323033202/https://github.com/topics/toolbox
Skip to content
#

toolbox

Here are 760 public repositories matching this topic...

stable-baselines
calerc
calerc commented Nov 23, 2020

The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:

numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0

Episode rewards do not seem to be updated in model.learn() before callback.on_step(). Depending on which callback.locals variable is used, this means that:

  • episode rewards may n
good first issue question
stable-baselines3
YannBerthelot
YannBerthelot commented Jan 18, 2022

🐛 Bug

The documentation of DQN agent (https://stable-baselines3.readthedocs.io/en/master/modules/dqn.html) specifies that log_interval parameter is "The number of timesteps before logging". However, when set to 1 (or any other value) the logging is not made at that pace but is instead made every log_interval episode (and not timesteps). In the example below this is made every 200 timesteps.

bug documentation good first issue

🚀 Geodesic is a DevOps Linux Toolbox in Docker. We use it as an interactive cloud automation shell. It's the fastest way to get up and running with a rock solid Open Source toolchain. ★ this repo! https://slack.cloudposse.com/

  • Updated Mar 22, 2022
  • Shell

Improve this page

Add a description, image, and links to the toolbox topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the toolbox topic, visit your repo's landing page and select "manage topics."

Learn more