reinforcement-learning-algorithms
Here are 659 public repositories matching this topic...
The following applies to DDPG and TD3, and possibly to other models. These libraries were installed in a virtual environment:
numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0
Episode rewards do not appear to be updated in model.learn() before callback.on_step() is called. Depending on which callback.locals variable is used, this means that:
- episode rewards may n
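The ordering issue can be illustrated with a stdlib-only sketch (this is not actual stable-baselines code; the names and numbers are hypothetical). The point is that if the training loop fires the callback before appending the just-finished episode's reward, a callback reading `episode_rewards` from `locals` always sees a list that is one episode behind:

```python
# Hypothetical reduction of the reported ordering: the loop invokes the
# callback *before* appending the finished episode's reward, so the
# callback reads a stale `episode_rewards` list.

def train(num_episodes, on_step):
    locals_dict = {"episode_rewards": []}
    seen_by_callback = []
    for ep in range(num_episodes):
        ep_reward = float(ep + 1)                 # pretend this episode earned ep+1
        on_step(locals_dict, seen_by_callback)    # callback fires first ...
        locals_dict["episode_rewards"].append(ep_reward)  # ... reward appended after
    return seen_by_callback

def on_step(locals_dict, seen):
    # What a monitoring callback would observe at this point in the loop:
    rewards = locals_dict["episode_rewards"]
    seen.append(rewards[-1] if rewards else None)

print(train(3, on_step))  # [None, 1.0, 2.0] -- always one episode behind
```

If this matches the real loop order, a workaround is to read the reward one step later (or compute it from the raw step rewards instead of `episode_rewards`).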
gtrxl
How do I run GTrXL with a PPO policy? Can someone provide an example?
There seem to be some fragile parts of our code that could fail easily. I suggest adding more unit tests for the following:
- Custom agents (there are only VPG and PPO on CartPole-v0 as of now; we should preferably add more to cover discrete off-policy, continuous off-policy, and continuous on-policy)
- Evaluation for the Bandits and Classical agents
- Testing of convergence of agents as proposed i
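As a sketch of what a convergence test for the Bandits agents could look like: the `EpsGreedyAgent` below is a stdlib-only stand-in (not the repository's actual class), trained on a synthetic Bernoulli bandit and then asserted to have identified the best arm.

```python
# Hypothetical convergence unit test for a bandit agent.
# EpsGreedyAgent is a minimal stand-in, not the repo's real implementation.
import random

class EpsGreedyAgent:
    def __init__(self, n_arms, eps=0.1):
        self.eps = eps
        self.counts = [0] * n_arms
        self.values = [0.0] * n_arms

    def select(self):
        # Explore with probability eps, otherwise pick the current best arm.
        if random.random() < self.eps:
            return random.randrange(len(self.values))
        return max(range(len(self.values)), key=lambda a: self.values[a])

    def update(self, arm, reward):
        # Incremental mean update of the arm's value estimate.
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]

def test_bandit_convergence():
    random.seed(0)
    true_means = [0.1, 0.5, 0.9]          # arm 2 is the best arm
    agent = EpsGreedyAgent(n_arms=3)
    for _ in range(5000):
        arm = agent.select()
        reward = 1.0 if random.random() < true_means[arm] else 0.0
        agent.update(arm, reward)
    # After training, the agent's value estimates should rank arm 2 highest.
    assert max(range(3), key=lambda a: agent.values[a]) == 2

test_bandit_convergence()
```

A real test would instead import the repository's agent and environment, but the structure (seed, train, assert on the learned policy) would be the same.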
The documentation of the DQN agent (https://stable-baselines3.readthedocs.io/en/master/modules/dqn.html) specifies that the log_interval parameter is "The number of timesteps before logging". However, when it is set to 1 (or any other value), logging does not happen at that pace; it instead happens every log_interval episodes (not timesteps). In the example below, this occurs every 200 timesteps.
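The mismatch can be shown with a stdlib-only simulation (hypothetical numbers: fixed 200-step episodes, as in the report). Under the documented meaning, log_interval=1 would log every timestep; under the observed behaviour, logging happens every log_interval episodes, i.e. every 200 timesteps here:

```python
# Simulation of the observed episode-based logging (not SB3 code).
# Assumes fixed-length episodes of 200 timesteps, as in the report above.
EPISODE_LEN = 200
log_interval = 1

def observed_log_timesteps(total_timesteps):
    """Timesteps at which logging actually fires: every `log_interval` episodes."""
    logs = []
    episodes_done = 0
    for t in range(1, total_timesteps + 1):
        if t % EPISODE_LEN == 0:              # an episode just ended
            episodes_done += 1
            if episodes_done % log_interval == 0:
                logs.append(t)
    return logs

print(observed_log_timesteps(600))  # [200, 400, 600] -- every 200 timesteps
```

So either the docstring should say "episodes" rather than "timesteps", or the logging condition should be keyed on timesteps.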