🐛 Bug
The documentation of the DQN agent (https://stable-baselines3.readthedocs.io/en/master/modules/dqn.html) describes the log_interval parameter as "The number of timesteps before logging". However, when it is set to 1 (or any other value), logging does not happen at that pace; instead it happens every log_interval episodes (not timesteps). In the example below, logging happens every 200 timesteps.
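A minimal script along those lines might look like the following (a sketch; the environment, total timesteps, and hyperparameters are assumptions, not the original report's setup):

```python
import gym
from stable_baselines3 import DQN

env = gym.make("CartPole-v1")
model = DQN("MlpPolicy", env, verbose=1)

# The docs describe log_interval as a number of timesteps, but logging is
# actually triggered every `log_interval` episodes, so with log_interval=1
# a log line appears once per episode rather than once per timestep.
model.learn(total_timesteps=10_000, log_interval=1)
```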
There seem to be some fragile areas in our code that could fail easily. I suggest adding more unit tests for the following (a rough test skeleton is sketched after this list):
- Custom agents (there are only VPG and PPO on CartPole-v0 as of now; we should preferably add more to cover discrete off-policy, continuous off-policy, and continuous on-policy agents)
- Evaluation for the Bandits and Classical agents
- Testing of convergence of agents as proposed in …
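A rough skeleton of such a convergence test might look like this (the agent below is a hypothetical placeholder, not one of the library's classes; the real test would construct one of the discrete/continuous, on-/off-policy agents instead):

```python
import gym
import numpy as np


class PlaceholderAgent:
    """Stand-in for a real agent; swap in e.g. a continuous off-policy agent."""

    def __init__(self, env):
        self.env = env

    def train(self, timesteps):
        pass  # a real agent would learn here

    def select_action(self, obs):
        return self.env.action_space.sample()


def mean_episode_reward(agent, env, episodes=10):
    """Roll out the agent for a few episodes and average the returns."""
    totals = []
    for _ in range(episodes):
        obs, done, total = env.reset(), False, 0.0
        while not done:
            obs, reward, done, _ = env.step(agent.select_action(obs))
            total += reward
        totals.append(total)
    return float(np.mean(totals))


def test_agent_converges_on_cartpole():
    env = gym.make("CartPole-v0")
    agent = PlaceholderAgent(env)  # replace with a library agent
    agent.train(timesteps=20_000)
    # A trained agent should clear a loose reward threshold on CartPole-v0.
    assert mean_episode_reward(agent, env) > 150
```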
A great repo and paper: https://github.com/golsun/deep-RL-trading
This could be useful for FinRL, perhaps as a helper / environment function. Training first on rather simple, idealized synthetic prices before feeding in real data might help teach the agent the "basics". It is also great for testing. A small generator sketch follows the list:
- Sine wave
- Trend curves
- Random walk
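A minimal sketch of such generators (the function names and parameters are illustrative, not part of FinRL):

```python
import numpy as np


def sine_wave(n=1000, period=100, amplitude=10.0, base=100.0):
    """Idealized cyclic price series."""
    t = np.arange(n)
    return base + amplitude * np.sin(2 * np.pi * t / period)


def trend_curve(n=1000, slope=0.05, base=100.0):
    """Simple linear drift."""
    return base + slope * np.arange(n)


def random_walk(n=1000, sigma=1.0, base=100.0, seed=0):
    """Gaussian random walk around a starting price."""
    rng = np.random.default_rng(seed)
    return base + np.cumsum(rng.normal(0.0, sigma, size=n))


# Example: start training on the sine wave, then move to harder series.
prices = sine_wave()
```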


This applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:
numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0
Episode rewards do not seem to be updated in model.learn() before callback.on_step() is called. Depending on which callback.locals variable is used, this means that:
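For reference, a minimal callback that inspects callback.locals during learn() might look like this (a sketch; which reward-related keys appear in locals depends on the algorithm and version, so the key names below are probes, not a confirmed API):

```python
from stable_baselines import TD3
from stable_baselines.common.callbacks import BaseCallback


class InspectLocalsCallback(BaseCallback):
    def _on_step(self) -> bool:
        # Print whatever reward-related entries this algorithm exposes.
        for key in ("episode_rewards", "episode_reward", "reward"):
            if key in self.locals:
                print(self.num_timesteps, key, self.locals[key])
        return True


model = TD3("MlpPolicy", "Pendulum-v0", verbose=0)
model.learn(total_timesteps=1000, callback=InspectLocalsCallback())
```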