openai

The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:

numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0

Episode rewards do not seem to be updated in model.learn() before callback.on_step(). Depending on which callback.locals variable is used, this means that:

episode rewards may n

Oct	NOV	Dec
	16
2020	2021	2022

openai

Here are 319 public repositories matching this topic...

hill-a / stable-baselines

minimaxir / gpt-2-simple

DLR-RM / stable-baselines3

huggingface / pytorch-openai-transformer-lm

explosion / spacy-transformers

MorvanZhou / Evolutionary-Algorithm

araffin / rl-baselines-zoo

nicrusso7 / rex-gym

uvipen / Super-mario-bros-PPO-pytorch

sberbank-ai / ru-dalle

germain-hug / Deep-RL-Keras

DLR-RM / rl-baselines3-zoo

navneet-nmk / pytorch-rl

tom-doerr / zsh_codex

SforAiDl / genrl

rish-16 / gpt2client

kengz / openai_lab

semiosis / pen.el

wil3 / gymfc

mpSchrader / gym-sokoban

maraoz / gpt-scrolls

araffin / learning-to-drive-in-5-minutes

akanyaani / gpt-2-tensorflow2.0

tom-doerr / codex-readme

guillitte / pytorch-sentiment-neuron

Stable-Baselines-Team / stable-baselines

afiaka87 / clip-guided-diffusion

gabrielhuang / reptile-pytorch

robotology / gym-ignition

upb-lea / gym-electric-motor

Improve this page

Add this topic to your repo