openai
Here are 265 public repositories matching this topic...
I'll post this as a question, as I am not quite sure it is a bug. I have been experimenting for a while with the library in a custom environment for a school project, and I am particularly interested in the reproducibility of the results. I have read the disclaimer in the documentation stating that reproducible results are not guaranteed across multiple platforms or different versions of PyTorch. Ho…
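The excerpt cuts off before the actual question, but the usual starting point for reproducibility is seeding every random number generator the experiment touches. A minimal sketch using only the standard library (the PyTorch- and Gym-specific calls, which are not shown in the excerpt, are noted in comments):

```python
import random

def seed_everything(seed: int) -> None:
    """Seed the RNGs an experiment depends on. Only the stdlib generator
    is seeded here; for a PyTorch + Gym setup you would also call
    numpy.random.seed(seed), torch.manual_seed(seed), and env.seed(seed)."""
    random.seed(seed)
    # With PyTorch, full determinism additionally requires (at a speed cost):
    # torch.backends.cudnn.deterministic = True
    # torch.backends.cudnn.benchmark = False

seed_everything(42)
a = [random.random() for _ in range(3)]
seed_everything(42)
b = [random.random() for _ in range(3)]
assert a == b  # same seed, same platform -> same draws
```

Even with all seeds fixed, results can still differ across platforms or CUDA/cuDNN versions, which is exactly what the documentation's disclaimer warns about.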
There seem to be some weak spots in our code that could fail easily. I suggest adding more unit tests for the following:
- Custom agents (there are only VPG and PPO on CartPole-v0 as of now; we should preferably add more to cover discrete off-policy, continuous off-policy, and continuous on-policy agents)
- Evaluation for the Bandits and Classical agents
- Testing of convergence of agents, as proposed i…
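The repository's actual agent API is not shown in the excerpt, so the convergence-test pattern can only be sketched with a stand-in. `DummyAgent` below is hypothetical; a real test would instantiate one of the library's agents (e.g. PPO on CartPole-v0) in its place:

```python
# Sketch of a convergence smoke test. `DummyAgent` is a stand-in whose
# mean reward improves each epoch, so the testing pattern runs on its own;
# a real suite would train an actual agent here.

class DummyAgent:
    """Hypothetical agent: mean reward rises by 10 per training epoch."""
    def __init__(self):
        self.mean_reward = 0.0

    def train_epoch(self):
        self.mean_reward += 10.0  # a real agent would run rollouts and updates
        return self.mean_reward

def test_agent_converges(agent_cls=DummyAgent, epochs=20, threshold=150.0):
    agent = agent_cls()
    rewards = [agent.train_epoch() for _ in range(epochs)]
    # Convergence check: the average over the last few epochs must clear the bar.
    assert sum(rewards[-5:]) / 5 >= threshold, "agent failed to converge"

test_agent_converges()
```

Asserting on a trailing-window average rather than the final epoch makes the test less sensitive to single noisy episodes, which matters once real (stochastic) agents are plugged in.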
This is more or less similar to what we were doing a long time ago with pybullet (#110, #149).
In a few words, after #346 the pure-Python gym-ignition package will depend on scenario. When gym-ignition gets installed on a system (currently only Ubuntu is supported), the wheel of scenario will be installed first. Since the wheel of scenario is not self-contained (i.e. it needs to find in t…
This applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:
numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0
Episode rewards do not seem to be updated in model.learn() before callback.on_step(). Depending on which callback.locals variable is used, this means that:
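The report is truncated before its conclusions, but the pattern of inspecting training state through a callback can be sketched. Stable-baselines 2 also accepts a plain function `callback(_locals, _globals)` called from inside `model.learn()`; the `episode_rewards` key is inferred from the issue's context and may differ per algorithm, and the final calls here feed a synthetic dict standing in for what `model.learn()` would pass:

```python
logged = []

def reward_logging_callback(_locals, _globals):
    """Functional-style stable-baselines callback. Because it runs inside
    model.learn(), any lag in updating episode rewards shows up directly
    in the values read here."""
    rewards = _locals.get("episode_rewards")
    if rewards:
        logged.append(rewards[-1])  # most recent (possibly stale) episode reward
    return True  # returning False would stop training

# Real usage (requires stable-baselines; not run here):
# model.learn(total_timesteps=10000, callback=reward_logging_callback)

# Synthetic call, standing in for one step of model.learn():
assert reward_logging_callback({"episode_rewards": [0.0, 1.5]}, {}) is True
assert logged == [1.5]
```

If the rewards list really is updated only after `callback.on_step()`, a callback like this would consistently see values one episode behind, which is the kind of off-by-one the report appears to be describing.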