reinforcement-learning

Description

In many other great docs sites, like https://www.tensorflow.org/api_docs, there's a button at the end of the page to collect simple feedback.

This will help us more accurately improve our docs

Use cas

Continuation of issue #2474 as discussed here

Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?

🐛 Bug

The documentation of DQN agent (https://stable-baselines3.readthedocs.io/en/master/modules/dqn.html) specifies that log_interval parameter is "The number of timesteps before logging". However, when set to 1 (or any other value) the logging is not made at that pace but is instead made every log_interval episode (and not timesteps). In the example below this is made every 200 timesteps.

May	JUN	Jul
	06
2021	2022	2023

reinforcement-learning

Here are 8,237 public repositories matching this topic...

ray-project / ray

[Docs] Add "Was this helpful?" button to our docs pages

Description

Use cas

[AIR] Incorrect function names in `TorchTrainer` and `TensorflowTrainer` examples

[Docs] [Serve] Have a consistent landing page style

eugeneyan / applied-ml

Unity-Technologies / ml-agents

tensorflow / tensor2tensor

ShangtongZhang / reinforcement-learning-an-introduction

ddbourgin / numpy-ml

kmario23 / deep-learning-drizzle

bulletphysics / bullet3

labmlai / annotated_deep_learning_paper_implementations

Hvass-Labs / TensorFlow-Tutorials

VowpalWabbit / vowpal_wabbit

get_weight_from_name python wrapper to work with chain hash

Allow multiple data files as input

deepmind / pysc2

MorvanZhou / Reinforcement-learning-with-tensorflow

tensorlayer / TensorLayer

aws / amazon-sagemaker-examples

google / trax

Bidirectional RNN

MorvanZhou / PyTorch-Tutorial

owainlewis / awesome-artificial-intelligence

lazyprogrammer / machine_learning_examples

tensorpack / tensorpack

keras-rl / keras-rl

yandexdataschool / Practical_RL

jason718 / awesome-self-supervised-learning

datawhalechina / easy-rl

BinRoot / TensorFlow-Book

janhuenermann / neurojs

udacity / deep-reinforcement-learning

wandb / client

arXivTimes / arXivTimes

DLR-RM / stable-baselines3

[Bug] Tensorboard logging not logging every log_interval timesteps

🐛 Bug

Improve this page

Add this topic to your repo