The Wayback Machine - https://web.archive.org/web/20200413195708/https://github.com/topics/reinforcement-learning
Skip to content
#

reinforcement-learning

Here are 4,412 public repositories matching this topic...

brett-daley
brett-daley commented Feb 28, 2020

Description

I'm running a simple test on IWSLT'15 English-Vietnamese:

't2t-trainer --problem=translate_envi_iwslt32k --model=transformer --hparams_set=transformer_base_single_gpu --data_dir=data/ --output_dir=output/

It works but I get different results each time -- even when running on CPU only. I guess there are some random seeds that need to be set (see #485) but t2t-trainer does

tensorlayer
0xtyls
0xtyls commented Jan 3, 2020

I understand that these two python files show two different methods to construct a model. The original n_epoch is 500 which works perfect for both python files. But if I change n_epoch to 20, only tutorial_mnist_mlp_static.py can achieve a high test accuracy (~0.97). The other file tutorial_mnist_mlp_static_2.py only get 0.47.

The models built from these two files looks the same for me (the s

BlackTentacle
BlackTentacle commented Oct 26, 2018

I tried some RNN regression learning based on the code in the "PyTorch-Tutorial/tutorial-contents/403_RNN_regressor.py" file, which did not work for me at all.

According to an accepted answer on stack-overflow (https://stackoverflow.com/questions/52857213/recurrent-network-rnn-wont-learn-a-very-simple-function-plots-shown-in-the-q?noredirect=1#comment92916825_52857213), it turns out that the li

trax
farhanhubble
farhanhubble commented Apr 2, 2020

Lately running into too many Sagemaker issues. Is there any unambiguous documentation on Sagemakers Instances? I could glean the following from different sources:

  1. Sagemaker Instances, Sagemaker being a managed service, have nothing to do with EC2 instances.
  2. Unlike EC2 console, Sagemaker console has no option to view limits or increase limits. One has to go directly to the support page a
icoxfog417
icoxfog417 commented Sep 3, 2019

一言でいうと

AutoMLアルゴリズムとrandom searchを比較した研究。学習したPolicyとrandom samplingとで条件を揃えて比較(randomは複数シードを取り、最終的なモデルは同epoch数学習)。結果randomを大きく超えるものはなかった。また、Weight Shareをすると探索結果が悪くなるという重要な示唆。

論文リンク

https://arxiv.org/abs/1902.08142

著者/所属機関

Christian Sciuto, Kaicheng Yu, Martin Jaggi, Claudiu Musat, Mathieu Salzmann

  • AI Lab, Swisscom
  • CV Lab, EPFL
  • MLO Lab, EPFL

投稿日付(yyyy/MM/d

Improve this page

Add a description, image, and links to the reinforcement-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the reinforcement-learning topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.