language-model
Here are 796 public repositories matching this topic...
-
Updated
Oct 22, 2020
-
Updated
Nov 22, 2021 - Python
-
Updated
Nov 22, 2021 - Rust
chooses 15% of token
From paper, it mentioned
Instead, the training data generator chooses 15% of tokens at random, e.g., in the sentence my
dog is hairy it chooses hairy.
It means that 15% of token will be choose for sure.
From https://github.com/codertimo/BERT-pytorch/blob/master/bert_pytorch/dataset/dataset.py#L68,
for every single token, it has 15% of chance that go though the followup procedure.
PositionalEmbedding
-
Updated
Nov 22, 2021 - Python
While running the tutorials is not rare to meet with UserWarnings that are caused by underlying dependencies like transformers or pytorch. I think UserWarnings that are triggered by Haystack's or the user's code should stay visible, but those coming from dependencies could be hidden, as there's nothing we or the final users can do about it.
Examples:
- Tutorial 1: `/home/sara/work/hayst
-
Updated
Nov 11, 2021 - Python
-
Updated
Nov 23, 2021 - Python
-
Updated
Jun 19, 2021 - Python
-
Updated
Nov 12, 2021
-
Updated
Nov 11, 2020 - Python
-
Updated
Apr 23, 2021 - Python
-
Updated
May 7, 2020 - Python
-
Updated
Nov 23, 2021 - Python
-
Updated
May 11, 2021 - Python
-
Updated
Aug 9, 2021 - Python
-
Updated
Nov 22, 2021 - Python
Issue to track tutorial requests:
- Deep Learning with PyTorch: A 60 Minute Blitz - #69
- Sentence Classification - #79
-
Updated
Oct 28, 2021 - Python
-
Updated
Aug 5, 2020
-
Updated
Nov 4, 2021 - Python
-
Updated
Nov 13, 2021 - Go
-
Updated
Jan 1, 2019 - Python
-
Updated
Aug 18, 2021 - Python
-
Updated
Oct 29, 2020 - Python
-
Updated
Dec 14, 2020 - Python
-
Updated
Nov 19, 2021 - Python
-
Updated
Oct 28, 2021 - Jupyter Notebook
-
Updated
Oct 23, 2021 - TeX
Improve this page
Add a description, image, and links to the language-model topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the language-model topic, visit your repo's landing page and select "manage topics."


Currently, the
EncoderDecoderModelclass in PyTorch automatically creates thedecoder_input_idsbased on thelabelsprovided by the user (similar to how this is done for T5/BART). This should also be implemented forTFEncoderDecoderModel, because currently users should manually providedecoder_input_idsto the model.One can take a look at the TF implementation