language-model

🚀 Feature request

Currently, the EncoderDecoderModel class in PyTorch automatically creates the decoder_input_ids based on the labels provided by the user (similar to how this is done for T5/BART). This should also be implemented for TFEncoderDecoderModel, because currently users should manually provide decoder_input_ids to the model.

One can take a look at the TF implementation

From paper, it mentioned

Instead, the training data generator chooses 15% of tokens at random, e.g., in the sentence my
dog is hairy it chooses hairy.

It means that 15% of token will be choose for sure.

From https://github.com/codertimo/BERT-pytorch/blob/master/bert_pytorch/dataset/dataset.py#L68,
for every single token, it has 15% of chance that go though the followup procedure.

While running the tutorials is not rare to meet with UserWarnings that are caused by underlying dependencies like transformers or pytorch. I think UserWarnings that are triggered by Haystack's or the user's code should stay visible, but those coming from dependencies could be hidden, as there's nothing we or the final users can do about it.

Examples:

Tutorial 1: `/home/sara/work/hayst

Issue to track tutorial requests:

Deep Learning with PyTorch: A 60 Minute Blitz - #69
Sentence Classification - #79

Oct	NOV	Dec
	23
2020	2021	2022

language-model

Here are 796 public repositories matching this topic...

huggingface / transformers

🚀 Feature request

brightmart / nlp_chinese_corpus

EleutherAI / gpt-neo

huggingface / tokenizers

codertimo / BERT-pytorch

speechbrain / speechbrain

deepset-ai / haystack

CLUEbenchmark / CLUE

tensorflow / lingvo

CyberZHG / keras-bert

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

chiphuyen / lazynlp

Separius / awesome-sentence-embedding

salesforce / awd-lstm-lm

PaddlePaddle / PaddleSpeech

NVIDIA / OpenSeq2Seq

huggingface / pytorch-openai-transformer-lm

EleutherAI / gpt-neox

prabhuomkar / pytorch-cpp

explosion / spacy-transformers

mihail911 / nlp-library

ymcui / Chinese-ELECTRA

nlpodyssey / spago

brightmart / bert_language_understanding

pykaldi / pykaldi

LiyuanLucasLiu / LM-LSTM-CRF

smilelight / lightNLP

microsoft / DeBERTa

SKTBrain / KoBERT

IsaacChanghau / DL-NLP-Readings

Improve this page

Add this topic to your repo