attention-mechanism
Here are 490 public repositories matching this topic...
from nltk.translate.bleu_score import corpus_bleu

# One list of reference captions per hypothesis
assert len(references) == len(hypotheses)
# Calculate BLEU-4 score (default weights are uniform over 1- to 4-grams)
bleu4 = corpus_bleu(references, hypotheses)
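For reference, NLTK's corpus_bleu expects each entry of references to be a list of tokenized reference captions for the corresponding hypothesis. A minimal example with made-up captions:

from nltk.translate.bleu_score import corpus_bleu

# One hypothesis with two tokenized reference captions
references = [[['a', 'cat', 'sits', 'on', 'the', 'mat'],
               ['a', 'cat', 'is', 'on', 'the', 'mat']]]
hypotheses = [['a', 'cat', 'sits', 'on', 'the', 'mat']]
print(corpus_bleu(references, hypotheses))  # 1.0 for an exact match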
GPU Memory Benchmark
I did a few training runs of a simple Reformer module with different parameters and logged the GPU memory usage.
These values will of course vary with your hardware and setup, but they may be useful as a rough guide:
dim = 512, seq_len = 256, depth = 1, heads = 1, batch_size = 1: 452 MB
dim = 512, seq_len = 256, depth = 1, heads = 1, batch_size = 8: 992 MB
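For anyone who wants to reproduce numbers like these, here is a minimal sketch using PyTorch's peak-memory counters. It assumes the reformer-pytorch package (constructor arguments may differ between versions), and note that max_memory_allocated only counts tensors held by the caching allocator, so nvidia-smi will report a larger total:

import torch
from reformer_pytorch import Reformer  # assumes lucidrains/reformer-pytorch

def peak_memory_mb(dim=512, seq_len=256, depth=1, heads=1, batch_size=1):
    model = Reformer(dim=dim, depth=depth, heads=heads, causal=True).cuda()
    x = torch.randn(batch_size, seq_len, dim).cuda()
    torch.cuda.reset_peak_memory_stats()
    model(x).sum().backward()  # include backward-pass activations
    return torch.cuda.max_memory_allocated() / 1024 ** 2

print(peak_memory_mb(batch_size=1))  # ~452 MB in the runs above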
The official documentation for subclasses of torch.autograd.Function notes that 'Each function object is meant to be used only once (in the forward pass).'
In the SpGraphAttentionLayer model, the SpecialSpmmFunction object (self.special_spmm) is used twice: once for e_rowsum and once for h_prime.
Is that the correct usage for a subclass of torch.autograd.Function?
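For what it's worth, my understanding is that the "used only once" note targets the legacy style where a Function instance is called directly; with the static-method style, every .apply() call creates a fresh context object, so invoking the same Function class from two call sites is safe. A sketch with a hypothetical dense stand-in for SpecialSpmmFunction:

import torch

class SpMM(torch.autograd.Function):
    # Hypothetical stand-in for SpecialSpmmFunction
    @staticmethod
    def forward(ctx, a, b):
        ctx.save_for_backward(a, b)
        return torch.mm(a, b)

    @staticmethod
    def backward(ctx, grad_out):
        a, b = ctx.saved_tensors
        return grad_out.mm(b.t()), a.t().mm(grad_out)

a = torch.randn(4, 3, requires_grad=True)
b = torch.randn(3, 2, requires_grad=True)
# Two separate .apply() calls -> two independent function objects in the
# graph, mirroring the e_rowsum and h_prime call sites
out = SpMM.apply(a, b).sum() + SpMM.apply(a, b).sum()
out.backward()  # gradients flow through both applications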
Need help with retraining and cross-validation to check whether the ROUGE scores match (or beat) the numbers reported in the paper.
I trained for 500k iterations (batch size 8) with pointer generation enabled and coverage loss disabled, then for another 100k iterations (batch size 8) with both pointer generation and coverage loss enabled.
It would be great if someone could help re-run the training.
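In case it helps whoever re-runs this, the coverage loss being toggled in the second phase is the one from See et al. (2017), "Get To The Point". A minimal sketch of one decoder step (tensor names are mine, not the repo's):

import torch

def coverage_loss_step(attn, coverage):
    # attn:     (batch, src_len) attention distribution at step t
    # coverage: (batch, src_len) sum of attention over steps < t
    # covloss_t = sum_i min(a_i^t, c_i^t), penalizing repeated attention
    step_loss = torch.sum(torch.min(attn, coverage), dim=1)
    return step_loss, coverage + attn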


I don't understand the conditional statement in the inference loop; it always evaluates to false.
if tf.reduce_sum(y_hat, 1) == self.token2idx["<pad>"]: break
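My guess at the cause, hedged since I haven't traced the repo: in TF1 graph mode, Tensor.__eq__ is not overloaded, so tensor == scalar falls back to plain Python object comparison and is always False at graph-construction time. A sketch of an element-wise check that actually produces a boolean tensor:

import tensorflow as tf  # TF1.x-style graph mode assumed

y_hat = tf.constant([[3, 0], [0, 0]])
pad_id = 0  # stand-in for self.token2idx["<pad>"]
# Check whether every prediction row sums to the pad id
all_pad = tf.reduce_all(tf.equal(tf.reduce_sum(y_hat, 1), pad_id))
# all_pad is a Tensor; in graph mode it must be evaluated in a session
# (or the loop rewritten with tf.while_loop) rather than a Python `if`.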