encoder-decoder attention?
Dear Author,
Thank you very much for the visualization tool. It is super helpful.
We now need to visualize the encoder-decoder attention rather than the self-attention. With the following code, for instance, we can only visualize the self-attention. It would be great if you could give some clue on how to visualize the encoder-decoder attention of BART.
```python
from bertviz import model_view, head_view
from transformers import BartForConditionalGeneration, BartTokenizer

model = BartForConditionalGeneration.from_pretrained(
    "facebook/bart-large", force_bos_token_to_be_generated=True
)
tokenizer = BartTokenizer.from_pretrained("facebook/bart-large")

sentence_a = "The cat sat on the mat"
sentence_b = "The cat lay on the rug"
inputs = tokenizer.encode_plus(
    sentence_a, sentence_b, return_tensors="pt", add_special_tokens=True
)
input_ids = inputs["input_ids"]

# This only picks up self-attention weights, which is why the views
# below show self-attention rather than encoder-decoder attention.
attention = model(input_ids, output_attentions=True)[2]

input_id_list = input_ids[0].tolist()  # batch index 0
tokens = tokenizer.convert_ids_to_tokens(input_id_list)
model_view(attention, tokens)
head_view(attention, tokens)
```
Thank you! Best, Shirley
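The question above boils down to where the encoder-decoder weights live in the model output. For reference, a minimal sketch (not from the original issue; variable names are illustrative) of how all three attention types can be read off a BART forward pass by name in recent transformers releases:

```python
from transformers import BartForConditionalGeneration, BartTokenizer

# Illustrative setup; mirrors the question above but without the
# version-specific force_bos_token_to_be_generated flag.
model = BartForConditionalGeneration.from_pretrained("facebook/bart-large")
tokenizer = BartTokenizer.from_pretrained("facebook/bart-large")

encoder_ids = tokenizer("The cat sat on the mat", return_tensors="pt").input_ids
decoder_ids = tokenizer("The cat lay on the rug", return_tensors="pt").input_ids

outputs = model(
    input_ids=encoder_ids,
    decoder_input_ids=decoder_ids,  # feed the target side explicitly
    output_attentions=True,
    return_dict=True,
)

# Each attribute is a tuple of per-layer tensors shaped
# (batch, heads, query_len, key_len).
encoder_attention = outputs.encoder_attentions  # encoder self-attention
decoder_attention = outputs.decoder_attentions  # decoder self-attention
cross_attention = outputs.cross_attentions      # encoder-decoder attention
```

Passing decoder_input_ids explicitly makes the decoder attend to a chosen target sequence, which is what an encoder-decoder attention view needs.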
Issue Analytics
- Created 3 years ago
- Comments: 5 (3 by maintainers)
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hi @xwuShirley, I’ve started a branch to visualize encoder-decoder attention: https://github.com/jessevig/bertviz/tree/encoder-decoder
I’ve added the encoder_decoder.ipynb notebook. Please let me know if that works for you. I’ll also work on adding support for the model view and pushing a new version to PyPI soon.
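For reference, a sketch of how the pieces from the earlier snippet (tokenizer, encoder_ids, decoder_ids, outputs) might be passed to head_view. The keyword arguments are an assumption based on the encoder-decoder support bertviz later shipped; the encoder_decoder.ipynb notebook in the branch is the authoritative example.

```python
# Continues from the earlier sketch; the head_view keyword arguments below
# are an assumption, not confirmed against the branch itself.
from bertviz import head_view

encoder_tokens = tokenizer.convert_ids_to_tokens(encoder_ids[0])
decoder_tokens = tokenizer.convert_ids_to_tokens(decoder_ids[0])

head_view(
    encoder_attention=outputs.encoder_attentions,  # encoder self-attention
    decoder_attention=outputs.decoder_attentions,  # decoder self-attention
    cross_attention=outputs.cross_attentions,      # encoder-decoder attention
    encoder_tokens=encoder_tokens,
    decoder_tokens=decoder_tokens,
)
```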
Super helpful!! Thanks!