A single-notebook tutorial with a step-by-step implementation of the transformer architecture. Fork of sainathadapa/attention-primer-pytorch.

Hi, I'm Aleksandrs Koselevs.

  • This is a fork of a great PyTorch implementation at sainathadapa/attention-primer-pytorch
  • Which is itself a fork of a great tutorial at greentfrapp/attention-primer
  • attention_primer.ipynb unifies the 5 lessons into a single notebook
  • My notes appear in italics, prefixed with A:
  • All training code has been removed from the notebook; it only runs inference
  • You don't need to train anything, although the descriptions may still contain references to training or to flags such as --parameter=False
  • If you want to train something, the task folders contain the training code
  • The implementations in the notebook might diverge from those in the task folders
  • Most images are from Vaswani et al. (2017)
  • The example code in the descriptions in the notebook might diverge from the code in the Implementation sections
  • batch_size defaults to 1 to simplify things
  • The model implementations live in all_models.py, which is copied into the notebook (a minimal sketch of the attention operation they build up to appears after this list)
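For orientation, here is a minimal sketch of the scaled dot-product attention from Vaswani et al. (2017) that the lessons build up to. It is illustrative only, not copied from all_models.py; the function name and tensor shapes are my own choices, and it uses batch_size = 1 as the notebook does.

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V  (Vaswani et al., 2017)
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5  # (batch, seq_q, seq_k)
    weights = F.softmax(scores, dim=-1)            # each query's weights sum to 1 over the keys
    return weights @ v                             # (batch, seq_q, d_v)

# Toy example with batch_size = 1 (hypothetical shapes, chosen for illustration)
q = torch.randn(1, 4, 8)  # 4 query positions, dimension 8
k = torch.randn(1, 6, 8)  # 6 key positions
v = torch.randn(1, 6, 8)
out = scaled_dot_product_attention(q, k, v)
print(out.shape)  # torch.Size([1, 4, 8])
```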

References

Vaswani, Ashish, et al. "Attention is all you need." Advances in Neural Information Processing Systems. 2017.
