Keras implementations of multi-headed attention mechanisms

Snarik/Megatron

Megatron

Megatron is a package that supplements Keras by adding Transformer and multi-headed attention mechanisms.

Note: Megatron is likely to be deprecated if Keras/TensorFlow adds a native attention mechanism.
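
For readers unfamiliar with the mechanism the package targets, the following is a minimal pure-Python sketch of multi-headed scaled dot-product attention. It is not Megatron's API: the function names (`softmax`, `attention`, `multi_head_attention`) and the sample input are hypothetical, chosen only for illustration. As a simplification, the learned input and output projections that a real Keras layer would carry are omitted, so each head just attends over its own slice of the feature dimension.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention for a single head.

    Each argument is a list of equal-length vectors (lists of floats).
    """
    scale = math.sqrt(len(keys[0]))
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(key width).
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return outputs


def multi_head_attention(queries, keys, values, num_heads):
    """Split the feature dimension into heads, attend per head, concatenate.

    Assumption: queries, keys, and values share the same feature width.
    Simplification: learned projections are omitted (hypothetical sketch).
    """
    width = len(queries[0])
    if width % num_heads != 0:
        raise ValueError("feature width must be divisible by num_heads")
    head_width = width // num_heads
    outputs = [[] for _ in queries]
    for h in range(num_heads):
        lo, hi = h * head_width, (h + 1) * head_width
        head_out = attention(
            [q[lo:hi] for q in queries],
            [k[lo:hi] for k in keys],
            [v[lo:hi] for v in values],
        )
        # Concatenate this head's output onto each position's result.
        for row, part in zip(outputs, head_out):
            row.extend(part)
    return outputs


# Self-attention over three positions with four features and two heads.
x = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0], [1.0, 1.0, 0.0, 0.0]]
out = multi_head_attention(x, x, x, num_heads=2)
```

The output has the same shape as the input: one vector per position, with the per-head results concatenated back to the original feature width.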
