
separate between token-lengths and sub-token lengths #2990

Merged

Conversation

helpmefindaname
Collaborator

This PR should speed up the transformer word embeddings by a tiny bit (and also save a little GPU memory), as the output embeddings now have token-length instead of being expanded to sub-token length.

I haven't tested how big the speed impact actually is; measuring it remains a further todo.

Besides this, this PR fixes the ONNX export (#2964, #2930) by enforcing the correct order of inputs.
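To illustrate the idea behind the change: a transformer tokenizer splits words into sub-tokens, so the model output has sub-token length; instead of expanding word embeddings to that length, one can select a single representative sub-token (e.g. the first) per word, yielding token-length output. The sketch below is a minimal illustration of that reduction, not flair's actual implementation; `word_ids` is a hypothetical per-sub-token word index such as a Hugging Face fast tokenizer would provide.

```python
import torch

def first_subtoken_embeddings(subtoken_embs: torch.Tensor, word_ids: list) -> torch.Tensor:
    # subtoken_embs: (num_subtokens, hidden_size) transformer output
    # word_ids: for each sub-token, the index of the word it belongs to,
    #           e.g. [0, 0, 1, 2, 2, 2] (hypothetical tokenizer output)
    # Keep only the first sub-token position of each word, so the result
    # has token-length rather than sub-token length.
    first_positions = []
    seen = set()
    for pos, wid in enumerate(word_ids):
        if wid not in seen:
            seen.add(wid)
            first_positions.append(pos)
    index = torch.tensor(first_positions)
    return subtoken_embs.index_select(0, index)

embs = torch.randn(6, 8)            # 6 sub-tokens, hidden size 8
word_ids = [0, 0, 1, 2, 2, 2]       # 3 words
token_embs = first_subtoken_embeddings(embs, word_ids)
print(token_embs.shape)             # torch.Size([3, 8])
```

Returning a `(num_words, hidden_size)` tensor instead of a `(num_subtokens, hidden_size)` one is what saves the extra memory and the per-sub-token copies the PR description refers to.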

@alanakbik
Collaborator

Thanks @helpmefindaname - I'll merge this now and fix the test in the next PR.

@alanakbik alanakbik merged commit 4788db1 into flairNLP:master Nov 20, 2022
@helpmefindaname helpmefindaname deleted the faster_transformer_word_embeddings branch November 28, 2022 10:45