Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UMT5 & ByT5 Support #1974

Open
SoshyHayami opened this issue Jul 25, 2024 · 0 comments
Open

UMT5 & ByT5 Support #1974

SoshyHayami opened this issue Jul 25, 2024 · 0 comments

Comments

@SoshyHayami
Copy link

SoshyHayami commented Jul 25, 2024

Feature request

Adding support to ByT5 & UMT5, two popular variants of the T5 Seq2Seq models, would be great.

Motivation

ByT5 is an essential model that outperforms all other variants of T5 for grammar correction and most importantly the Grapheme-to-Phoneme conversion which is the core part of most Text-to-Speech models, I cannot emphasis enough how import latency is in this field.

As for UMT5, it's the most recent variant of T5 and it seem to be the SOTA when it comes to this architecture. unfortunately the latency is a bit high using when we use these models, especially since their smallest models are 300M which is still quite large.

Your contribution

I'm afraind not, It's a bit beyond my current skills.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant