Missing gpt2 encoder #8

Open
ceifa opened this issue Mar 16, 2023 · 1 comment
Labels
question Further information is requested

Comments

ceifa commented Mar 16, 2023

No description provided.

@ceifa ceifa closed this as completed Mar 21, 2023
@ceifa ceifa reopened this Mar 21, 2023
@zurawiki zurawiki added the question Further information is requested label Mar 23, 2023
@zurawiki
Owner

Hi, according to the docs, the r50k_base tokenizer should work for gpt2. Unfortunately, the gpt2 files that are checked in use a different format, so I haven't had the time to properly export them in the library.
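
Until a dedicated gpt2 encoder is exported, one possible workaround on the caller's side is to treat `gpt2` as an alias for `r50k_base`. The snippet below is only a sketch of that idea: `ENCODING_ALIASES` and `resolve_encoding` are hypothetical names made up for illustration and are not part of this library's API, and the mapping itself rests on the statement above that r50k_base should work for gpt2.

```python
# Hypothetical alias table (an assumption, not library code): per the
# comment above, "gpt2" is assumed to be served by the "r50k_base" tokenizer.
ENCODING_ALIASES = {"gpt2": "r50k_base"}


def resolve_encoding(name: str) -> str:
    """Return the encoding name to load, following the alias table.

    Names are normalized (trimmed, lowercased); names without an alias
    are returned unchanged after normalization.
    """
    key = name.strip().lower()
    return ENCODING_ALIASES.get(key, key)


if __name__ == "__main__":
    print(resolve_encoding("gpt2"))
    print(resolve_encoding("r50k_base"))
```

The resolved name can then be handed to whatever loader the library actually exposes for `r50k_base`; only the name lookup is shown here.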

@zurawiki zurawiki added the bitbuilder:create Assigns BitBuilder to create a Pull Request for this issue. label Jul 25, 2023
@zurawiki zurawiki removed the bitbuilder:create Assigns BitBuilder to create a Pull Request for this issue. label Oct 22, 2023