Add vocab feature #124

chainyo · 2023-06-29T16:28:28Z

Add a simple way for the user to add some extra vocab in the payload. These words will be concatenated into a single string and provided to the model during inference.
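
A minimal sketch of the concatenation step described above, for illustration only. The function name `build_vocab_prompt`, the comma separator, and the optional `prefix` argument are assumptions and are not taken from this PR's diff.

```python
def build_vocab_prompt(vocab: list[str], prefix: str = "") -> str:
    """Join extra vocab terms into a single prompt string (hypothetical helper)."""
    # Strip stray whitespace and drop empty entries before joining.
    terms = [term.strip() for term in vocab if term.strip()]
    # The comma separator is an assumption; the PR only says "a single string".
    return prefix + ", ".join(terms)


vocab = ["GitHub", "Python", "open-source", "README.md", "Visual Studio Code"]
print(build_vocab_prompt(vocab))
# GitHub, Python, open-source, README.md, Visual Studio Code
```

The resulting string is what would be handed to the model at inference time; how it is passed depends on the model API and is not shown here.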

aleksandr-smechov

Did you test this with a few keywords? How were the results?

chainyo · 2023-06-29T17:38:26Z

> Did you test this with a few keywords? How were the results?

For example, on this YouTube video: https://youtu.be/v2X51AVgl3o

I added some vocab: ["GitHub", "Python", "open-source", "README.md", "Visual Studio Code"]

Here are the results with the vocab on (+) and off (-):

+ Right now, open-source contributions are being used as the new resume.
- Right now, open source contributions are being used as the new resume.

+ In this video, we will be discussing what is open-source contributions and how do you actually do that.
- In this video, we will be discussing what is open source contributions and how do you actually do that.

+ The next place where you can find these projects is GitHub.
- The next place where you can find these projects is GitHub.

+ For example, if you're really good at Python programming language and want to contribute.
- For example, if you are really good at Python programming language and want to contribute.

+ Now open this folder in your visual studio code and open the readme.md file.
- Now open this folder in your Visual Studio code and open the readme.md file.

It's not perfect: for example, neither README.md nor VSCode was handled correctly.

We could add an extra post-processing step.

aleksandr-smechov · 2023-06-29T18:13:46Z

And do a fuzzy match? That could go wrong in unforeseen ways. What if you try prepending the terms with "Make sure these words are spelled correctly: "?

chainyo · 2023-06-30T08:35:07Z

> And do a fuzzy match? That could go wrong in unforeseen ways. What if you try prepending the terms with "Make sure these words are spelled correctly: "?

It changes strictly nothing on the test sample I use.

aleksandr-smechov · 2023-06-30T10:37:21Z

Ok, let's stick to the initial method of splitting by comma, since I believe the prompt is limited to a certain number of tokens.
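
To illustrate the token-limit concern, here is a rough sketch that keeps vocab terms only while the comma-joined prompt stays within a budget. The helper name `fit_vocab_to_budget`, the budget value, and the whitespace-based token count are all assumptions; a real implementation would count tokens with the model's own tokenizer, and the actual limit is not stated in this thread.

```python
from typing import Callable, Optional


def fit_vocab_to_budget(
    vocab: list[str],
    max_tokens: int,
    count_tokens: Optional[Callable[[str], int]] = None,
) -> list[str]:
    """Keep terms, in order, until the joined prompt would exceed max_tokens."""
    if count_tokens is None:
        # Crude stand-in: count whitespace-separated words, not real model tokens.
        count_tokens = lambda s: len(s.split())
    kept: list[str] = []
    for term in vocab:
        candidate = ", ".join(kept + [term])
        if count_tokens(candidate) > max_tokens:
            break  # adding this term would overflow the budget
        kept.append(term)
    return kept


vocab = ["GitHub", "Python", "open-source", "README.md", "Visual Studio Code"]
print(fit_vocab_to_budget(vocab, max_tokens=4))
# ['GitHub', 'Python', 'open-source', 'README.md']
```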

aleksandr-smechov · 2023-06-30T11:12:44Z

There may be a simpler way to post-process than fuzzy matching: just look for exact matches of the custom vocab words in the output, after lowercasing them (and then replacing symbols with spaces). For example:

VS Code in the custom vocab dictionary becomes vs code, and Open-Source becomes open-source. You can then lowercase the output and search it for these keys, replacing each match with the original vocab item; lowercasing does not alter the number of characters, so match positions carry over to the original output. As a second pass, replace symbols with spaces in the vocab keys, so Open-Source becomes open source, which you can also search for in the lowercased output without altering the overall character length.

wdyt?
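
The exact-match idea above could be sketched roughly as follows. The names `vocab_keys` and `restore_vocab` are hypothetical, and the word-boundary check is an added assumption to avoid matching inside longer words. The sketch relies on lowercasing and symbol-to-space replacement preserving string length, which holds for ASCII but not for every Unicode character, so it bails out when lengths differ.

```python
import re


def vocab_keys(term: str) -> set[str]:
    """Lowercase search keys for a vocab term: as-is and with symbols as spaces."""
    lowered = term.lower()
    # Replace each symbol (not a word character, not whitespace) with one space.
    spaced = re.sub(r"[^\w\s]", " ", lowered)
    return {lowered, spaced}


def restore_vocab(text: str, vocab: list[str]) -> str:
    """Overwrite exact lowercase matches with the original vocab spelling."""
    lowered = text.lower()
    if len(lowered) != len(text):
        return text  # offsets would not line up; leave the output untouched
    chars = list(text)
    for term in vocab:
        for key in vocab_keys(term):
            if len(key) != len(term):
                continue  # same-length replacement is what keeps offsets valid
            pattern = r"(?<!\w)" + re.escape(key) + r"(?!\w)"
            for match in re.finditer(pattern, lowered):
                # Same length on both sides, so later offsets stay correct.
                chars[match.start():match.end()] = term
    return "".join(chars)


vocab = ["GitHub", "Python", "open-source", "README.md", "Visual Studio Code"]
line = "Now open this folder in your Visual Studio code and open the readme.md file."
print(restore_vocab(line, vocab))
# Now open this folder in your Visual Studio Code and open the README.md file.
```

Because every replacement has the same length as the text it overwrites, the output keeps its character count, which matters if character offsets are tied to timestamps elsewhere in the pipeline.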

…hub.com/Wordcab/wordcab-transcribe into 123-use-prompting-for-custom-vocabulary

* add multi gpus handling for transcription
* add model index for transcription and diarization
* add gpu_index for alignment models
* fix diarization gpu indexing
* multi gpu setup, with transcription errors
* Updated error payload for svix in cortex endpoint
* Extra languages performed poorly, commenting out tests that require "he" for this param
* update the download_audio function to avoid extension
* add audio_duration key in repsonse + fix dual_channel bug
* fix tests and endpoint returns
* Add a catch for empty audio (#128)
* add a catch for empty audio file
* rename utterances -> response for coherence
* fix quality
* Add vocab feature (#124)
* add vocab feature
* fix youtube endpoint
* update the prompt sentence
* add vocab feature
* fix youtube endpoint
* update the prompt sentence
* Upgraded base docker image to nvidia/cuda:11.7.1-cudnn8-runtime-ubuntu20.04
* add multi gpus handling for transcription
* add model index for transcription and diarization
* add gpu_index for alignment models
* fix diarization gpu indexing
* multi gpu setup, with transcription errors
* lower batch_size
* fix alignment device index
* revert transcribe service to no mapping
* update gpu service queue manager
* fix Exception returns for endpoints
* fixed dual_channel
* fix flake and darglint
* run black linter
* fix nemo config tests
* fix typo

---------

Co-authored-by: Aleks <aleks@wordcab.com>
Co-authored-by: Aleksandr Smechov <35517862+aleksandr-smechov@users.noreply.github.com>

add vocab feature

1bb0fac

chainyo added the api and transcription labels Jun 29, 2023

chainyo requested a review from aleksandr-smechov June 29, 2023 16:28

chainyo linked an issue Jun 29, 2023 that may be closed by this pull request

Use prompting for custom vocabulary #123

Closed

aleksandr-smechov reviewed Jun 29, 2023


fix youtube endpoint

b0b73a8

aleksandr-smechov added the enhancement label Jun 29, 2023

update the prompt sentence

41da156

chainyo mentioned this pull request Jun 30, 2023

Add audio_duration key in API response #127

Merged

Thomas Chaigneau added 9 commits June 30, 2023 17:46

Merge branch 'main' into 123-use-prompting-for-custom-vocabulary

75e669b

add vocab feature

5f4a239

fix youtube endpoint

3c54b0c

update the prompt sentence

fd34ce8

Merge branch '123-use-prompting-for-custom-vocabulary' of https://github.com/Wordcab/wordcab-transcribe into 123-use-prompting-for-custom-vocabulary

4526dad

Delete call_wordcab.py

2343f36

Delete inference_multiple_files.py

6e96b11

Delete conc.py

c559065

Delete nginx.conf

e84ed45

aleksandr-smechov approved these changes Jun 30, 2023


chainyo merged commit 7fe0a7b into main Jun 30, 2023

chainyo deleted the 123-use-prompting-for-custom-vocabulary branch June 30, 2023 15:55
