Upcoming in next release! (this week) #46

Vaibhavs10 · 2023-11-14T18:48:00Z

Speaker Diarization with Pyannote 🤯
Fast CPU support 💻
Streaming ⚡

thomasmol · 2023-11-15T07:18:22Z

Awesome! I have a pyannote implementation here https://github.com/thomasmol/replicate-whisper-diarization if you want to take a look.
Also really curious how streaming would work!

omarsiddiqi224 · 2023-11-15T21:00:55Z

There is this resource as well to help with diarization: https://github.com/MahmoudAshraf97/whisper-diarization

zeke · 2023-11-17T05:36:16Z

Looking forward to diarization 🙏🏼

JuergenFleiss · 2023-11-22T07:09:32Z

Hi, is testing CPU possible already? Would be greatly interested to compare it to faster-whisper. Also, what would the limitations be? Batching should work? Flash attention? Your speeds sound really promising.

souvikqb · 2023-11-22T07:11:08Z

Speaker Diarization with Pyannote 🤯

Fast CPU support 💻

Streaming ⚡

Hey 👋 any update on these releases @Vaibhavs10 ? They would open up a lot of possibilities

acul3 · 2023-11-27T11:43:03Z

Looking forward to this

Btw @Vaibhavs10 have you consider adding vad(voice activity detection) to the pipeline

By my testing..VAD reduce hallucination espscially with audio lot of silence and noise

Thanks

BBC-Esq · 2023-11-29T15:01:24Z

Might I suggest using nemo toolkit instead? It seems to avoid pyannote's requirement of using a huggingface key or what not to access their model. omarsiddiqi224 is the one who posted a link to a repository that relies on it instead of pyannote.

bluusun · 2023-11-29T21:53:24Z

How can the speaker diarization be used? Where does it show? Thanks for adding this!

TomExMachina · 2023-11-30T10:52:11Z

Does anyone have a streaming script or snippet they can share ahead of the release? If you do I will help iterate on it.

souvikqb · 2023-11-30T12:39:47Z

Does anyone have a streaming script or snippet they can share ahead of the release? If you do I will help iterate on it.

I had originally asked this question on Distill Whisper, here's a potential script - huggingface/distil-whisper#4 (comment)

Link to my issue - huggingface/distil-whisper#41 (comment)

Vaibhavs10 · 2023-11-30T15:25:34Z

Heu @souvikqb @TomExMachina - Re: Streaming: A community member made this: https://gist.github.com/Oceanswave/32da596e8bb10c928f6c69c889c3c130 (It works quite well)

Vaibhavs10 · 2023-11-30T15:27:42Z

Hey @bluusun - Currently, the API is a bit spaghetti, however, if you pass a parameter --hf_token <HF token> it should automatically diarise.

@kadirnar recently made a PR to make this more clear #83 (we'll make a release tomorrow or on saturday along with some more goodies 🤞 )

Vaibhavs10 · 2023-11-30T15:32:27Z

@BBC-Esq - I'm opening a new issue to discuss this #85, I think adding support for Nvidia NeMo might make sense and give people the option to choose different backends too.

Vaibhavs10 · 2023-11-30T17:38:36Z

(Closing this issue since the release already happened; we need another patch (to fix the current API) before considering the next steps.)

Tortoise17 · 2023-11-30T17:41:16Z

@Vaibhavs10 CPU usage is now possible with the new release?

Tortoise17 · 2023-11-30T17:44:03Z

@Vaibhavs10 Fast CPU support was also mentioned to make available in this release in addition to diarization and streaming

Vaibhavs10 · 2023-11-30T17:45:22Z

Let's discuss that in a seperate issue. (I'll open one)

This was referenced Nov 14, 2023

Support for CPU Mode #36

Closed

Can whisper support fast audio transcribing in real time #35

Closed

patrick91 mentioned this issue Nov 15, 2023

how to install? #8

Closed

Vaibhavs10 mentioned this issue Nov 30, 2023

[Discussion] Speaker diarisation options #85

Open

Vaibhavs10 closed this as completed Nov 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upcoming in next release! (this week) #46

Upcoming in next release! (this week) #46

Vaibhavs10 commented Nov 14, 2023

thomasmol commented Nov 15, 2023

omarsiddiqi224 commented Nov 15, 2023

zeke commented Nov 17, 2023

JuergenFleiss commented Nov 22, 2023

souvikqb commented Nov 22, 2023 •

edited

Loading

acul3 commented Nov 27, 2023

BBC-Esq commented Nov 29, 2023

bluusun commented Nov 29, 2023

TomExMachina commented Nov 30, 2023

souvikqb commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Tortoise17 commented Nov 30, 2023

Tortoise17 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Upcoming in next release! (this week) #46

Upcoming in next release! (this week) #46

Comments

Vaibhavs10 commented Nov 14, 2023

thomasmol commented Nov 15, 2023

omarsiddiqi224 commented Nov 15, 2023

zeke commented Nov 17, 2023

JuergenFleiss commented Nov 22, 2023

souvikqb commented Nov 22, 2023 • edited Loading

acul3 commented Nov 27, 2023

BBC-Esq commented Nov 29, 2023

bluusun commented Nov 29, 2023

TomExMachina commented Nov 30, 2023

souvikqb commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

Tortoise17 commented Nov 30, 2023

Tortoise17 commented Nov 30, 2023

Vaibhavs10 commented Nov 30, 2023

souvikqb commented Nov 22, 2023 •

edited

Loading