
Investigate on a better gpu usage #72

Merged 18 commits into main on Jun 1, 2023
Conversation

@chainyo (Contributor) commented on Jun 1, 2023

This PR improves transcription throughput by implementing a batched request process.

  • Add README instructions for profiling the container
  • Fix Exception/Error returns through the API -> raised errors are now more transparent to the user
  • VAD now uses the ONNX runtime via the faster-whisper implementation
  • Transcription is now batched, with a current batch size of 32 (uses less than 13 GB of VRAM) -> TODO: make the batch size configurable based on available GPU memory
  • word_timestamps is disabled for now, until it is fully implemented for the batched transcription process
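The TODO in the last transcription bullet could be addressed with a simple linear heuristic: the PR reports that a batch of 32 fits in under 13 GB of VRAM, so that ratio can serve as a baseline for scaling the batch size to other GPUs. This is an illustrative sketch, not the project's actual API; the function name and defaults are hypothetical.

```python
def pick_batch_size(free_vram_gb: float,
                    base_batch: int = 32,
                    base_vram_gb: float = 13.0) -> int:
    """Scale the transcription batch size linearly with available VRAM.

    Baseline taken from this PR: a batch of 32 fits in under 13 GB.
    All names and defaults here are illustrative assumptions.
    """
    scaled = int(base_batch * free_vram_gb / base_vram_gb)
    # Never return a batch size below 1, even on tiny GPUs.
    return max(1, scaled)


print(pick_batch_size(13.0))  # baseline GPU: 32
print(pick_batch_size(24.0))  # larger GPU allows bigger batches
print(pick_batch_size(6.0))   # smaller GPU falls back to smaller batches
```

In practice the free-VRAM figure would come from the CUDA runtime (e.g. querying device memory at startup) rather than being hard-coded, and a small safety margin below the theoretical maximum is usually advisable.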

@chainyo added the api (Everything related to the API implementation) and transcription (Everything related to the transcription part) labels on Jun 1, 2023
@chainyo chainyo self-assigned this Jun 1, 2023
@chainyo chainyo linked an issue Jun 1, 2023 that may be closed by this pull request
@chainyo chainyo merged commit 12ec398 into main Jun 1, 2023
@chainyo chainyo deleted the 6-investigate-on-a-better-gpu-usage branch June 1, 2023 17:51
@chainyo chainyo mentioned this pull request Jun 7, 2023
Successfully merging this pull request may close this issue:

Investigate on a better GPU usage