This repository has been archived by the owner on Jul 22, 2024. It is now read-only.

Audio preprocessing #76

Open
predestination opened this issue Sep 17, 2019 · 2 comments

Comments

@predestination

Hey, what audio pre-processing steps can be used to improve transcript quality? Is there a Python library for denoising or audio enhancement that does not use deep learning (deep learning takes a lot of time even for a small audio clip)?

@tonanhngo
Contributor

Hi, if you expect most of your input to be noisy or unusual in some way (such as speaker accent or background noise), it is better to train a custom acoustic model on that type of audio. IBM Debater uses this approach and was able to reduce the error rate to ~5%. If you only have a few audio clips and want to do noise reduction, I did a quick search and saw a few options:

@predestination
Author

Thank you for the reply. I tried noisereduce and logmmse earlier, but they didn't improve the transcript quality. I will check scipy.signal.
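For reference, here is a minimal sketch of what a non-deep-learning approach with scipy.signal might look like: a Butterworth band-pass filter that keeps a typical speech band and attenuates low-frequency hum and high-frequency hiss. The function name `bandpass_speech` and the 300-3400 Hz cutoffs are assumptions chosen for illustration, not something suggested in this thread, and there is no guarantee that filtering like this improves transcript quality.

```python
import numpy as np
from scipy import signal


def bandpass_speech(samples, sample_rate, low_hz=300.0, high_hz=3400.0, order=4):
    """Band-pass filter a mono signal (hypothetical helper, assumed cutoffs).

    high_hz must be below sample_rate / 2 (the Nyquist frequency).
    """
    # Design the filter as second-order sections for numerical stability.
    sos = signal.butter(
        order, [low_hz, high_hz], btype="bandpass", fs=sample_rate, output="sos"
    )
    # Forward-backward filtering gives zero phase shift.
    return signal.sosfiltfilt(sos, samples)


# Synthetic check: a 1 kHz tone (inside the band) plus 50 Hz hum (outside it).
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
noisy = np.sin(2 * np.pi * 1000 * t) + np.sin(2 * np.pi * 50 * t)
filtered = bandpass_speech(noisy, sample_rate)
```

For a real recording, the samples could be loaded with `scipy.io.wavfile.read` and converted to floating point before filtering. The cutoffs would likely need tuning for the audio in question.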
