VTLN across genders #6

Open
EmreOzkose opened this issue Sep 15, 2021 · 3 comments

Comments

@EmreOzkose

Hi!

I am trying to extract MFCC+VTLN features for a dataset, but I don't fully understand the pipeline. When I run mfcc_vtln.py, I get a warp factor for each speaker, which can then be used to normalize the dataset. However, if the dataset has no speaker IDs but does have female/male labels, can we do VTLN across genders? In that setup there would be only two speaker IDs, female and male. Do you think this would work?
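To make the setup concrete, here is a minimal sketch of how the gender labels could stand in for speaker IDs when building the utterance index. The three-column layout `<utt-id> <wav-path> <speaker>` is an assumption about what the script reads, and `gender_utterances` and the example paths are hypothetical names used only for illustration.

```python
from pathlib import Path


def gender_utterances(wav_to_gender):
    """Build index lines that use the gender label as a pseudo speaker ID.

    wav_to_gender: dict mapping a wav path to "female" or "male".
    Returns lines of the form "<utt-id> <wav-path> <speaker>"
    (this layout is an assumption, adapt it to the real index format).
    """
    lines = []
    for wav, gender in sorted(wav_to_gender.items()):
        utt_id = Path(wav).stem  # file name without extension as utterance ID
        lines.append(f"{utt_id} {wav} {gender}")
    return lines


# Hypothetical example data
example = {
    "data/a01.wav": "female",
    "data/b07.wav": "male",
    "data/a02.wav": "female",
}
index = gender_utterances(example)
```

With this index, every female utterance shares one warp factor and every male utterance shares another.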

@mmmaat mmmaat changed the title vtln VTLN across genders Sep 15, 2021
@mmmaat
Collaborator

mmmaat commented Sep 15, 2021

Hi!
Indeed, you cannot use VTLN without speaker information... We never tried VTLN by gender, but I guess it can make sense.
Another option (which works only if your dataset has long enough wavs) is to compute one VTLN coefficient per audio sample. In an experiment (not yet published...) we showed that 2 minutes of audio per speaker is enough for the VTLN coefficients to converge.
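As a rough sketch of the per-sample option, each sufficiently long recording can be given its own pseudo speaker ID, and shorter ones set aside. The 120-second threshold is the "2 minutes" figure mentioned above; `per_file_speakers`, the `spk_` naming, and the example durations are hypothetical.

```python
MIN_SECONDS = 120.0  # "2 minutes" from the comment above, not otherwise validated


def per_file_speakers(durations, min_seconds=MIN_SECONDS):
    """Assign one pseudo speaker ID per recording that is long enough.

    durations: dict mapping a wav path to its duration in seconds.
    Returns (mapping, skipped): mapping goes from wav path to pseudo
    speaker ID, skipped lists the recordings shorter than min_seconds.
    """
    mapping, skipped = {}, []
    for wav, seconds in sorted(durations.items()):
        if seconds >= min_seconds:
            mapping[wav] = f"spk_{len(mapping):04d}"
        else:
            skipped.append(wav)
    return mapping, skipped


# Hypothetical durations in seconds
mapping, skipped = per_file_speakers({"a.wav": 300.0, "b.wav": 45.0, "c.wav": 120.0})
```

The skipped recordings would still need a warp from somewhere else (or no warping at all), which is a design decision this sketch leaves open.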

@EmreOzkose
Author

I think computing one VTLN coefficient per audio sample may take too long (for example, with 100 h of training data). Another issue is inference: if we have only one sample at test time, the warp factor will be 1.0.

@EmreOzkose
Author

I want to note my observations here. I separated the dataset into 2 speakers as I said and ran mfcc_vtln.py. When I limit the duration to 10 min and 20 min, the warp factors become 1.0 for each speaker. I am now increasing the duration limit.
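For reference, the duration limit can also be applied by hand before running the script, which makes it easy to check exactly how much audio each pseudo speaker ends up with. This is only a sketch, independent of whatever duration option the script itself provides; `limit_per_speaker` and the example utterances are hypothetical.

```python
def limit_per_speaker(utterances, max_seconds):
    """Select utterances so that no speaker exceeds max_seconds of audio.

    utterances: list of (utt_id, speaker, seconds) tuples, in selection order.
    An utterance that would push its speaker over the budget is skipped,
    and later (shorter) utterances may still be accepted.
    Returns (kept_ids, seconds_per_speaker).
    """
    total = {}
    kept = []
    for utt_id, speaker, seconds in utterances:
        used = total.get(speaker, 0.0)
        if used + seconds <= max_seconds:
            total[speaker] = used + seconds
            kept.append(utt_id)
    return kept, total


# Hypothetical utterances with a 10-minute (600 s) budget per pseudo speaker
utts = [
    ("u1", "female", 400),
    ("u2", "male", 500),
    ("u3", "female", 300),
    ("u4", "female", 100),
    ("u5", "male", 200),
]
kept, total = limit_per_speaker(utts, max_seconds=600)
```

Printing `total` alongside the resulting warp factors shows how the factors change as the budget grows.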
