The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `NeMo`.

JINHXu/tutorial-speaker-identification-with-nemo

TUTORIAL: Speaker Identification with NeMo

A quick, accessible walkthrough of speaker identification on your own data with NVIDIA's NeMo

  1. record data
  2. data prep
  3. config
  4. fine tune
  5. inference
  6. evaluation

Record your own data for the speakers you want to identify

  • preprocess the recordings into the intended format
  • slice them into audio samples of around 4 seconds each

Recommended tool: pydub
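
The slicing step could look roughly like the sketch below. It assumes mono 16 kHz WAV as the target format and drops a trailing remainder shorter than 2 seconds; both choices, as well as the helper names `chunk_bounds` and `slice_recording`, are illustrative assumptions rather than part of this tutorial.

```python
from pathlib import Path


def chunk_bounds(total_ms: int, chunk_ms: int = 4000, min_ms: int = 2000) -> list:
    """Return (start, end) millisecond pairs that cover a recording.

    A trailing remainder shorter than min_ms is dropped (assumed policy).
    """
    bounds = []
    for start in range(0, total_ms, chunk_ms):
        end = min(start + chunk_ms, total_ms)
        if end - start >= min_ms:
            bounds.append((start, end))
    return bounds


def slice_recording(src: str, out_dir: str, chunk_ms: int = 4000) -> list:
    """Cut one recording into ~4 s WAV chunks and return the written paths."""
    from pydub import AudioSegment  # third-party: pip install pydub

    # Assumed target format: mono, 16 kHz.
    audio = AudioSegment.from_file(src).set_channels(1).set_frame_rate(16000)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    paths = []
    # len(audio) is the duration in milliseconds; slicing is also in ms.
    for i, (start, end) in enumerate(chunk_bounds(len(audio), chunk_ms)):
        path = out / f"{Path(src).stem}_{i:04d}.wav"
        audio[start:end].export(str(path), format="wav")
        paths.append(str(path))
    return paths
```

Writing each speaker's chunks into a folder named after that speaker, for example `slice_recording("raw/alice.m4a", "data/alice")`, keeps the speaker label recoverable from the path later on.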

Data Prep (generate train, dev, test manifests)
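
A minimal sketch of manifest generation, assuming the sliced clips live under `data_root/<speaker>/<clip>.wav` and that each manifest is a JSON-lines file with `audio_filepath`, `duration`, and `label` fields. The directory layout, the split fractions, and all function names here are assumptions for illustration.

```python
import json
import random
import wave
from pathlib import Path


def wav_duration(path) -> float:
    """Duration of a WAV file in seconds."""
    with wave.open(str(path), "rb") as w:
        return w.getnframes() / w.getframerate()


def collect_entries(data_root: str) -> list:
    """One manifest entry per clip; the parent folder name is the speaker label."""
    entries = []
    for wav in sorted(Path(data_root).glob("*/*.wav")):
        entries.append({
            "audio_filepath": str(wav.resolve()),
            "duration": round(wav_duration(wav), 3),
            "label": wav.parent.name,
        })
    return entries


def split_entries(entries, dev_frac=0.1, test_frac=0.1, seed=0) -> dict:
    """Split per speaker so each speaker appears in dev and test."""
    rng = random.Random(seed)
    by_label = {}
    for entry in entries:
        by_label.setdefault(entry["label"], []).append(entry)

    splits = {"train": [], "dev": [], "test": []}
    for label in sorted(by_label):
        items = by_label[label]
        rng.shuffle(items)
        n_dev = max(1, int(len(items) * dev_frac))
        n_test = max(1, int(len(items) * test_frac))
        splits["dev"] += items[:n_dev]
        splits["test"] += items[n_dev:n_dev + n_test]
        splits["train"] += items[n_dev + n_test:]
    return splits


def write_manifest(entries, path) -> None:
    """Write entries as one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for entry in entries:
            f.write(json.dumps(entry) + "\n")
```

Because at least one clip per speaker goes to dev and one to test, a speaker with fewer than three clips would end up with no training data, so it is worth checking clip counts before splitting.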

Configuration
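
As a rough illustration, the settings that usually need to change are the manifest paths, the number of output classes (one per speaker), and the number of epochs. The sketch below builds Hydra-style `key=value` override strings from a JSON-lines train manifest with a `label` field. The key names (`model.train_ds.manifest_filepath`, `model.validation_ds.manifest_filepath`, `model.decoder.num_classes`, `trainer.max_epochs`) are assumed from NeMo's speaker recognition example configs and may differ between NeMo versions; the helper names are hypothetical.

```python
import json


def count_speakers(manifest_path: str) -> int:
    """Number of distinct 'label' values in a JSON-lines manifest."""
    with open(manifest_path, encoding="utf-8") as f:
        return len({json.loads(line)["label"] for line in f if line.strip()})


def to_overrides(cfg: dict, prefix: str = "") -> list:
    """Flatten a nested dict into 'a.b.c=value' strings."""
    out = []
    for key, value in cfg.items():
        name = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            out.extend(to_overrides(value, name))
        else:
            out.append(f"{name}={value}")
    return out


def build_overrides(train_manifest: str, dev_manifest: str, max_epochs: int = 5) -> list:
    """Assumed key names; check them against the config file you actually use."""
    return to_overrides({
        "model": {
            "train_ds": {"manifest_filepath": train_manifest},
            "validation_ds": {"manifest_filepath": dev_manifest},
            # The classifier needs one output per speaker in the training data.
            "decoder": {"num_classes": count_speakers(train_manifest)},
        },
        "trainer": {"max_epochs": max_epochs},
    })
```

The resulting strings can be appended to a Hydra-based training command or applied by hand to the YAML config.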

Fine-tune SpeakerNet on data from the speakers you want to identify

Inference
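
How the per-speaker scores are obtained from the fine-tuned model depends on the NeMo version, so that part is left out here. Assuming you already have one logit per speaker for a clip, and a label list in the same order as the model's output classes, the final decision step could be sketched as follows. The function name `predict_speaker` is hypothetical.

```python
import math


def predict_speaker(logits, labels):
    """Return (label, probability) for the highest-scoring speaker.

    logits: one raw score per speaker for a single clip.
    labels: speaker names in the same order as the model's output classes.
    """
    if len(logits) != len(labels):
        raise ValueError("one logit per label is required")
    if not logits:
        raise ValueError("at least one speaker is required")

    # Numerically stable softmax: subtract the maximum before exponentiating.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)

    best = max(range(len(exps)), key=exps.__getitem__)
    return labels[best], exps[best] / total
```

The returned probability can serve as a simple confidence value, for instance to flag clips where no enrolled speaker scores clearly above the others.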

(Evaluation)
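
One simple way to evaluate, assuming you have the true speaker label of each test clip and the label predicted for it, is overall accuracy plus per-speaker recall. The sketch below uses only the standard library; the function names are illustrative.

```python
from collections import Counter


def accuracy(true_labels, predicted_labels) -> float:
    """Fraction of clips whose predicted speaker matches the true speaker."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("no clips to evaluate")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


def per_speaker_recall(true_labels, predicted_labels) -> dict:
    """For each speaker, the share of that speaker's clips identified correctly."""
    totals = Counter(true_labels)
    hits = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    return {speaker: hits[speaker] / totals[speaker] for speaker in totals}
```

Per-speaker recall is worth reporting alongside accuracy because a model can score well overall while consistently missing a speaker with few test clips.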