A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement

A minimum unofficial implementation of the A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement (CRN) using PyTorch.

ToDo

Real-time version
Update trainer
Visualization of the spectrogram and the metrics (PESQ, STOI, SI-SDR) in the training
More docs

Usage

Training:

python train.py -C config/train/baseline_model.json5

Inference:

python inference.py \
    -C config/inference/basic.json5 \
    -cp ~/Experiments/CRN/baseline_model/checkpoints/latest_model.tar \
    -dist ./enhanced

Check out the README of Wave-U-Net for SE to learn more.

Performance

PESQ, STOI, SI-SDR on DEMAND - Voice Bank test dataset, for reference only:

Experiment	PESQ	SI-SDR	STOI
Noisy	1.979	8.511	0.9258
CRN	2.528	17.71	0.9325
CRN signal approximation	2.606	17.84	0.9382

Dependencies

Python==3.*.*
torch==1.*
librosa==0.7.0
tensorboard
pesq
pystoi
matplotlib
tqdm

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
config		config
dataset		dataset
inferencer		inferencer
model		model
trainer		trainer
util		util
.gitignore		.gitignore
README.md		README.md
inference.py		inference.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement

ToDo

Usage

Performance

Dependencies

References

About

Releases

Packages

Languages

haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement

Folders and files

Latest commit

History

Repository files navigation

A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement

ToDo

Usage

Performance

Dependencies

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages