Skip to content

This repository are used to collect some programs that use in TTS.

License

Notifications You must be signed in to change notification settings

zuoxiang95/Speech-Image_Tool

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Speech-Image_Tool

This repositories are used to collect some programs that preprocess audio and image.

Image

The Image dir includes 3 functions to make data augmentation.

  1. Image rotation
  2. Random color
  3. Random Gaussian

Speech

The Speech dir includes 4 python scripts.

  1. character_to_pinyin.py: used to translate Chinese character to pinyin;
  2. trim_silence.py: used to trim the silence in begin and end of audio;
  3. mp3_translate_wav.py: used to translate .mp3 to .wav;
  4. generate_audio.py: used to generate audio from Baidu's api;

About

This repository are used to collect some programs that use in TTS.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages