200475-Sentences-Chinese-Text-Normalization-Data

Description

200,475 Sentences - Chinese Text Normalization Data. Annotate the special symbols and Arabic numerals in the sentences as Chinese characters.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1102?source=Github

Specifications

Data content

200,475 sentences of text were transcribed in Chinese characters;

Data scale

200,475 original texts with 457,832 annotations;

Content source

Sentences extracted from various types of news, articles, novels, etc.

Language

Chinese;

Annotation

Annotate the special symbols and Arabic numerals in the sentences as Chinese characters;

Applications

TTS, Text normalization;

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
20210927171813646_demo.jpg		20210927171813646_demo.jpg
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

200475-Sentences-Chinese-Text-Normalization-Data

Description

Specifications

Data content

Data scale

Content source

Language

Annotation

Applications

Licensing Information

About

Releases

Packages

Nexdata-AI/200475-Sentences-Chinese-Text-Normalization-Data

Folders and files

Latest commit

History

Repository files navigation

200475-Sentences-Chinese-Text-Normalization-Data

Description

Specifications

Data content

Data scale

Content source

Language

Annotation

Applications

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages