format of zh and ja #4

bakhbyergyen · 2022-04-01T01:46:19Z

Hi, I wanted to ask why the zh and ja datasets are split character by character rather than word by word.
When building a dataset, can't sentences be split by words instead of characters?
Thank you.
