Skip to content

CLDF dataset derived from Liú et al.'s "Basic Words in Chinese Dialects" from 2007

License

Notifications You must be signed in to change notification settings

lexibank/liusinitic

Repository files navigation

CLDF dataset derived from Liú et al.'s "Collection of Basic Words in Chinese Dialects" from 2007

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Líu, L.; Wáng, H.; Bǎi, Y. (2007): Xiàndài Hànyǔ fāngyán héxīncí, tèzhēng cíjí 现代汉语方言核心词·特征词集 [Collection of basic vocabulary words and characteristic dialect words in modern Chinese dialects]. Nánjīng: Fènghuáng.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Conceptlists in Concepticon:

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 19 (linked to 19 different Glottocodes)
  • Concepts: 203 (linked to 202 different Concepticon concept sets)
  • Lexemes: 4,302
  • Sources: 1
  • Synonymy: 1.12
  • Cognacy: 5,909 cognates in 832 cognate sets (382 singletons)
  • Cognate Diversity: 0.15
  • Invalid lexemes: 0
  • Tokens: 21,895
  • Segments: 145 (0 BIPA errors, 0 CLTS sound class errors, 145 CLTS modified)
  • Inventory size (avg): 50.32

Contributors

Name GitHub user Description Role
Liú Lìlǐ data collector DataCollector, Editor, Author
Wáng Hóngzhōng data collector DataCollector, Editor, Author
Bǎi Yíng data collector DataCollector, Editor, Author
Johann-Mattis List @LinguList maintainer Editor

CLDF Datasets

The following CLDF datasets are available in cldf: