Skip to content

CLDF dataset derived from Castro's "Sui Dialect Research" from 2015

License

Notifications You must be signed in to change notification settings

lexibank/castrosui

Repository files navigation

CLDF dataset derived from Castro's "Sui Dialect Research" from 2015

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Castro, Andy and Pan, Xingwen (2015): Sui dialect research. SIL: Guiyang.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Conceptlists in Concepticon:

Notes

This dataset was taken from the original data published by Andy Castro, who was so friendly to also share his original concept list with us in digital form. It comprises 16 varieties of the Sui branch of Tai-Kadai, in plain IPA with morphological segmentation.

Statistics

CLDF validation Glottolog: 100% Concepticon: 88% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 16 (linked to 3 different Glottocodes)
  • Concepts: 592 (linked to 518 different Concepticon concept sets)
  • Lexemes: 9,459
  • Sources: 1
  • Synonymy: 1.01
  • Invalid lexemes: 0
  • Tokens: 40,883
  • Segments: 138 (0 BIPA errors, 0 CLTS sound class errors, 138 CLTS modified)
  • Inventory size (avg): 67.00

Contributors

Name GitHub user Descriptin Role
Johann-Mattis List @LinguList maintainer Editor
Mei-Shin Wu @MacyL maintainer Other
Patience Epps help with concept mapping Other
Andy Castro help with concept mapping and original data DataCurator, DataCollector, Author

CLDF Datasets

The following CLDF datasets are available in cldf: