Skip to content

lexibank/iecor

Repository files navigation

CLDF dataset derived from Heggarty, Paul & Anderson, Cormac & Scarborough, Matthew’s "Indo-European Cognate Relationships database" (IE-CoR version 1.0) from 2019

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Heggarty, Paul & Anderson, Cormac & Scarborough, Matthew 2024. Indo-European Cognate Relationships database (IE-CoR version 1.1). Leipzig: Max Planck Institute for Evolutionary Anthropology

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a https://creativecommons.org/licenses/by/4.0/ license

Available online at https://iecor.clld.org

Conceptlists in Concepticon:

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 0% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 160 (linked to 152 different Glottocodes)
  • Concepts: 170 (linked to 170 different Concepticon concept sets)
  • Lexemes: 25,731
  • Sources: 0
  • Synonymy: 1.01
  • Cognacy: 25,741 cognates in 4,981 cognate sets (2,341 singletons)
  • Cognate Diversity: 0.19
  • Invalid lexemes: 6,930
  • Tokens: 87,998
  • Segments: 669 (0 BIPA errors, 0 CLTS sound class errors, 669 CLTS modified)
  • Inventory size (avg): 37.44

Possible Improvements:

  • Entries missing sources: 25731/25731 (100.00%)

Contributors

Name GitHub user Description Role
Paul Heggarty @PaulHeggarty Founding Editor Author, DataCurator
Cormac Anderson Founding Editor Author, DataCurator
Matthew Scarborough Author, DataCurator
Hans-Jörg Bibiko @Bibiko patron, code, maintainer Other
Frederic Blum @FredericBlum maintainer Editor

CLDF Datasets

The following CLDF datasets are available in cldf: