Skip to content

xavier-gz/SLI_GlossCorpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

SLI_GlossCorpus

SLI mappings for the Princeton Annotated Gloss Corpus dataset

File Size Description
SLI_GlossCorpus_ILI_Gold_1.0.7z 5.0M The distribution includes a highly simplified XML version of the Princeton Annotated Gloss Corpus (available at http://wordnet.princeton.edu/glosstag.shtml), originally tagged with WordNet 3.0 Sense Keys, and projected by SLI to WordNet 3.0 Inter-Lingual Index (ILI)
SLI_GlossCorpus_CoNLL/ [DIR] SLI mappings for the Princeton Annotated Gloss Corpus dataset in CoNLL format
SLI_GlossCorpus_ILI_CoNLL_1.0.7z 5.2M The distribution includes a version of the Princeton Annotated Gloss Corpus (available at http://wordnet.princeton.edu/glosstag.shtml), originally tagged with WordNet 3.0 Sense Keys, projected by SLI to WordNet 3.0 Inter-Lingual Index (ILI), and encoded in CoNLL format for machine learning
SLI_GlossCorpus_BLC_CoNLL_1.2.7z 5.2M The distribution includes a version of the Princeton Annotated Gloss Corpus (available at http://wordnet.princeton.edu/glosstag.shtml), originally tagged with WordNet 3.0 Sense Keys, projected by SLI to semantic classes (BLC), and encoded in CoNLL format for machine learning
SLI_GlossCorpus_BabelDomains_CoNLL_1.0.7z 4.9M The distribution includes a version of the Princeton Annotated Gloss Corpus (available at http://wordnet.princeton.edu/glosstag.shtml), originally tagged with WordNet 3.0 Sense Keys, projected by SLI to semantic classes (BabelDomains), and encoded in CoNLL format for machine learning
SLI_GlossCorpus_Epinonyms_CoNLL_1.1.7z 5.3M The distribution includes a version of the Princeton Annotated Gloss Corpus (available at http://wordnet.princeton.edu/glosstag.shtml), originally tagged with WordNet 3.0 Sense Keys, projected by SLI to semantic classes (epinonyms), and encoded in CoNLL format for machine learning
SLI_GlossCorpus_Supersenses_CoNLL_1.0.7z 4.9M The distribution includes a version of the Princeton Annotated Gloss Corpus (available at http://wordnet.princeton.edu/glosstag.shtml), originally tagged with WordNet 3.0 Sense Keys, projected by SLI to semantic classes (supersenses), and encoded in CoNLL format for machine learning

These mappings are made available under the terms of Creative Commons Attribution 4.0 International Public License (CC BY 4.0) (which you can find at https://creativecommons.org/licenses/by/4.0/), and are distributed without any warranty.

Information and contact: Xavier Gómez Guinovart (xgg2021@gmail.com)