Skip to content

Latest commit

 

History

History
8 lines (5 loc) · 346 Bytes

README.md

File metadata and controls

8 lines (5 loc) · 346 Bytes

handparsed-treebank

Extra hand parsed data for training models

english-handparsed: PTB style trees with some coverage of words or structures not well represented in WSJ PTB or other common datasets

english-tagged: data which is tagged, but not parsed.

italian-mwt: a collection of Italian phrases tokenized in the style of UD conll datasets.