morphosyn-alternation-largescale

This is the GitHub repository for my current project on a large-scale approach to multifactorial analysis in morphosyntactic alternations.

The files are meant to be run in order of their step ID: 1-extract.py is run first, and so on.
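As a rough illustration of that ordering, the sketch below sorts file names by their numeric step prefix. The helper names `step_id` and `in_run_order` are hypothetical and not part of the repository. It assumes every step file starts with a number (optionally with a decimal part, as in 2.5) followed by a hyphen.

```python
import re

def step_id(filename):
    """Return the numeric step prefix of a file name, or None if it has none."""
    match = re.match(r"(\d+(?:\.\d+)?)-", filename)
    return float(match.group(1)) if match else None

def in_run_order(filenames):
    """Keep only files that carry a step prefix and sort them by that prefix."""
    steps = [name for name in filenames if step_id(name) is not None]
    return sorted(steps, key=step_id)

files = [
    "5-encoding.R",
    "2.5-corenlp-coreference.py",
    "1-extract.py",
    "sylcount.py",  # helper module without a step ID, so it is left out
    "3-clausetable-interclause.py",
    "2-clausetable-sentencelevel.py",
]
print(in_run_order(files))
```

Parsing the prefix as a number rather than sorting the names as strings keeps a step such as 2.5 between 2 and 3 even if a two-digit step is added later.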

There are currently five steps in the process (the files for each step are subject to change):

-Step 1: Extract the data from the treebank and turn them into a Python-readable form
-Step 2: Extract clausal information from the treebank, focusing on within-sentence features
-Step 2.5-3: Extract clausal information from the treebank, focusing on inter-sentence features
-Step 4: Manually correct errors in automatic extraction
-Step 5: Encode and analyse data
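To give an idea of what Step 1 involves, here is a minimal sketch of reading CoNLL-U text (the format of the UD treebank files) into Python lists and dictionaries. It is not the code in 1-extract.py. The function name `read_conllu` and the sample sentence are made up for illustration; only the ten column names follow the CoNLL-U format.

```python
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]

def read_conllu(text):
    """Parse CoNLL-U text into a list of sentences, each a list of token dicts."""
    sentences, tokens = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line:                      # a blank line ends the sentence
            if tokens:
                sentences.append(tokens)
                tokens = []
        elif line.startswith("#"):        # comment or metadata line
            continue
        else:
            token = dict(zip(CONLLU_FIELDS, line.split("\t")))
            # Keep ordinary words only; skip multiword ranges (1-2) and empty nodes (1.1)
            if token["id"].isdigit():
                tokens.append(token)
    if tokens:                            # file may not end with a blank line
        sentences.append(tokens)
    return sentences

# Invented example sentence, not taken from the treebank
sample = "\n".join([
    "# text = She left.",
    "\t".join(["1", "She", "she", "PRON", "PRP", "_", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "left", "leave", "VERB", "VBD", "_", "0", "root", "_", "SpaceAfter=No"]),
    "\t".join(["3", ".", ".", "PUNCT", ".", "_", "2", "punct", "_", "_"]),
    "",
])

sentences = read_conllu(sample)
roots = [tok["lemma"] for tok in sentences[0] if tok["deprel"] == "root"]
print(roots)
```

Each token keeps its `head` and `deprel` values, which is the information the later clause-level steps would need to find subjects, objects and other dependents of a verb.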

The code files are those whose names begin with a step ID. In addition, there are CSV files generated by those code files. Currently, 'dec25table.csv' is the main source of data in Step 5. Correction files contain extra columns that correct the values of the columns extracted automatically in Steps 2-3; the correction files currently in use are:

-dec22-table-withintersentence-modified-3000clauses.csv
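The idea of correction columns can be sketched as follows. This is an assumption about how such a file could be applied, not the logic of 4-clausetable-fixes.py. The column names `clause_id`, `voice` and `voice_corrected`, the `_corrected` suffix and the function `apply_corrections` are all hypothetical.

```python
import csv
import io

def apply_corrections(rows, suffix="_corrected"):
    """Overwrite each automatic value with its manual correction when one is given."""
    fixed = []
    for row in rows:
        # Start from the automatically extracted columns only
        new_row = {k: v for k, v in row.items() if not k.endswith(suffix)}
        for column, value in row.items():
            # A non-empty correction cell replaces the automatic value
            if column.endswith(suffix) and value.strip():
                new_row[column[:-len(suffix)]] = value.strip()
        fixed.append(new_row)
    return fixed

# Tiny in-memory table standing in for a correction file (invented data)
table = io.StringIO(
    "clause_id,voice,voice_corrected\n"
    "1,active,\n"
    "2,active,passive\n"
)
rows = list(csv.DictReader(table))
corrected = apply_corrections(rows)
print(corrected)
```

Leaving a correction cell empty means the automatic value is kept, so only the clauses that were actually checked and changed need an entry.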

Attributions:

Universal Dependencies - English gold standard treebank

https://github.com/UniversalDependencies/UD_English-EWT

Syllable count:

https://github.com/eaydin/sylco

Files in the repository:

.gitattributes
1-extract.py
2-clausetable-sentencelevel.py
2.5-corenlp-coreference.py
3-clausetable-interclause.py
4-clausetable-fixes.py
5-encoding.R
6-analysis.R
README.md
dec17table.csv
dec22-table-withintersentence-modified-3000clauses.csv
dec22-table-withintersentence-modified.csv
dec22-table-withintersentence.csv
dec25table-corrtable-implicitsubjs-first3000.csv
dec25table.csv
dec26-table-withintersentence.csv
dec26table-first3000-v2-passivisability.csv
dec26table-first3000-v3-passivetheme.csv
dec27-sentence-coref-table.csv
dec28-correctedtable.csv
dec28-table-withintersentence.csv
dec28table-first3000-v3.csv
desktop.ini
en_ewt-ud-all.conllu
encoded-dec28.csv
fakedata.R
sept26table.csv
sylcount.py


kayaulai/morphosyn-alternation-largescale
