What DH & repository tooling is out there for working on a corpus: corpus management, packaging, semantification, data analysis, and producing research outputs?
#1
Open
mrchristian opened this issue
Oct 14, 2024
· 0 comments
What DH & repository tooling (software) is out there for working on a corpus: corpus management, packaging, semantification, carrying out data analysis, and producing research outputs?
The reason for asking is this: for an individual publication, how do we make it usable and compatible with the standards used in existing systems for corpus packaging and data analysis?
The kinds of tasks, functions, and capabilities being looked at are:
Collect the corpus into one file system
Package the corpus with an inventory
Convert the corpus to an open, interoperable standard format, with validation against that format
Corpus versioning and forking
Semantification: annotate with Named Entity Recognition
Semantic concept annotation
Enable NLP analysis: word frequency
Enable syntactic and syntactic/semantic markup
Enable text and data mining (TDM)
Research outputs: allow findings and results of the analysis to be output as data, with a copy of the corpus if needed, in a way that is compatible with Open Science
Reporting on the corpus: bibliometrics, presenting knowledge and ideas, statistics to back findings, etc.
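To make two of the tasks above concrete (packaging with an inventory, and word frequency), here is a minimal sketch using only the Python standard library. It is not taken from any existing DH or repository tool. The function names `build_inventory` and `word_frequencies`, the inventory fields, and the assumption that the corpus is a directory of UTF-8 `.txt` files are all illustrative choices, not an established standard.

```python
import hashlib
import re
from collections import Counter
from pathlib import Path


def build_inventory(corpus_dir: Path) -> list[dict]:
    """List every file under corpus_dir with its size and SHA-256 checksum.

    Hypothetical helper: the field names are an assumption, not a standard.
    """
    inventory = []
    for path in sorted(corpus_dir.rglob("*")):
        if path.is_file():
            inventory.append({
                "path": path.relative_to(corpus_dir).as_posix(),
                "bytes": path.stat().st_size,
                "sha256": hashlib.sha256(path.read_bytes()).hexdigest(),
            })
    return inventory


def word_frequencies(corpus_dir: Path, pattern: str = "*.txt") -> Counter:
    """Count lowercased word tokens across all files matching pattern.

    Assumes UTF-8 plain text; tokenisation is a simple regex, not real NLP.
    """
    counts: Counter = Counter()
    for path in sorted(corpus_dir.rglob(pattern)):
        text = path.read_text(encoding="utf-8").lower()
        counts.update(re.findall(r"\w+", text))
    return counts
```

The inventory list could be serialised with `json.dump` and stored next to the corpus, and `word_frequencies(root).most_common(20)` would give a first look at the vocabulary. A real workflow would likely swap both parts for an established packaging format and a proper NLP tokenizer.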
Also posted to NFDI4Culture: https://tickets.nfdi4culture.de/work_packages/9750/activity