Skip to content

Commit

Permalink
Slurp pipeline
Browse files Browse the repository at this point in the history
- Update: direct_owned_parents methods: Temporarily commented out, pending OAK fixes. Currently doing this in batch using SPARQL for now.
- Add: Param --onto-exclusions-path: Now we no longer consider exluded terms to be possible slurp candidates.
- Bugfix: Labels: Now they are successfully being fetched using OAK.
- Add: New columns to output
- Update: run.sh: Reverted back to what it was before, undoing deletion of --rf, which was done to address pip installs not being persistent within the ODK docker container between runs.
  • Loading branch information
joeflack4 committed Sep 30, 2022
1 parent 35c8479 commit 1f3ee92
Show file tree
Hide file tree
Showing 10 changed files with 23,025 additions and 23,103 deletions.
6 changes: 3 additions & 3 deletions src/ontology/mondo-ingest.Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -344,14 +344,14 @@ slurp/:
mkdir -p $@

# min-id: the next available Mondo ID
# TODO: `pip install` stuff is temporary until ODK docker up to date w/ recent OAK updates
# TODO: Check if removing --rf from run.sh will fix need for pip install. havent been able to try yet; docker issue
# todo: `pip install` stuff is temporarily here until we come up with a fix. otherwise docker won't work
slurp/%.tsv: $(COMPONENTSDIR)/%.owl $(TMPDIR)/mondo.sssom.tsv $(REPORTDIR)/mirror_signature-mondo.tsv | slurp/
# pip install --upgrade -r $(RELEASEDIR)/requirements-unlocked.txt
pip install --upgrade -r $(RELEASEDIR)/requirements-unlocked.txt
python $(SCRIPTSDIR)/migrate.py \
--ontology-path $(COMPONENTSDIR)/$*.owl \
--sssom-map-path $(TMPDIR)/mondo.sssom.tsv \
--onto-config-path metadata/$*.yml \
--onto-exclusions-path reports/$*_term_exclusions.txt \
--min-id 850000 \
--max-id 999999 \
--mondo-terms-path $(REPORTDIR)/mirror_signature-mondo.tsv \
Expand Down
2 changes: 1 addition & 1 deletion src/ontology/run.sh
Original file line number Diff line number Diff line change
Expand Up @@ -45,7 +45,7 @@ if [ -n "$USE_SINGULARITY" ]; then
-W $WORK_DIR \
docker://obolibrary/$ODK_IMAGE:$ODK_TAG $TIMECMD "$@"
else
docker run -v $VOLUME_BIND -w $WORK_DIR -e ROBOT_JAVA_ARGS="$ODK_JAVA_OPTS" -e JAVA_OPTS="$ODK_JAVA_OPTS" -ti obolibrary/$ODK_IMAGE:$ODK_TAG $TIMECMD "$@"
docker run -v $VOLUME_BIND -w $WORK_DIR -e ROBOT_JAVA_ARGS="$ODK_JAVA_OPTS" -e JAVA_OPTS="$ODK_JAVA_OPTS" --rm -ti obolibrary/$ODK_IMAGE:$ODK_TAG $TIMECMD "$@"
fi

case "$@" in
Expand Down
2,140 changes: 1,070 additions & 1,070 deletions src/ontology/slurp/doid.tsv

Large diffs are not rendered by default.

43 changes: 20 additions & 23 deletions src/ontology/slurp/icd10cm.tsv
Original file line number Diff line number Diff line change
@@ -1,25 +1,22 @@
mondo_id xref label definition parents
ID A oboInOwl:hasDbXref LABEL A IAO:0000115 SC %
MONDO:850001 ICD10CM:A00-B99
MONDO:850002 ICD10CM:C00-D49
MONDO:850003 ICD10CM:D50-D89
MONDO:850004 ICD10CM:E00-E89
MONDO:850005 ICD10CM:F01-F99
MONDO:850006 ICD10CM:G00-G99
MONDO:850007 ICD10CM:H00-H59
MONDO:850008 ICD10CM:H60-H95
MONDO:850009 ICD10CM:I00-I99
MONDO:850010 ICD10CM:ICD-10-CM
MONDO:850011 ICD10CM:J00-J99
MONDO:850012 ICD10CM:K00-K95
MONDO:850013 ICD10CM:L00-L99
MONDO:850014 ICD10CM:M00-M99
MONDO:850015 ICD10CM:N00-N99
MONDO:850016 ICD10CM:O00-O9A
MONDO:850017 ICD10CM:P00-P96
MONDO:850018 ICD10CM:Q00-Q99
MONDO:850019 ICD10CM:R00-R99
MONDO:850020 ICD10CM:S00-T88
MONDO:850021 ICD10CM:U00-U85
MONDO:850022 ICD10CM:V00-Y99
MONDO:850023 ICD10CM:Z00-Z99
MONDO:850001 ICD10CM:A00-B99 Certain infectious and parasitic diseases (A00-B99)
MONDO:850002 ICD10CM:C00-D49 Neoplasms (C00-D49)
MONDO:850003 ICD10CM:D50-D89 Diseases of the blood and blood-forming organs and certain disorders involving the immune mechanism (D50-D89)
MONDO:850004 ICD10CM:E00-E89 Endocrine, nutritional and metabolic diseases (E00-E89)
MONDO:850005 ICD10CM:F01-F99 Mental, Behavioral and Neurodevelopmental disorders (F01-F99)
MONDO:850006 ICD10CM:G00-G99 Diseases of the nervous system (G00-G99)
MONDO:850007 ICD10CM:H00-H59 Diseases of the eye and adnexa (H00-H59)
MONDO:850008 ICD10CM:H60-H95 Diseases of the ear and mastoid process (H60-H95)
MONDO:850009 ICD10CM:I00-I99 Diseases of the circulatory system (I00-I99)
MONDO:850010 ICD10CM:ICD-10-CM ICD-10-CM TABULAR LIST of DISEASES and INJURIES
MONDO:850011 ICD10CM:J00-J99 Diseases of the respiratory system (J00-J99)
MONDO:850012 ICD10CM:K00-K95 Diseases of the digestive system (K00-K95)
MONDO:850013 ICD10CM:L00-L99 Diseases of the skin and subcutaneous tissue (L00-L99)
MONDO:850014 ICD10CM:M00-M99 Diseases of the musculoskeletal system and connective tissue (M00-M99)
MONDO:850015 ICD10CM:N00-N99 Diseases of the genitourinary system (N00-N99)
MONDO:850016 ICD10CM:O00-O9A Pregnancy, childbirth and the puerperium (O00-O9A)
MONDO:850017 ICD10CM:P00-P96 Certain conditions originating in the perinatal period (P00-P96)
MONDO:850018 ICD10CM:Q00-Q99 Congenital malformations, deformations and chromosomal abnormalities (Q00-Q99)
MONDO:850019 ICD10CM:S00-T88 Injury, poisoning and certain other consequences of external causes (S00-T88)
MONDO:850020 ICD10CM:U00-U85 Codes for special purposes (U00-U85)
Loading

0 comments on commit 1f3ee92

Please sign in to comment.