Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dev v2.1 #287 - Simplify the Gen3-DRS download option #304

Merged
merged 24 commits into from
Feb 22, 2022

Conversation

imendes93
Copy link
Contributor

Closes #287

Description

This PR adds the following alterations:

  • added a new --manifest parameter with the input being the .json manifest file downloaded from GTEX
  • --reads parameter receives a CSV file with samples for which the analysis will be limited to
  • filenames requested but not found in original manifest file are saved in a not_found_GTEX_samples.txt file

@imendes93 imendes93 linked an issue Jan 19, 2022 that may be closed by this pull request
@Vlad-Dembrovskyi
Copy link
Contributor

@angarb to test with Simplify-Gen3-DRS-2 tag

Copy link
Contributor

@Vlad-Dembrovskyi Vlad-Dembrovskyi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

main.nf Outdated Show resolved Hide resolved
main.nf Outdated Show resolved Hide resolved
Copy link
Contributor

@Vlad-Dembrovskyi Vlad-Dembrovskyi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fix

main.nf Outdated Show resolved Hide resolved
Copy link
Contributor

@Vlad-Dembrovskyi Vlad-Dembrovskyi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another issue, at filter_manifest step:
https://cloudos.lifebit.ai/public/jobs/61fc0a9b8c574a01e8e9d0fa
image
Nextflow when running on GCP CloudOS probably cannot normally access the bin folder scripts. Either the container must have different options to run, and be set to the user as GCP nextflow user is with docker.runOptions or smth (tricky). Or the script has to be put inside container on path. Latter is easier.

@Vlad-Dembrovskyi
Copy link
Contributor

Vlad-Dembrovskyi commented Feb 10, 2022

Copy link
Contributor

@Vlad-Dembrovskyi Vlad-Dembrovskyi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Last production run finished successfully.

@Vlad-Dembrovskyi Vlad-Dembrovskyi merged commit 7bd8f1d into dev-v2.1 Feb 22, 2022
@Vlad-Dembrovskyi Vlad-Dembrovskyi deleted the dev-v2.1-#287 branch February 22, 2022 09:57
ilevantis pushed a commit that referenced this pull request May 19, 2022
* Fixes env gtex issue #290 (#294)

* Change env() to stdout to save sample_name in gen3_drs

* Fix No such property: baseName for class: String

* Gen3-DRS prints md5 "file is good" to log not stdout

* Improves gen3-drs md5 error message

* Changes gtex input to support new manifest file format [#289] (#296)

* Updates ch_gtex_gen3_ids items #289

* Remove duplicate val(obj_id) in input of gen3-drs

Co-authored-by: cgpu <38183826+cgpu@users.noreply.github.com>

* Comments our fasta requirement for gen3-drs input (#297)

* Comments our fasta requirement for gen3-drs input

* Update usage.md that genome_fasta is only for CRAM

* Update usage.md typo

* Fix missing file from path issue

* change GLS executor from parameter to scope (#305)

* Remove gtex (#299)

* Remove mentions of old GTEX download option from main.nf

* Remove mentions of old GTEX download option from help

* Remove mentions of old GTEX download option from usage.md

* Renames Gen3-DRS into new GTEX download option

* Renames Gen3-DRS into new GTEX download opt in usage.md

* Dev v2.1 #287 - Simplify the Gen3-DRS download option (#304)

* Update usage.md

* Update run_on_sumner.md

* add dockerfile for csvtoolkit

* add process to convert manifest json to csv

* add process to filter manifest by file passed through --reads

* update help message

* fix bug on variable declaration

* Update nextflow.config - fix typo

* Revert "Merge branch 'master' into dev-v2.1-#287"

This reverts commit be2c2ab, reversing
changes made to 04285ef.

* Update main.nf

* patch projectDir error

* Fix oublishDir path for manifest

* Fix oublishDir path for manifest

* Fix typo

* Update filter_manifest.py

* Update filter_manifest.py

* fix bug on saving filenames that were not in manifest file

* Update filter_manifest.py

* remove logging of samples not found in manifest

* Update filter_manifest.py

* Makes filter_manifest txt output optional

Co-authored-by: angarb <62404570+angarb@users.noreply.github.com>
Co-authored-by: Vlad-Dembrovskyi <64809705+Vlad-Dembrovskyi@users.noreply.github.com>
Co-authored-by: Vlad-Dembrovskyi <vlad@lifebit.ai>

* Rename examples/gen3/README.md to examples/GTEX/README.md

Editing folder name to match new "download_from" name.

* Update and rename GEN3_DRS_config.md to GTEX_config.md

Updating parameters

* Delete examples/gen3 directory

* Update usage.md

Moving this information

* Update README.md

* Update README.md

* Delete PRJNA453538.SraRunTable.txt

Not needed

* Delete MCF10_MYCER.datafiles.csv

Not needed

* Create reads.csv

Adding reads.csv example

* Update README.md

* Create manifest.json

Adding example manifest.json

* Update README.md

* Update run_on_cloudos.md

* Update Copying_Files_From_Sumner_to_Cloud.md

Made neater

* Create Star_Index_Generation.md

Co-authored-by: cgpu <38183826+cgpu@users.noreply.github.com>
Co-authored-by: imendes93 <73831087+imendes93@users.noreply.github.com>
Co-authored-by: angarb <62404570+angarb@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Simplify the Gen3-DRS download option
3 participants