Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Session Level Pheno #83

Closed
18 tasks done
surchs opened this issue Nov 30, 2023 · 2 comments
Closed
18 tasks done

Session Level Pheno #83

surchs opened this issue Nov 30, 2023 · 2 comments
Labels
Milestone Used to track other issues that are required to complete the milestone.

Comments

@surchs
Copy link
Contributor

surchs commented Nov 30, 2023

Context

We have a good amount of data in our nodes and are starting to also host richer data. Many of the richer data have at least some longitudinal information on phenotypic data - and we cannot model this because we model phenotypic information at the subject level. This will allow us to provide much richer queries in the near future and is, from the tool and data model side, a relatively easy thing to do.

There is also BEP36 making its way towards completion, and we want to both have some ideas on that and be able to model BEP36 information once it merges.

The overall design ideas live here: https://miro.com/app/board/uXjVNIzFV9w=/

IMportant stuff

  • right now the CLI can handle multiple sessions (rows about the same subject), but it just picks the first value

Why

  • Part of Grow the amount and usefulness of data #11
  • Follows Deploy on Release #64
  • We want to model longitudinal data so we can represent richer information and query them
  • So we can get more value from our own (PD) datasets locally
  • To get ready for such modeling challenges as QPN where imaging and pheno sessions can be completely separate

Outcomes

  • our data model schema supports age, sex, diagnosis, assessment at the session level
  • all (of our own) graphs are running up to date data models (no more subject level pheno)
  • I can search for a subject with 2 phenotypic sessions and no imaging
  • I can annotate (annotation tool) and process (CLI) a table with session ID columns

What

CLI

Annotation tool

"Work"

API

Federation API

Query tool

Documentation-related

What it is not

  • we do not deal with session timing information (i.e. imaging and pheno happening within X days of each other)
  • we do not support multiple session naming systems, if you have an imaging session, it has to be named in the big input .tsv
  • allowing multiple columns "about" age or "sex" (different values for different sessions have to still be encoded longform)
  • any information on ordering of sessions or "baseline" sessions

To close it all out

@surchs surchs added flag:discuss Flag issue that needs to be discussed before it can be implemented. Milestone Used to track other issues that are required to complete the milestone. and removed flag:discuss Flag issue that needs to be discussed before it can be implemented. labels Dec 4, 2023
@rmanaem
Copy link
Contributor

rmanaem commented Dec 6, 2023

Since there were no issues open on the annotation tool and query tool related to this roadmap, I'm turning the checkboxes into issues for both tools.

@alyssadai
Copy link

Closing as all issues have been addressed! 🥳

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Milestone Used to track other issues that are required to complete the milestone.
Projects
Archived in project
Development

No branches or pull requests

3 participants