Minor issue: tmd_2021.csv column names can be in differing orders from run to run #140

Open
donboyd5 opened this issue Jul 12, 2024 · 1 comment
@donboyd5
Collaborator

donboyd5 commented Jul 12, 2024

The tmd_2021.csv column names are not always in the same order from run to run, even though the set of columns is the same. For example, in a recent exercise in which I created 4 variants of tmd output under 4 different sets of assumptions, the names of the first 3 of the 213 columns were:

image

This defeats software such as R's vroom, which can rapidly read and combine a set of uniformly structured csv files in parallel. That capability is useful when comparing multiple versions of tmd output prepared with different assumptions.

The alternative is to read the files one by one and combine them, which is considerably slower.
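For illustration, the one-by-one workaround could look roughly like the sketch below. It is written in Python with only the standard library rather than R, and the function name `read_and_combine` and the added `source_file` column are hypothetical, not part of the tmd code. It matches columns by name, so the on-disk column order of each file does not matter, and it assumes every file has a header row.

```python
import csv
from pathlib import Path


def read_and_combine(paths, column_order=None):
    """Read CSV files one at a time and stack their rows in one column order.

    Hypothetical helper: columns are matched by name, so each file may store
    them in a different order. If column_order is not given, the header of
    the first file is used as the reference order.
    """
    combined = []
    for path in paths:
        with open(path, newline="") as f:
            reader = csv.DictReader(f)
            if column_order is None:
                column_order = list(reader.fieldnames)
            elif set(reader.fieldnames) != set(column_order):
                raise ValueError(f"{path} has a different set of columns")
            for row in reader:
                # Rebuild each record in the reference order.
                record = {name: row[name] for name in column_order}
                # Tag the row with its file of origin (assumed column name).
                record["source_file"] = Path(path).name
                combined.append(record)
    return column_order, combined
```

Values come back as strings, because the `csv` module does no type conversion. This is the slow path described above: the files are parsed sequentially rather than in parallel.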

Not a big deal, but at some point, @nikhilwoodruff, it would be great if you could force the structure (column order) of tmd_2021.csv to be consistent from run to run.
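One possible way to get a consistent structure is to fix the column order at the moment the file is written. The sketch below is only an assumption about how that might be done; the code that actually writes tmd_2021.csv is not shown in this issue, and `write_csv_with_stable_columns` is a hypothetical name. It uses Python's standard `csv` module and falls back to alphabetical order when no explicit order is supplied.

```python
import csv


def write_csv_with_stable_columns(rows, path, column_order=None):
    """Write dict rows to a CSV file with a deterministic column order.

    Hypothetical helper: if column_order is omitted, the union of all keys
    is sorted alphabetically, so the header is identical from run to run
    regardless of the order in which the columns were created.
    """
    rows = list(rows)
    if column_order is None:
        names = set()
        for row in rows:
            names.update(row)
        column_order = sorted(names)
    with open(path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=column_order)
        writer.writeheader()
        writer.writerows(rows)
    return column_order
```

Alphabetical sorting is just one choice. Passing an explicit `column_order` list would work equally well if a more meaningful order is preferred.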

@nikhilwoodruff nikhilwoodruff self-assigned this Jul 13, 2024
@martinholmer
Collaborator

@nikhilwoodruff, what is the timeline for resolving issue #140?
