Minor issue: tmd_2021.csv column names can be in differing orders from run to run #140

donboyd5 · 2024-07-12T09:48:05Z

The tmd_2021.csv column names are not always in the same order even though the set of columns is the same. For example, in a recent excercise in which I created 4 variants of tmd output, under 4 different sets of assumptions, the names of the first 3 of the 213 columns were:

This defeats software such as R's vroom, which can read and combine a set of uniformly-structured csv files rapidly in parallel, which is useful when comparing multiple versions of tmd output prepared with different assumptions.

The alternative is to read the files one by one and combine them, which is considerably slower.

Not a big deal, but at some point, @nikhilwoodruff, it would be great if you could force the structure (column order) of tmd_2021.csv to be consistent from run to run.

martinholmer · 2024-07-17T00:11:06Z

@nikhilwoodruff, What is the timeline for resolving issue #140?

nikhilwoodruff self-assigned this Jul 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Minor issue: tmd_2021.csv column names can be in differing orders from run to run #140

Minor issue: tmd_2021.csv column names can be in differing orders from run to run #140

donboyd5 commented Jul 12, 2024 •

edited

Loading

martinholmer commented Jul 17, 2024

Minor issue: tmd_2021.csv column names can be in differing orders from run to run #140

Minor issue: tmd_2021.csv column names can be in differing orders from run to run #140

Comments

donboyd5 commented Jul 12, 2024 • edited Loading

martinholmer commented Jul 17, 2024

donboyd5 commented Jul 12, 2024 •

edited

Loading