
Conditional Inference for New Data #15

Open
melondonkey opened this issue May 6, 2021 · 1 comment
Labels
enhancement New feature or request

Comments

@melondonkey

Is there a way to use the trained models to do conditional inference on new observations, and to get the underlying probabilities rather than sampled datasets? For example, suppose I train on a binary matrix of diagnoses; then, as a new patient comes in, could I input their known conditions and get the probability that they have each of the other conditions?

The ability to do that in combination with the TF API would make this a very powerful "auto-complete" model.

@tsrobinson tsrobinson added the enhancement New feature or request label May 11, 2021
@tsrobinson
Collaborator

tsrobinson commented May 11, 2021

Thanks for raising this issue!

It is possible to recover the predicted probabilities (rather than labels) by setting cat_coalesce = FALSE and bin_label = FALSE in the complete() function. Since uncertainty over the predictions is handled by multiply imputing the data, the best strategy is then to average across the M completed datasets to obtain good estimates of the average predicted probabilities.
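As a rough illustration of the averaging step (not the library's own API), suppose the predicted probabilities for one binary column have been exported from each of the M completed datasets. The values below are made-up stand-ins; the point is simply the element-wise mean across imputations:

```python
import numpy as np

# Hypothetical per-imputation probability vectors (one value per patient),
# e.g. as exported from M runs of complete() with bin_label = FALSE.
imputed_probs = [
    np.array([0.82, 0.10, 0.55]),  # imputation 1
    np.array([0.78, 0.14, 0.49]),  # imputation 2
    np.array([0.80, 0.12, 0.52]),  # imputation 3
]

# Average element-wise across the M completed datasets to get
# a single point estimate of each patient's predicted probability.
avg_probs = np.mean(imputed_probs, axis=0)
print(avg_probs)  # -> [0.8  0.12 0.52]
```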

We are actively looking into adding a new function to predict missing values for data not used in training, which would allow you to achieve the proposed workflow above.
