Weights are modifying the observation SD not variance #25

seananderson · 2023-10-17T19:39:33Z

SD/weight in the code:

Line 625 in 4d91db1

    
           if(obs_model == 1) {for(i in 1:n_pos) log_lik[i] = normal_lpdf(y[i] | offset[i] + pred[row_indx_pos[i],col_indx_pos[i]] + obs_cov_offset[i], exp(log(sigma_vec[row_indx_pos[i]]) - log_weights_vec[i]));}

Variance/weight in the vignette:

bayesdfa/vignettes/a1_bayesdfa.Rmd

Line 325 in 4d91db1

    
           Just as the `weights` argument can be ueed with `glm`, `lmer` or `glmmTMB`, we allow weights to be used in DFA models. Weights are currently only used for Gaussian reponses and when data is in long format. Specifically, the weights are included by modifying each variance to be $$\sigma^2 / w_i$$. As a concrete example, we'll simulate a dataset, add some examples of standard errors on the survey indices, and then perform the DFA.

I suppose adjusting the SD is fine and maybe the docs just need to be fixed? Or would scaling by inverse variance be more classical?

?fit_dfa alludes to the implementation:

bayesdfa/R/fit_dfa.R

Lines 82 to 84 in 4d91db1

    
           #' @param weights Optional name of "weights" argument in data frame. This is only implemented when data 
        
           #'   are in long format. If not entered, defaults to weights = 1 for all observations. The implementation of weights 
        
           #'   varies slightly by family: Gaussian family models use -log(w_i) in the dispersion formula

The text was updated successfully, but these errors were encountered:

ericward-noaa · 2023-10-23T17:58:31Z

Thanks for catching this. It's an error, because I meant to have the weights affecting the variance, not SD. Before making any changes, probably worth thinking about if this is the approach we want to use, or whether we should adopt the glmmTMB or sdmTMB style, which involves using the weights as a multiplier on the likelihood (see here for a good discussion)

I can really go either way. I was thinking of this classical way initially because surveys brought into a DFA each might have an associated CV, etc. Using those as weights works with the classical implementation when likelihood is normal/lognormal, but the glmmTMB approach would be more general probably across families.

seananderson · 2023-10-23T18:32:09Z

The current implementation has been useful so far for @ecophilina 's work modelling body condition indices. It sounds like brms takes the likelihood multiplier route too. Perhaps weights should multiply the likelihood but what you currently have could be something like an input observation variance? I.e., give it a different argument name. The current implementation seems more akin to what goes into meta-analysis, which seems like a common application. If changing, I guess there would have to be a warning for a while... although given it's already not implemented exactly how it's described, a change needs to be made regardless.

ericward-noaa · 2023-10-23T21:33:00Z

@ecophilina -- see the changes I was thinking about here: #27 . Let me know if that's not clear or if there's another approach you were thinking of

seananderson · 2024-01-29T22:00:00Z

Looking at the implementation, it seems these are modifying the SD not the variance, right? I.e., if so, the argument should change name to inverse SD or the implementation should change to modify the variance not the SD.
https://github.com/fate-ewi/bayesdfa/blob/main/inst/stan/dfa.stan#L622

ericward-noaa · 2024-01-30T17:04:35Z

I'm not sure if this is just a naming issue, or something with the code that needs changing. If it's naming, we need to have 2 vectors -- one for optional likelihood weights, and one for optional inverse variance weights. Happy to change the arg name to something else.

If it's a coding issue, I think where we want to get to is that if someone has standard errors of a survey (se), these become the inverse variance weights $(w_{i} = \frac{1}{{se_{i}^2}})$ and the variance of observations becomes $(\frac{\sigma^2}{{w_i}})$, or equivalently $(\sigma^2 \cdot (se_{i})^2)$.

The workflow that's currently implemented is that

user creates weights themselves, based on the se values, e.g. in the vignette

bayesdfa/vignettes/a1_bayesdfa.Rmd

Line 346 in 87765b6

df$weights <- (1 / df$se)^2

. I did this because this is the approach in lm()
inv_var_weights_vec is manipulated internally to pass the se to Stan,

bayesdfa/R/fit_dfa.R

Line 566 in 87765b6

inv_var_weights_vec = sqrt(1.0/inv_var_weights_vec),

The inv_var_weights_vec is used in the likelihood in Stan to have a standard deviation = $\sigma \cdot se_{i}$, I think the same as above:

bayesdfa/inst/stan/dfa.stan

Line 622 in 87765b6

    
           if(obs_model == 1) {for(i in 1:n_pos) log_lik[i] = normal_lpdf(y[i] | input_offset[i] + pred[row_indx_pos[i],col_indx_pos[i]] + obs_cov_offset[i], sigma_vec[row_indx_pos[i]] * inv_var_weights_vec[i]);}

seananderson · 2024-01-30T17:13:03Z

Ah, I missed this line inv_var_weights_vec = sqrt(1.0/inv_var_weights_vec), It’s all fine then. I was confused since the vector name in the Stan code makes it sound like it’s still the inverse variance, not the inverse SD.

ericward-noaa closed this as completed Jan 12, 2024

ericward-noaa reopened this Jan 30, 2024

seananderson closed this as completed Jan 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Weights are modifying the observation SD not variance #25

Weights are modifying the observation SD not variance #25

seananderson commented Oct 17, 2023

ericward-noaa commented Oct 23, 2023

seananderson commented Oct 23, 2023

ericward-noaa commented Oct 23, 2023

seananderson commented Jan 29, 2024

ericward-noaa commented Jan 30, 2024 •

edited

Loading

seananderson commented Jan 30, 2024 via email

Weights are modifying the observation SD not variance #25

Weights are modifying the observation SD not variance #25

Comments

seananderson commented Oct 17, 2023

ericward-noaa commented Oct 23, 2023

seananderson commented Oct 23, 2023

ericward-noaa commented Oct 23, 2023

seananderson commented Jan 29, 2024

ericward-noaa commented Jan 30, 2024 • edited Loading

seananderson commented Jan 30, 2024 via email

ericward-noaa commented Jan 30, 2024 •

edited

Loading