xarray.Dataset.var - xarray.DataArray.var - does it have ddof=1 parameter? #1050

chiaral · 2016-10-18T15:03:53Z

It is not clear from the description whether ddof = 1 is available and/or if it is set to 0.
(https://docs.scipy.org/doc/numpy-1.6.0/reference/generated/numpy.var.html)

for large samples, 1 or 0 don't make a lot of difference, but it would be good to know whether it uses N-1 or N.

shoyer · 2016-10-18T20:52:24Z

Good question. Setting ddof should work. It's passed on to nanvar from NumPy or bottleneck, both of which default to ddof=0.

stale · 2019-01-26T02:57:36Z

In order to maintain a list of currently relevant issues, we mark issues as stale after a period of inactivity
If this issue remains relevant, please comment here; otherwise it will be marked as closed automatically

sjvrijn · 2020-08-08T20:39:11Z

In core/nanops.py there are some explicit defaults of ddof=0 within xarray, but I'm not sure if those are always used or if there are also cases where var (or std) are directly passed on to numpy/bottleneck/dask.

I'm considering two different options to clarify this:

Add a docstring section on the ddof parameter specifying it uses ddof=0 as default for the reduction methods that use it, i.e. var and std. Possibly just copied from numpy's var page.
Refer to numpy's documentation page in the docstring of all reduction methods for further reference.

Both would require some logic in core/ops.py: either to check for which reduce methods need a ddof paragraph, or to create the proper url (which has to adjust min and max to np.amin and np.amax respectively)

Is there any clear preference from anyone about this?

max-sixty · 2020-08-08T21:59:18Z

Thanks for finding that @sjvrijn

I don't have a view on what we should use, so I would vote to defer to numpy (and pandas, which also seems to use 1), referencing that documentation to the extent xarray isn't changing anything.

But others probably have a stronger view on which ddof we should use?

shoyer added topic-documentation contrib-help-wanted labels Oct 18, 2016

stale bot added the stale label Jan 26, 2019

stale bot closed this as completed Feb 25, 2019

dcherian reopened this Feb 25, 2019

stale bot removed the stale label Feb 25, 2019

dcherian mentioned this issue Nov 8, 2021

Generate reductions for DataArray, Dataset, GroupBy and Resample #5950

Merged

dcherian closed this as completed in #5950 Mar 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xarray.Dataset.var - xarray.DataArray.var - does it have ddof=1 parameter? #1050

xarray.Dataset.var - xarray.DataArray.var - does it have ddof=1 parameter? #1050

chiaral commented Oct 18, 2016

shoyer commented Oct 18, 2016

stale bot commented Jan 26, 2019

sjvrijn commented Aug 8, 2020

max-sixty commented Aug 8, 2020

xarray.Dataset.var - xarray.DataArray.var - does it have ddof=1 parameter? #1050

xarray.Dataset.var - xarray.DataArray.var - does it have ddof=1 parameter? #1050

Comments

chiaral commented Oct 18, 2016

shoyer commented Oct 18, 2016

stale bot commented Jan 26, 2019

sjvrijn commented Aug 8, 2020

max-sixty commented Aug 8, 2020