Support new_axes= keyword in atop #1612

mrocklin · 2016-10-04T15:44:15Z

Add new single-chunk dimensions with the new_axes= keyword, including
the length of the new dimension. New dimensions will always be in a
single chunk.

>>> def f(x):
...     return x[:, None] * np.ones((1, 5))

>>> z = atop(f, 'az', x, 'a', new_axes={'z': 5})

mrocklin · 2016-10-04T15:46:56Z

This could use high-level review on API. Can anyone think of a cleaner way to do this or attractive alternatives?

@jcrist @shoyer

shoyer · 2016-10-04T16:09:49Z

This looks like a slightly less general version of #1511? The ability to insert new axes is certainly more important than resizing chunks (I can no longer remember exactly what use case I had in mind there).

mrocklin · 2016-10-04T21:13:52Z

Hrm, indeed. I had forgotten about #1511 .

The solution presented here does less, but the changes are also more modest, which reduces the concerns about maintenance bloat a bit (though not to zero).

I think the relevant question now is "Are we likely to want multi-chunk new dimensions?" If the answer is "yes" then we should think harder about this API.

I'm actually unsure what adding a new multi-chunk dimension would look like. The user defined function doesn't have a clear way to output multiple chunks. Thoughts?

shoyer

The solution presented here does less, but the changes are also more modest, which reduces the concerns about maintenance bloat a bit (though not to zero).

You also figure out a more elegant/minimal way to adjust top than I did :).

I think the relevant question now is "Are we likely to want multi-chunk new dimensions?" If the answer is "yes" then we should think harder about this API.

My version didn't actually support mulit-chunking new dimensions. It supported changing the chunking of existing dimensions on the output while keeping "block indices" intact. This is a pretty natural sort of thing to do (adjusting the size of each chunk), but maybe more complex than warranted without clear use cases.

shoyer · 2016-10-05T00:11:24Z

dask/array/core.py

@@ -1750,6 +1753,8 @@ def atop(func, out_ind, *args, **kwargs):
        Block pattern of the output, something like 'ijk' or (1, 2, 3)
    concatenate: bool
        If true concatenate arrays along dummy indices, else provide lists
+    new_axes: dict


This (and concatenate) should go after *args, and be marked as "keyword only"

mrocklin · 2016-10-05T12:19:11Z

My version didn't actually support mulit-chunking new dimensions. It supported changing the chunking of existing dimensions on the output while keeping "block indices" intact. This is a pretty natural sort of thing to do (adjusting the size of each chunk), but maybe more complex than warranted without clear use cases.

Ah, I see. Is there a way to combine that use case into this one or a way to make this change future proof so that it doesn't conflict with future desired changes?

Add new single-chunk dimensions with the ``new_axes=`` keyword, including the length of the new dimension. New dimensions will always be in a single chunk. >>> def f(x): ... return x[:, None] * np.ones((1, 5)) >>> z = atop(f, 'az', x, 'a', new_axes={'z': 5})

shoyer · 2016-10-05T14:20:14Z

I don't think there's any conflict here. Every valid argument to your new_axes was also a valid argument for my out_chunks, but new_axes is certainly a more intuitive name. I think this can go in as is.

mrocklin force-pushed the atop-new-axes branch from 8cf1487 to e979541 Compare October 4, 2016 21:10

shoyer reviewed Oct 5, 2016

View reviewed changes

mrocklin force-pushed the atop-new-axes branch from e979541 to 6dc7cbe Compare October 5, 2016 12:20

mrocklin merged commit 66685f4 into dask:master Oct 5, 2016

mrocklin deleted the atop-new-axes branch October 5, 2016 17:37

sinhrks added this to the 0.11.1 milestone Oct 11, 2016

shoyer mentioned this pull request Dec 31, 2016

New function for applying vectorized functions for unlabeled arrays to xarray objects pydata/xarray#964

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support new_axes= keyword in atop #1612

Support new_axes= keyword in atop #1612

mrocklin commented Oct 4, 2016

mrocklin commented Oct 4, 2016

shoyer commented Oct 4, 2016

mrocklin commented Oct 4, 2016

shoyer left a comment

shoyer Oct 5, 2016

mrocklin Oct 5, 2016

mrocklin commented Oct 5, 2016

shoyer commented Oct 5, 2016

Support new_axes= keyword in atop #1612

Support new_axes= keyword in atop #1612

Conversation

mrocklin commented Oct 4, 2016

mrocklin commented Oct 4, 2016

shoyer commented Oct 4, 2016

mrocklin commented Oct 4, 2016

shoyer left a comment

Choose a reason for hiding this comment

shoyer Oct 5, 2016

Choose a reason for hiding this comment

mrocklin Oct 5, 2016

Choose a reason for hiding this comment

mrocklin commented Oct 5, 2016

shoyer commented Oct 5, 2016