Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alternate approach to serializing netcdfs for dask.distributed #1095

Closed
wants to merge 1 commit into from

Conversation

shoyer
Copy link
Member

@shoyer shoyer commented Nov 8, 2016

Alternative to #1083

Based on @mrocklin's suggestion of (ab)using pickle.

The good news is that this version can handle many more types out of the box (no datetime64 or timedelta64, though), and I don't need to do anything special for all of my other special array types further away from netCDF files.

cc @pwolfram -- maybe you'll have more luck with this version?

@mrocklin
Copy link
Contributor

mrocklin commented Nov 8, 2016

Ah, so you've handled the leaf case and everything else follows naturally from there thanks to pickle. Nice.

@pwolfram
Copy link
Contributor

pwolfram commented Nov 8, 2016

This seemed to work, see dask/distributed#629 comment.

@pwolfram
Copy link
Contributor

Note, I did get a memory error for usage via ds.sum().

@shoyer
Copy link
Member Author

shoyer commented Nov 21, 2016

Closing in favor of #1128, which simply makes all datastores pickle-able.

@shoyer shoyer closed this Nov 21, 2016
@shoyer shoyer deleted the distributed-2 branch July 31, 2018 22:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants