v0.7.6

@snarayan21 released this 10 May 22:22
· 46 commits to main since this release
97eae28

🚀 Streaming v0.7.6

Streaming v0.7.6 is released! Install via pip:

pip install --upgrade mosaicml-streaming==0.7.6

💎 New Features

1. device_per_stream batching method

Users can now construct batches so that each device sees samples from only a single stream. This helps when different data sources have samples or tensors of different sizes, but the model should still see samples from these different data sources at each optimizer step.
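
To illustrate the idea, here is a minimal pure-Python sketch of per-device, single-stream batching. It is not Streaming's implementation: the function `device_per_stream_batches`, its parameters, and the toy streams are hypothetical, and the sketch ignores the shuffling, node partitioning, and resumption that the library handles. In Streaming itself the method is presumably selected with `batching_method='device_per_stream'` on `StreamingDataset`; treat that as an assumption and check the docs.

```python
import random
from typing import Dict, List, Tuple

# One device batch: the stream it came from, plus its samples.
DeviceBatch = Tuple[str, List[int]]


def device_per_stream_batches(
    stream_samples: Dict[str, List[int]],
    num_devices: int,
    device_batch_size: int,
    seed: int = 0,
) -> List[List[DeviceBatch]]:
    """Hypothetical sketch: every device batch holds samples from one stream only."""
    rng = random.Random(seed)

    # Cut each stream into whole device batches, dropping any remainder.
    device_batches: List[DeviceBatch] = []
    for name, samples in stream_samples.items():
        last_start = len(samples) - device_batch_size
        for start in range(0, last_start + 1, device_batch_size):
            device_batches.append((name, samples[start:start + device_batch_size]))

    # Shuffle device batches so one optimizer step can mix streams across devices.
    rng.shuffle(device_batches)

    # Group device batches into global steps, one device batch per device.
    steps: List[List[DeviceBatch]] = []
    last_step_start = len(device_batches) - num_devices
    for start in range(0, last_step_start + 1, num_devices):
        steps.append(device_batches[start:start + num_devices])
    return steps


# Toy data: two streams whose sample IDs fall in disjoint ranges.
streams = {"short": list(range(8)), "long": list(range(100, 108))}
steps = device_per_stream_batches(streams, num_devices=2, device_batch_size=2)
for step_index, step in enumerate(steps):
    print(step_index, step)
```

Each printed step holds one batch per device. Within a batch all samples share a stream, while different devices in the same step may draw from different streams.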

2. ndarray type for Spark dataframes

Spark's ArrayType columns (with elements of ShortType, LongType, IntegerType, FloatType, or DoubleType) can now be parsed when converting a Spark dataframe to MDS.
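
As a rough sketch of what this implies, the table below pairs each supported Spark element type with the MDS `ndarray` encoding of matching width. The dictionary `SPARK_ARRAY_TO_MDS` and the helper `mds_encoding_for_array` are hypothetical names, and the exact encoding strings the converter emits are an assumption based on the bit widths of the Spark types. The sketch deliberately avoids importing pyspark or Streaming.

```python
# Assumed mapping from Spark ArrayType element types to MDS ndarray encodings.
# The keys are Spark type class names; the values follow the "ndarray:<dtype>" form.
SPARK_ARRAY_TO_MDS = {
    "ShortType": "ndarray:int16",     # 2-byte signed integer
    "IntegerType": "ndarray:int32",   # 4-byte signed integer
    "LongType": "ndarray:int64",      # 8-byte signed integer
    "FloatType": "ndarray:float32",   # single-precision float
    "DoubleType": "ndarray:float64",  # double-precision float
}


def mds_encoding_for_array(element_type: str) -> str:
    """Return the assumed MDS encoding for an ArrayType with this element type."""
    try:
        return SPARK_ARRAY_TO_MDS[element_type]
    except KeyError:
        raise ValueError(f"Unsupported ArrayType element type: {element_type}") from None


print(mds_encoding_for_array("FloatType"))
```

Element types outside the five listed in the release note, such as StringType, raise a `ValueError` in this sketch.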

3. Support for Alipan storage

Adds support for Alipan, Alibaba's cloud storage service.

What's Changed

New Contributors

Full Changelog: v0.7.5...v0.7.6