SparseML v1.0.1 Patch Release

jeanniefinks released this 13 Jul 11:04

This is a patch release for 1.0.0 that contains the following changes:

Quantized ONNX graph folding resolution that prevents and extra quant/dequant pair being added into the residuals for BERT style models. This was causing an accuracy drop after exporting to ONNX of up to 1% and is now fixed.

Assets 3