Skip to content

SparseML v1.0.1 Patch Release

Compare
Choose a tag to compare
@jeanniefinks jeanniefinks released this 13 Jul 11:04
0fa9f72

This is a patch release for 1.0.0 that contains the following changes:

  • Quantized ONNX graph folding resolution that prevents and extra quant/dequant pair being added into the residuals for BERT style models. This was causing an accuracy drop after exporting to ONNX of up to 1% and is now fixed.