DeepSparse v1.4.0

jeanniefinks released this 17 Feb 20:08

· 5 commits to release/1.4 since this release

New Features:

OpenPifPaf deployment pipelines support (#788)
VITPose example deployment pipeline (#794)
DeepSparse Server logging with support for metrics, timings, and input/output values through Prometheus (#821, #791)

Changes:

Inference speed improved by up to 20% on dense FP32 BERT models.
Inference speed improved by up to 50% on quantized EfficientNetV1 and by up to 10% on quantized EfficientNetV2.
YOLOv5 integration upgraded to the latest upstream.

Resolved Issues:

DeepSparse no longer improperly detects each core as belonging to its own socket on some virtual machines, including those on OVHcloud.
When running networks with any Quantized Depthwise Convolution with a nontrivial w_zero_point parameter no longer produces an assertion failure. Trivial in this case means that the zero point is equal to 128 for uint8 data, or 0 for int8 data.
At executable_buffer.cpp (see #899), an assertion failure no longer occurs.
In quantized transformer models, a rare assertion failure no longer occurs.

Known Issues:

None

Assets 14