
[bug-fix] enforce all converted quantized ONNX initializers are same dtype #1181

Merged — 1 commit merged into main from quant-convert-force-uint8 on Nov 23, 2022

Conversation

@bfineran (Member) commented on Nov 23, 2022

torch 1.12 now exports some Q/DQ inputs as initializers instead of constants, which causes a pass in the quantized graph conversion step to miss converting them from INT8 to UINT8. This PR adds an explicit pass to convert any INT8 initializers that were missed to UINT8. Without this fix, models may fail to compile in ORT and DeepSparse due to a mismatch between INT8 and UINT8 inputs.
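For context, a minimal sketch of what such an initializer-conversion pass might look like — this is an illustration, not the PR's actual implementation; the function name force_uint8_initializers is hypothetical, and a real pass would restrict itself to initializers feeding QuantizeLinear/DequantizeLinear nodes and shift the matching zero-point tensors so the quantized values stay equivalent:

```python
import numpy as np
import onnx
from onnx import numpy_helper


def force_uint8_initializers(model: onnx.ModelProto) -> onnx.ModelProto:
    """Shift every INT8 initializer into UINT8 range (sketch only)."""
    for init in model.graph.initializer:
        if init.data_type == onnx.TensorProto.INT8:
            arr = numpy_helper.to_array(init)
            # int8 value v maps to uint8 value v + 128 (e.g. -128 -> 0, 127 -> 255)
            shifted = (arr.astype(np.int32) + 128).astype(np.uint8)
            init.CopyFrom(numpy_helper.from_array(shifted, name=init.name))
    return model
```

Running a pass like this after export leaves the quantized initializers uniformly UINT8, avoiding the INT8/UINT8 input mismatch described above.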

Test plan:
@KSGulin has reproduced the error to verify the fix and has commandeered the PR; compilation and accuracy still need to be verified.

@KSGulin (Contributor) left a comment


Tested the following scenarios:

  • torch==1.12.1 with quantize_conv_activations: False set in the quantization modifier
  • torch==1.12.1 without quantize_conv_activations: False in the quantization modifier
  • torch==1.9.1 with quantize_conv_activations: False set in the quantization modifier

All passed. Great fix! 🥳

@corey-nm (Contributor) left a comment


lgtm!

@bfineran merged commit 51f051e into main on Nov 23, 2022
@bfineran deleted the quant-convert-force-uint8 branch on November 23, 2022 at 18:50
3 participants