Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
run_benchmark.sh		run_benchmark.sh
run_tuning.sh		run_tuning.sh

README.md

Step-by-Step

This example loads a neural network for answering a query about a given context paragraph. It is converted from ONNX Model Zoo and confirm its accuracy and speed based on SQuAD v1.1.

Prerequisite

1. Environment

onnx: 1.12.0
onnxruntime: 1.13.1

Validated framework versions can be found in main readme.

2. Prepare Model

Download model from ONNX Model Zoo.

wget https://github.com/onnx/models/raw/main/text/machine_comprehension/bidirectional_attention_flow/model/bidaf-11.onnx

3. Prepare Dataset

Download SQuAD dataset from SQuAD dataset link.

Run

1. Quantization

Quantize model with dynamic quantization:

bash run_tuning.sh --input_model=path/to/model \ # model path as *.onnx
                   --dataset_location=path/to/squad_v1/dev-v1.1.json
                   --output_model=path/to/model_tune

2. Benchmark

bash run_benchmark.sh --input_model=path/to/model \ # model path as *.onnx
                      --dataset_location=path/to/squad_v1/dev-v1.1.json
                      --mode=performance # or accuracy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ptq

ptq

README.md

Step-by-Step

Prerequisite

1. Environment

2. Prepare Model

3. Prepare Dataset

Run

1. Quantization

2. Benchmark

Files

ptq

Directory actions

More options

Directory actions

More options

Latest commit

History

ptq

Folders and files

parent directory

README.md

Step-by-Step

Prerequisite

1. Environment

2. Prepare Model

3. Prepare Dataset

Run

1. Quantization

2. Benchmark