Migrate quantization from POT API to NNCF API

Ultralytics YOLOv5 results

This tutorial demonstrates how to migrate a quantization pipeline written with the OpenVINO Post-Training Optimization Tool (POT) to the NNCF Post-Training Quantization API. It is based on the Ultralytics YOLOv5 model. It also compares the accuracy of the FP32 and quantized INT8 models, and runs a model inference demo based on sample code from Ultralytics YOLOv5 with the OpenVINO backend.

Notebook Contents

The tutorial consists of the following parts:

  1. Convert the YOLOv5 model to OpenVINO IR.
  2. Prepare the dataset for quantization.
  3. Configure the quantization pipeline.
  4. Perform model optimization.
  5. Compare the accuracy of the FP32 and INT8 models.
  6. Run the model inference demo.
  7. Compare the performance of the FP32 and INT8 models.
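
On the NNCF side, the dataset preparation, configuration, and optimization steps could look roughly like the sketch below. It is an illustration under stated assumptions, not the notebook's own code: the helper names `transform_fn` and `quantize_with_nncf` are hypothetical, each dataset item is assumed to be a tuple with the image batch first, and the `MIXED` preset and `subset_size=300` are example choices. The only NNCF calls used are `nncf.Dataset` and `nncf.quantize`.

```python
def transform_fn(data_item):
    """Return only the model input from one dataset item.

    Assumption: each item is a tuple or list whose first element is the
    image batch; labels and other fields are not needed for calibration.
    """
    return data_item[0]


def quantize_with_nncf(ov_model, data_source, subset_size=300):
    """Quantize an OpenVINO model to INT8 with NNCF post-training quantization.

    ov_model    -- an already loaded OpenVINO model (FP32)
    data_source -- any iterable of dataset items, e.g. a data loader
    subset_size -- number of samples used to collect calibration statistics
    """
    # Imported lazily so transform_fn can be used without NNCF installed.
    import nncf

    # Wrap the raw data source; transform_fn picks the model input per item.
    calibration_dataset = nncf.Dataset(data_source, transform_fn)

    # Preset and subset size are illustrative values, not the notebook's settings.
    return nncf.quantize(
        ov_model,
        calibration_dataset,
        preset=nncf.QuantizationPreset.MIXED,
        subset_size=subset_size,
    )
```

Given a loaded OpenVINO model and an iterable of dataset items, `quantize_with_nncf(model, loader)` would return the quantized model. Where a POT pipeline typically needs a data loader class and an engine configuration, this sketch only wraps the data source and passes a transform function.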

Installation Instructions

This is a self-contained example that relies solely on its own code.
We recommend running the notebook in a virtual environment. You only need a Jupyter server to start. For details, please refer to the Installation Guide.