Execute the provided Python script directly; the results will be displayed in the console.
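The actual script is in the attached zip; as a rough illustration, the timing loop it presumably uses looks something like the following sketch. `average_inference_time` and its parameters are hypothetical names, and `infer` is a stand-in callable; in the real script it would be a compiled OpenVINO model invoked on a NumPy input tensor.

```python
# Hedged sketch of a per-inference timing loop: average wall-clock
# time over n_runs calls, after a warmup phase that excludes
# one-time costs (graph compilation caches, memory allocation).
import time

def average_inference_time(infer, n_runs=100, warmup=10):
    """Return mean seconds per call of `infer` after `warmup` runs."""
    for _ in range(warmup):
        infer()  # warmup calls are not timed
    start = time.perf_counter()
    for _ in range(n_runs):
        infer()
    return (time.perf_counter() - start) / n_runs
```

Averaging over many runs with a warmup phase matters here, since a single cold inference can dominate and distort an FP32-vs-INT8 comparison.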
Relevant log output
A snippet of the log output:
=== Inference Time Comparison ===
FP32: 0.007910 seconds per inference
FP16: 0.006492 seconds per inference
INT8: 0.010170 seconds per inference
Issue submission checklist
I'm reporting an issue. It's not a question.
I checked the problem with the documentation, FAQ, open issues, Stack Overflow, etc., and have not found a solution.
There is reproducer code and related data files such as images, videos, models, etc.
OpenVINO Version
2024.4.0
Operating System
Other (Please specify in description)
Device used for inference
CPU
Framework
Keras (TensorFlow 2)
Model used
ResNet50
Issue description
Operating System - Ubuntu 22.04
CPU - Intel® Core™ i7-7700K
GPU - Mesa Intel® Arc(tm) A770 Graphics (DG2)
Memory - 32 GB
I have examined the FP32 tflite model conversion process as demonstrated in this notebook: https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/tflite-to-openvino/tflite-to-openvino.ipynb
However, my tflite model is already quantized to INT8, and I aim to convert it directly into the OpenVINO IR format using the ov.convert_model() function. I anticipate that the INT8 IR format will surpass both the FP32 and FP16 models in inference speed.
Contrary to expectations, the INT8 IR format model runs slower than its FP32 and FP16 counterparts.
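The conversion path described above can be sketched as below. This is a minimal sketch, not the reporter's exact script: the function name and file paths (`model_int8.tflite`, `model_int8.xml`) are hypothetical, and it assumes the `openvino` package (2023.1+ API) is installed.

```python
# Hedged sketch: convert an already-INT8-quantized TFLite model
# straight to OpenVINO IR and compile it for CPU inference.
def convert_tflite_to_ir(tflite_path, ir_path, device="CPU"):
    import openvino as ov  # lazy import; requires the openvino package

    core = ov.Core()
    # ov.convert_model accepts a TFLite model file path directly.
    ov_model = ov.convert_model(tflite_path)
    # Serialize the IR to disk (writes the .xml/.bin pair).
    ov.save_model(ov_model, ir_path)
    return core.compile_model(ov_model, device)

# Example (hypothetical paths):
# compiled = convert_tflite_to_ir("model_int8.tflite", "model_int8.xml")
```

Note that an INT8 model is only faster when the target CPU has efficient INT8 kernels; the quantized graph otherwise adds quantize/dequantize overhead on top of the compute.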
Step-by-step reproduction
Python script for replicating the results: tflite_openvino.zip
Steps to reproduce: run the provided Python script directly; the timing comparison is printed to the console.