Hi,
SDK used: 8.2
Problem:
Quantizing the yolox model with the 8-bit method causes the detected boxes of large targets to become smaller than the actual objects.
Experimental record:
I have trained a variety of yolox models, and all of them show this problem to varying degrees: the detection boxes for large targets shrink.
1. 16-bit quantization improves the results, but the project cannot accept such a large increase in inference time.
2. I followed the Troubleshooting Guide for Accuracy/Functional Issues provided by TI, but the specific cause has not been located.
3. I tried the methods suggested in TIDL-RT: Quantization (excluding QAT), and the experiments still show the detection boxes of large targets becoming smaller.
I hope TI can give better suggestions or a solution.
[Images: 8-bit vs. 16-bit detection results]
We use mixed precision to reduce the quantization accuracy loss for object detection layers.
See an example here: https://github.com/TexasInstruments/edgeai-benchmark/blob/master/configs/detection.py#L314
It is seen that moving the first and last convolution layers to 16 bit removes most of the accuracy loss. Such layers can be specified using the parameter: 'advanced_options:output_feature_16bit_names_list'
(Here I am referring to our onnxruntime-tidl that offloads supported layers to TIDL - this is what edgeai-benchmark uses). TIDL-RT has a similar parameter to specify the layers to be put in 16 bit - you can find the details in the TIDL documentation.
Thanks, Manu
I had previously experimented with mixed precision on some specific detection layers, and I also found that this method improves the problem. However, it increases the inference time by about 3~4 ms, which is unacceptable for our application.
Try putting only the first convolution layer and the last convolution layers (the last convolution in each branch) in 16 bit. This should not increase the complexity significantly.
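For illustration only, a minimal sketch of how this suggestion could be expressed in the TIDL-RT import config, using the outputFeature16bitNamesList parameter that is also used later in this thread; the tensor names below are placeholders and must be replaced with the actual output names from your own model graph:
# keep only the first convolution and the last convolution of each detection branch in 16 bit
# (placeholder tensor names - take the real names from your model)
outputFeature16bitNamesList = "first_conv_out,branch0_last_conv_out,branch1_last_conv_out,branch2_last_conv_out"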
Hello Chao Yang,
I am working on the TDA4 EVM.
We want to run yolox on our EVM. Could you help us?
We've completed the application of yolov5.
Thanks a lot!
Option 1: (simplest)
You can use https://github.com/TexasInstruments/edgeai-modelmaker
It's very simple to use this repository.
Option 2:
The other option is to train the model yourself using https://github.com/TexasInstruments/edgeai-mmdetection
and compile using this example: github.com/.../benchmark_custom.py
Hello Manu,
My problem is as below:
We want to know how to use it on the EVM.
I've converted the onnx file into the yolox_net.bin and yolox_io.bin files.
Thanks a lot!
By the way, you can also try setting quantizationStyle = 3 (based on the yolox_m_ti_lite_45p5_64p2.onnx and yolox_m_ti_lite_metaarch.prototxt at github.com/.../edgeai-yolox).
The following images show the results for the different values of quantizationStyle. I set confidence_threshold = 0.4 (in yolox_m_ti_lite_metaarch.prototxt).
quantizationStyle = 2
quantizationStyle = 3
The importer config file is as below:
modelType = 2
numParamBits = 8
numFeatureBits = 8
quantizationStyle = 3
#quantizationStyle = 2
inputNetFile = "../../test/testvecs/models/public/onnx/yolox_m_ti_lite_45p5_64p2.onnx"
outputNetFile = "../../test/testvecs/config/tidl_models/onnx/yolox_m_ti_lite_45p5_64p2/tidl_net_yolox_m_ti_lite_45p5_64p2.bin"
outputParamsFile = "../../test/testvecs/config/tidl_models/onnx/yolox_m_ti_lite_45p5_64p2/tidl_io_yolox_m_ti_lite_45p5_64p2_"
inDataNorm = 1
inMean = 0 0 0
inScale = 1.0 1.0 1.0
inDataFormat = 1
inWidth = 640
inHeight = 640
inNumChannels = 3
numFrames = 1
inData = "../../test/testvecs/config/detection_list.txt"
perfSimConfig = ../../test/testvecs/config/import/device_config.cfg
inElementType = 0
#outDataNamesList = "convolution_output,convolution_output1,convolution_output2"
metaArchType = 6
metaLayersNamesList = "../../test/testvecs/models/public/onnx/yolox_m_ti_lite_metaarch.prototxt"
postProcType = 2
Good luck !
Hi, Manu
Try putting only the first convolution layer and the last convolution layers (the last convolution in each branch) in 16 bit. This should not increase the complexity significantly.
I tried this method, but it doesn't solve the problem I'm having.
Currently I do the following:
I use 16 bit for the position-regression convolutions in the three branches of yolox.
outputFeature16bitNamesList = "1203,1206,1238,1233,1236,1279,1218,1221,1258"
I found that this method improves the problem. It increases the inference time by about 3 ms.