TDA4VM: There is a coefficient difference between the quantization model inference result and the onnx inference result

Cherry Zhou

Part Number: TDA4VM

Hi team,

Using sdk 0701.

Does the quantization process occur during the conversion of the model by the tidlModelImport tool? The customer calls the PC_dsp_test_dl_algo.out tool on the converted int8 model to inference the picture, and then finds a coefficient difference between the parsed inference result and the inference result of onnx.

The quantization script is as follows:

modelType          = 2
numParamBits       = 8
numFeatureBits     = 8
inputNetFile       = "../../test/testvecs/collate/fs_noCoord/models/fs_noCoord.onnx"
outputNetFile      = "../../test/testvecs/collate/fs_noCoord/infer/8/tidl_net_fs_noCoord.bin"
outputParamsFile   = "../../test/testvecs/collate/fs_noCoord/infer/8/tidl_io_fs_noCoord_"
inDataNorm  = 1
inMean = 123.675 116.28 103.53 
inScale = 0.017125 0.017507 0.017429
resizeWidth = 480
resizeHeight = 480
inWidth  = 480
inHeight = 480
inNumChannels = 3
inData = ../../test/testvecs/collate/fs_noCoord/calib_list.txt
postProcType = 0


The inference results for the int8 model are as follows:

-33.00000000
-41.00000000
-42.00000000
-40.00000000
-41.00000000
-32.00000000
-27.00000000
-22.00000000
-25.00000000
-37.00000000
...



The inference for onnx is as follows:

-14.12847805
-17.56087685
-18.81339073
-17.77967834
-18.89683342
-14.82417774
-13.74363136
-11.10289955
-12.24503803
-17.49886703
....

The customer would like to know how to get the desired coefficients for the final model output to be converted to the output of the float model. Thanks!
Best Regards,
Cherry

over 3 years ago

0 Cherry Zhou over 3 years ago

TI__Mastermind 22235 points

Hi,

May I know is there any updates? Thanks.

Best Regards,

Cherry

0 Anand Pathak over 3 years ago in reply to Cherry Zhou

TI__Genius 9065 points

Hi Cherry,

The parameters are quantized during the model import process and the corresponding parameter scales can be found from "*_paramDebug.csv" generated as part of import artifacts (in ti_dl/test/testvecs/config/tidl_models/onnx/****_paramDebug.csv". The quantized parameters can be divided by the corresponding quantization scale to get back approximate floating point values, there will however be a minor difference wrt actual float values of the paramters due to quantization loss.

Regards,

Anand

0 Cherry Zhou over 3 years ago in reply to Anand Pathak

TI__Mastermind 22235 points

Hi Anand,

Thanks for your reply!

And for the file you provided, the customer would like to know which is the corresponding quantization parameter.

The customer generated parametric debug tables are as follows: (The Scale column and the actual coefficients do not match well )

Could you help check this? Thanks!

Best Regards,

Cherry

0 Cherry Zhou over 3 years ago in reply to Cherry Zhou

TI__Mastermind 22235 points

Hi Anand,

May I know is there any updates about the follow-up questions? Thanks!

Best Regards,

Cherry

0 Anand Pathak over 3 years ago in reply to Cherry Zhou

TI__Genius 9065 points

Hi Cherry,

The scales are not the actual coefficients. For a particular layer, the quantized coefficients for that layer need to be divided by the corresponding scale to get the actual coefficients.

You can refer to the "Weights Quantization statistic Analysis" section of the following documentation for more information:

https://software-dl.ti.com/jacinto7/esd/processor-sdk-rtos-jacinto7/08_00_00_12/exports/docs/tidl_j7_08_00_00_10/ti_dl/docs/user_guide_html/md_tidl_fsg_steps_to_debug_mismatch.html

The "quantParamFloat" variable in the function "TIDL_CompareParams" mentioned in the documentation should give the required coefficient values.

Regards,

Anand

Processors

Processors forum

TDA4VM: There is a coefficient difference between the quantization model inference result and the onnx inference result