This thread has been locked.
If you have a related question, please click the "Ask a related question" button in the top right corner. The newly created question will be automatically linked to this question.
Hi,
I have some questions about TI PSDK9.0 Quantization.
I am using edgeai-tidl-tools to create TIDL-RT.
TIDL-RT binary files are created by using tvm-compilation-onnx.py. I can find some parameters that looks like for quantization such as dataQ, roundBits, min/maxValue, tensorScale in bin.svg file.
Q1.
Can I manually change the quantization parameters?
If there is a documentation for parameters, Can I receive it?
Q2.
Are asymmetric, non-power of 2 scale, per-channel quantization supported in psdk9.0?
These contents have not been updated in a edgeai-tidl-tools document.
I found parameters [quantization_scale_type, output_feature_16bit_names_list, params_16bit_names_list, quantization_scale_type, channel_wise_quantization] in delegate_options in onnxrt_ep.py
Can I have documents for those parameters?
- github.com/.../onnxrt_ep.py
Q3.
Is there a tool available for calculating the similarity between fp32 model and int8 model created by tidl quantization?
Thanks,
YoungHeon
YoungHeon,
1. It's not possible to do so in SDK 9.0, however post SDK 9.1 you can control quantization via prototxt: https://github.com/TexasInstruments/edgeai-tidl-tools/blob/master/docs/tidl_quantParams.md
2. It can be controlled via quantization_scale_type being set to 4 (Updated documentation reflects this: https://github.com/TexasInstruments/edgeai-tidl-tools/blob/master/docs/tidl_fsg_quantization.md)
3. There is no tool currently available, but the scripts here can help do the same: https://github.com/TexasInstruments/edgeai-tidl-tools/blob/master/docs/tidl_osr_debug.md#script-1--layer-level-activation-comparisons-
Thanks,
Varun