TDA4VM: Quantized model performance

Part Number: TDA4VM

Hi,

I have compiled a NN with the following settings:

compile_options = {
    "tidl_tools_path": os.environ["TIDL_TOOLS_PATH"],
    "artifacts_folder": "/home/TestCode/compiled_model",
    "tensor_bits": 16,
    "accuracy_level": 1,
    "advanced_options:calibration_frames": len(calib_images),
    "advanced_options:calibration_iterations": 3,
    "debug_level": 1,
    "deny_list": "Slice",
}
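For reference, in the edgeai-tidl-tools examples an options dict like this is passed to an onnxruntime session as provider options, and running the calibration frames through that session is what performs calibration and writes the compiled artifacts. A minimal sketch, assuming the TIDL tools Docker environment; `model_path`, the input tensor name, and `preprocess` are placeholders:

```
import onnxruntime as rt

# Compilation runs on the TIDLCompilationProvider; CPU is the fallback EP.
# compile_options is the dict defined above; model_path is a placeholder.
so = rt.SessionOptions()
sess = rt.InferenceSession(
    model_path,
    providers=["TIDLCompilationProvider", "CPUExecutionProvider"],
    provider_options=[compile_options, {}],
    sess_options=so,
)

# Feeding the ~50 calibration frames through the session drives the
# quantization calibration pass and generates the artifacts folder.
for img in calib_images:
    sess.run(None, {"input": preprocess(img)})  # input name/preprocess assumed
```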

Initially I tried tensor_bits = 8, but the accuracy was quite low. I then switched to 16 bits, and the accuracy is still poor. Around 50 images were used for calibration. With tensor_bits = 32 (which runs in floating point, for host emulation only) the accuracy is reasonable, so the float model itself seems fine. What could cause the 16-bit quantized accuracy to remain this low?

Lastly, is it possible to compile a statically quantized ONNX model (i.e., one already quantized offline with ONNX tooling) with edge AI TIDL? If yes, how can this be done?

PS: this is on host emulation, more specifically in a Docker container.

Thanks 

Ashay