Part Number: TDA4VM
use TI qat train script(github.com/.../edgeai-torchvision), output model is still fp32, and there is no error log, why and how to debug?
This thread has been locked.
If you have a related question, please click the "Ask a related question" button in the top right corner. The newly created question will be automatically linked to this question.
Part Number: TDA4VM
use TI qat train script(github.com/.../edgeai-torchvision), output model is still fp32, and there is no error log, why and how to debug?
When doing QAT with xnn.quantize in edgeai-torchvision, weights are discretized (internally) and compensated for the accuracy loss due to that discretization. Although, the weights saved in the checkpoint are floating point, it has been compensated for the error that will occur when quantized.
When TIDL reads the QAT model, it would apply the same kind of discretization for quantization. Please try the QAT model in TIDL or OSRT (for example ONNXRuntime with TIDL offload) - it should give good accuracy.
For a similar discussion, please see this thread:
https://github.com/TexasInstruments/edgeai-torchvision/issues/7