This thread has been locked.

If you have a related question, please click the "Ask a related question" button in the top right corner. The newly created question will be automatically linked to this question.

TDA4VM: The output of Pytorch model of QAT is different from ONNX model

Part Number: TDA4VM

I follow the QAT document to QAT on our pytorch model and the QAT will change all ReLU layer to PACT2 layer. Then I will convet pytorch model to onnx model in order to set the model on TDA4. But I find some problem when transfering pytorch model to onnx model. 

As shown above, left is the output of QAT pytorch model, right is the output of onnx model, they are different. I wonder if onnx model can realize the function of PACT2, because the output of onnx is not power of 2. The onnx model can only do clip but cannot set the output to being power of 2. If the onnx model cannot set the output to being power of 2, why the QAT is useful for increasing accuracy for TDA4 INT8 model?

Pls answer my question asap, thx for your support.