This thread has been locked.

If you have a related question, please click the "Ask a related question" button in the top right corner. The newly created question will be automatically linked to this question.

QAT quantization scheme

Is there a possibility to apply int16, fp16 instead of int8 at specific layers in QAT?