Is there a possibility to apply int16, fp16 instead of int8 at specific layers in QAT?
This thread has been locked.
If you have a related question, please click the "Ask a related question" button in the top right corner. The newly created question will be automatically linked to this question.
Hi,
Could you please share which flow you are using ? OSRT Or TIDL-RT ?
Hi Cecilia,
From the QAT, we only support 8 bit quantization, the mixed precision quantization support it not available as part of QAT offering.
Though i would recommend you to please check out PQT (Post Quantization Tool) documentation as part of edgeai-tidl-tools repos where you can opt for mixed precision quantization option. Something like all layers are in 8 bit and few(selected ones) in 16 bit or Vice Versa.
Hope this helps you.
Please find the link to PQT here : https://github.com/TexasInstruments/edgeai-tidl-tools/blob/master/docs/tidl_fsg_quantization.md#a-post-training-quantization-ptq
Thank you.