Tool/software:
Dear TI Team,
I am attempting to run inference of an AI model that includes convolutional layers on a TI processor. The activation function used in those convolutional layers is GELU, and the quantization error in this part of the network becomes very large.
GELU (tanh approximation) is given by the following equation:
f(x) = 0.5 · x · (1 + tanh(√(2/π) · (x + 0.044715 · x³)))
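For reference, here is a small NumPy sketch of that approximation (illustrative only, not taken from my model code):

```python
import numpy as np

def gelu_tanh(x: np.ndarray) -> np.ndarray:
    """GELU via the tanh approximation: 0.5*x*(1 + tanh(sqrt(2/pi)*(x + 0.044715*x**3)))."""
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))
```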
Because of the cubic term in x, the quantization scale for this layer becomes very large, so many small elements of the tensor are quantized to zero (see the sketch below). I believe that either GELU support in tidl-tools or floating-point execution in the accelerator is necessary. As a makeshift workaround, manually setting the quantization scale for this particular layer might also help.
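To make the issue concrete, here is an illustrative sketch. The per-tensor symmetric int8 rule below (scale = max|t| / 127) is my assumption for illustration, not necessarily what tidl-tools uses internally:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(4096).astype(np.float32)
x[0] = 8.0                                   # one outlier activation

inner = x + 0.044715 * x**3                  # intermediate tensor inside tanh(...)

def frac_zero_after_int8(t: np.ndarray) -> float:
    """Fraction of elements that quantize to zero with a max/127 symmetric int8 scale."""
    scale = np.abs(t).max() / 127.0
    q = np.clip(np.round(t / scale), -127, 127)
    return float(np.mean(q == 0))

print("zeros when quantizing x:    ", frac_zero_after_int8(x))
print("zeros when quantizing inner:", frac_zero_after_int8(inner))
```

The cubic term inflates the dynamic range of the intermediate tensor, so the derived scale grows and noticeably more small activations collapse to zero than when quantizing x alone.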
Are these approaches feasible? Are there any other possible solutions?
I have attached the ONNX file containing the convolutional layers and a sample .npy file for the model input.
Regards,
Koki