Hi,
I am running the same network with ReLU, ReLU6, and ReLU8 activations.
The inference time is 12 ms with ReLU, but 33 ms with both ReLU6 and ReLU8.
The supplied documents state that "ReLU8 can be performed with just shift operation in fixed point inference without needing a floating-point computation / look up table",
so ReLU8 should not run at the same speed as ReLU6.
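My understanding of that statement, as a rough sketch only (it assumes a power-of-two activation scale and is not necessarily the exact TIDL kernel): because the clip bound 8 and the resulting output scale 8/256 = 2**-5 are powers of two, the clip and the requantization reduce to comparisons and a shift, with no floating-point multiply or look-up table. The function name relu8_fixed and the Q-format parameter q below are purely illustrative.

def relu8_fixed(x_q: int, q: int) -> int:
    """Illustration only: ReLU8 on a Q-format input x_real ~= x_q * 2**-q,
    producing an unsigned 8-bit output with scale 8/256 = 2**-5
    (assumes a power-of-two activation scale, not necessarily TIDL's scheme)."""
    upper = 8 << q                      # 8.0 expressed in the input Q format
    y = min(max(x_q, 0), upper)         # ReLU + clip at 8.0
    # Requantize [0, 8.0] onto [0, 255]: since both scales are powers of two,
    # the rescale is a pure shift -- no float multiply, no look-up table.
    shift = q - 5
    y = y >> shift if shift >= 0 else y << -shift
    return min(y, 255)                  # saturate the top code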
I am implementing ReLU8 as a Relu node followed by a Clip(0, 8). The SDK version is "rtos-j721e-evm-07_01_00_11".
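For reference, the pattern looks roughly like this when built with the ONNX helper API (a minimal sketch only, assuming an ONNX import path; the tensor shapes and node names below are placeholders, not my actual network):

import onnx
from onnx import helper, TensorProto

# Placeholder input/output shapes, for illustration only
inp = helper.make_tensor_value_info("input", TensorProto.FLOAT, [1, 64, 56, 56])
out = helper.make_tensor_value_info("output", TensorProto.FLOAT, [1, 64, 56, 56])

# Clip min/max supplied as initializers (opset 11+ style)
clip_min = helper.make_tensor("clip_min", TensorProto.FLOAT, [], [0.0])
clip_max = helper.make_tensor("clip_max", TensorProto.FLOAT, [], [8.0])

relu = helper.make_node("Relu", ["input"], ["relu_out"], name="relu")
clip = helper.make_node("Clip", ["relu_out", "clip_min", "clip_max"],
                        ["output"], name="clip_0_8")

graph = helper.make_graph([relu, clip], "relu8_pattern", [inp], [out],
                          initializer=[clip_min, clip_max])
model = helper.make_model(graph, opset_imports=[helper.make_opsetid("", 11)])
onnx.checker.check_model(model)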