TMS320C6748: DSPF_sp_fftSPxSP time cost

Part Number: TMS320C6748

According to http://software-dl.ti.com/sdoemb/sdoemb_public_sw/dsplib/latest//exports/DSPLIB_C674x_TestReport.html 

DSPF_sp_fftSPxSP costs 1069 clks when N=128 and 1965 clks when N=256.

I ran the test project provided by TI, located at D:\ti\dsplib_c674x_3_4_0_0\packages\ti\dsplib\src\DSPF_sp_fftSPxSP.

Even with optimization level -o3 and --opt_for_speed at level 5, I can only get 1159 clks when N=128 and 2119 clks when N=256.
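To put the gap in numbers, here is a minimal C sketch that compares a measured cycle count against the reported one. The helper names cycle_gap and gap_percent are my own for illustration and are not part of DSPLIB.

```c
#include <assert.h>

/* Hypothetical helper: absolute gap in cycles between a measured
 * count and the count listed in the test report. */
long cycle_gap(long measured, long reported)
{
    return measured - reported;
}

/* Hypothetical helper: the same gap as a percentage of the reported count. */
double gap_percent(long measured, long reported)
{
    assert(reported > 0); /* a reported cycle count must be positive */
    return 100.0 * (double)(measured - reported) / (double)reported;
}
```

With my numbers, cycle_gap(1159, 1069) is 90 clks for N=128 and cycle_gap(2119, 1965) is 154 clks for N=256, so both sizes come out roughly 8% slower than the report.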

Why can't I reach 1069 and 1965 clks with the provided project?

Thank you.