Cycle count of a 4096-point FFT on the C6678

Hi TI experts,

    I'm implementing a 4096-point FFT on a single core of the C6678. According to the DSP performance figures published by TI, a 1K-point single-precision floating-point FFT takes 6269 cycles. In my tests, however, a 1K-point FFT takes about 77804 cycles and a 4096-point FFT takes about 384008 cycles. I use the DSPF_sp_fftSPxSP() function to compute the FFT.

    I have set the optimization level to -o3. What can I do to reduce the FFT computation time?