Hi,
My customer has a question about FFT performance.
He found the following document in web site (not TI web site).
In page 10, you can see FFT performance table and it says DSP: 0.86 usec for TI C6678 @ 1.2 GHz for 1024 point FFT.
Do you have more detailed information about his ? For example, verified sample code, used API information, FFT lib package information, and etc...
I just tried fft_sp_1d_c2c_direct API from C66x FFT lib (fftlib_c66x_2_0_0_2) on my C6678 EVM (Core0 only) and I gave the following performance:
======================= CCS console=========================
...
...
...
FFT memory buffers:
Buffer Size(bytes) Alignment
0 8192 3
1 8192 3
2 8 3
3 0 3
4 0 3
5 0 3
6 0 3
7 0 3
8 0 3
9 0 3
fft_sp_1d_c2c_direct size= 1024
max_diff = 0.006134 N = 1024 Cycle: 9866
...
...
...
================================================
It says fft_sp_1d_c2c_direct API takes 9866 CPU cycle @ 1Ghz per core. If all cores do the same in parallel, 9866 / 8 = 1,233.25 CPU cycles would be potentially expected. Father more, assuming it works with 1.2Ghz device, the expected processing time would be 1,233.25 / 1.2 = 1.0277 usec.
So, I see some gap between 0.86 usec (from the above document) and 1.0277 usec (from EVM verification).
Best Regards,
Naoki