Hello,
I want to benchmark the floating-point FFT on C6678, I executed that DSPLIB routine : _DSPF_sp_fftSPxSP ; which is that last update written in linear assembly, for N=16k (16384) I mesured a count of : 466 244 clock cycles ; all my data is in L2, L1D cache is activated in cache (32k), and the compiler optimizations are active (-o3) ..
Is that what is expected to get ?
Thanks