Hi.
I have written a simple function that converts a buffer of fixed-point values into a buffer of floating-point values. I compared the number of cycles the function takes in simulation (C674X CPU cycle accurate simulator) and on the ZOOM experimenter board (XDS100v1 debugger). Both the code and the data are placed in the L2SRAM of the DSP, and all interrupts are disabled. The simulator reports about half as many cycles as the real board.
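To illustrate the kind of function in question, here is a minimal sketch. The Q15 input format, the name `q15_to_float`, and the scale constant are assumptions made for illustration only; the code that was actually measured may differ.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumption: the fixed-point input is signed Q15 (16-bit, 15 fractional
 * bits), so the value range is [-1.0, 1.0). Other Q formats would only
 * change this scale factor. */
#define Q15_SCALE (1.0f / 32768.0f)

/* Hypothetical name. Converts n Q15 samples from src into floats in dst. */
void q15_to_float(const int16_t *src, float *dst, size_t n)
{
    size_t i;

    for (i = 0; i < n; i++) {
        /* Integer-to-float conversion followed by one multiply. */
        dst[i] = (float)src[i] * Q15_SCALE;
    }
}
```

A loop like this does one load and one store per element, so its cycle count depends heavily on how quickly memory can be accessed.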
The cycle measurement was done using the Target->Clock->Enable option of CCS4.
How can this difference be explained? Could enabling the L1 cache solve the problem?