Other Parts Discussed in Thread: CCSTUDIO
Hi,
I'm using CCS, version 3.3.38.2/BIOS 5.31.02/Code Gen Tools v6.0.8.
With Compiler options: -k -o3 -fr"$(Proj_dir)\Debug" -i"C:\CCStudio_v3.3\MyProjects\fest\freq_est_tuned\Debug" -d"_DEBUG" -mt -mw -mv6400
Linker options: -c -m".\Debug\freq_est.map" -o".\Debug\freq_est.out" -w -x -i"C:\CCStudio_v3.3\c6400\mthlib\lib" -l"fastrts64x.lib"
I have a design doing chain processing (FIRs among others) on some big packets. It works, and all my 20 loops
are otimized. Did profiling and everything looks OK. I'm using L2 Cache (128k) configured in DSP/BIOS.
When I ran CacheTune & Profile Viewer I get L1D stall cycles 40%. Most of them are L1D. miss.write (95%).
For example for a fir filter I read two linear arrays (data[ ] & coef[ ] ) and write to an output y[ ] array. Looking in the
CacheTune graph in all the reads I have only compulsory misses; but all my writes misses every access.
With L2 Cache 4-way why I don't get in all my writes just compulsory misses?
All my loops have this problem when writing to arrays (SDRAM simple/linear arrays). I don't get on these writes any hit.
Thanks,
Andrew