Dear All,
I am new to TI DSPs and have just started working with the C6455. I compiled my code in CCS v6.1.1 and profiled it.
I am mainly interested in the 'functional unit utilization ratio': (number of instructions) / (number of cycles).
With optimization level -O3, intrinsics, #pragma MUST_ITERATE, loop unrolling, etc., I could only reach a functional unit utilization ratio of 2, i.e. on average only 2 instructions are issued per cycle. That seems too low given that the C6455 has 8 functional units.
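For context, here is a minimal sketch of the kind of loop these options are applied to. It is not the actual code: the function name dot_q15, the 16-bit dot product, and the trip-count bounds in the pragma are all assumptions made for illustration.

```c
#include <stdint.h>

/* Hypothetical kernel: dot product of two 16-bit arrays.
   Name, types, and trip-count bounds are illustrative assumptions. */
int32_t dot_q15(const int16_t *restrict a, const int16_t *restrict b, int n)
{
    int32_t sum = 0;
    int i;

    /* Assumption: 8 <= n <= 1024 and n is a multiple of 4.
       The TI compiler uses this hint when software-pipelining and
       unrolling the loop; compilers that do not know the pragma ignore it. */
    #pragma MUST_ITERATE(8, 1024, 4)
    for (i = 0; i < n; i++) {
        sum += (int32_t)a[i] * b[i];   /* one multiply-accumulate per element */
    }
    return sum;
}
```

The restrict qualifiers tell the compiler that a and b do not alias, which it needs to know before it can schedule loads from several iterations in parallel.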
Can you suggest how to improve performance by making better use of the available functional units?