Hello,
I've been using the Profile Clock in CCS to measure performance for my RM48L952.
The thing is that I don't think it's working properly. I tried to measure the execution time for one instruction (a simple mov) is 16 cycles @160MHz. I tried testing the same mov in a RM42 launchpad and it took 12 cycles @50MHz.
Correct me if I'm wrong, but that instruction shouldn't take just 1 cycle to execute? I know the cortex R architecture has a pipeline of 7 or 8 stages (can't remember now, mey be confusing it with cortex A). Is it possible that at every brake point there the pipeline is flushed and that's why it's taking so long ? Is the lockstep responsible for such delay?
I'm unable to understand this.