Hi,
We are implementing an algorithm on C6678 on all 8 cores (1 master, 7slaves). Initially the algorithm was implemented on a single core without using a task. The processing time using 8 cores was almost double that of single core.
Specifically, the time taken for a piece code inside a task, on single core was 123ms. The time taken for the same code executed without a task, on a single core was 7ms. Is this the OS overhead? Are there any methods to kill this overhead? The code profiled above includes calling the rand() function, if that is in any way relevant.
Thanks,
Harini