Hello,
when I use OpenMP on C6678EVM the operation is much slower than without OpenMP (than single core operation). Much: At least 1000 times.
I tried:
- Copy OpenMP Image Processing Demo to my CCS Workspace (of MCSDK Beta 2.1)
- Delete everything except the mcip_master_main.c file
- Insert two for loops. One with and one without OpenMP
#pragma omp parallel for shared(a) private(i)
- Second test: Test a "realworld" algorithm (medical image registration)
#pragma omp parallel for reduction(+:SSD)
Result: The first algorithm takes much (>1000 times ?) longer to compute. The second one never returns (don't know wether it is too slow or wether there is another issue).
I wonder how this can be, because I use the unchanged OpenMP-Image-Processing demo. Any ideas ??? Must calculation take place in a special (ly initialized) task ?
Compilation was as release binary, onboard XDS100 is attached while testing.
Thank you.
Best regards,
Roelof Berg