Hello,
I am new to Embedded systems and especially to OMAP 35. I am trying to run C++ image processing code
on the OMAP 35x EVM board. My problem is that the speed is very slow(comparing to Intel PC) when coming to passing filters on float gray images.
I am compiling with Code Sourcery arm gcc compiler, with options -O3 -mtune=cortex-a8 -march=armv7-a -ftree-vectorize -funroll-loops -mfpu=neon -mfloat-abi=softfp.
Is there something special I should configure concerning memory access ?
Is there some known benchmark of float computation ( preferably something which is like passing filter on image) which I can run on my board ?
Is there any profiling tool or method which can tell me which code lines are the most heavy ? (I have gprof but it tells only which functions are most heavy)