Hi TI Team,
I am currently trying to integrating ti openvx's vision functions to my application to reduce down to the host arm's cpu resource use.
Integrating it self is quite straight forward, but I found that overhead for I/O takes big portion.
For example, If I run FAST or Harris opencv function on Arm, it takes arround 3% of arm,
and if I run those on DSP with openvx, it uses dsp resource (arround 6%) , but it also increase 3% of arm resource use to I/O with vxMapArrayRange function.
(I call vxMapArrayRange / vxUnmapArrayRange every time after FAST graph run.)
So, there is no merit to use openVx in this situation.
Do you have any idea / trick to remove those overhead?
or
Can I just copy the detected result without vxMapArrayRange / vxUnmapArrayRange ?