Tool/software: Linux
Customer follow SDK provided vecadd example, got result as below, Buffer map/unmap time consumption is very big compare to kernel execution.
please refer below snapshot, is is the nature? or how to improve it?
This thread has been locked.
If you have a related question, please click the "Ask a related question" button in the top right corner. The newly created question will be automatically linked to this question.
Tool/software: Linux
Customer follow SDK provided vecadd example, got result as below, Buffer map/unmap time consumption is very big compare to kernel execution.
please refer below snapshot, is is the nature? or how to improve it?