I have a program running on the A72 under Linux that exchanges data with the C7 DSP.
For this purpose I mapped part of the shared memory into my process via /dev/mem and mmap().
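
For reference, here is a minimal sketch of the kind of mapping I mean. The helper name map_shared, the physical address, the size and the O_SYNC flag are placeholders and assumptions for illustration, not the exact values from my program:

```c
#include <fcntl.h>
#include <stddef.h>
#include <stdint.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <unistd.h>

/* Placeholder values (hypothetical): the real address and size would come
 * from the memory region that is shared with the DSP. */
#define SHM_PHYS_ADDR 0xA0000000UL
#define SHM_SIZE      0x00100000UL  /* 1 MiB */

/* Map `len` bytes starting at page-aligned offset `phys` of the node `path`
 * (for example "/dev/mem"). Returns NULL on failure.
 * O_SYNC is an assumption here; it may influence whether the mapping
 * is cached or not. */
static volatile uint8_t *map_shared(const char *path, off_t phys, size_t len)
{
    int fd = open(path, O_RDWR | O_SYNC);
    if (fd < 0)
        return NULL;

    void *p = mmap(NULL, len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, phys);
    close(fd);  /* the mapping stays valid after the descriptor is closed */

    return (p == MAP_FAILED) ? NULL : (volatile uint8_t *)p;
}
```

It would be called as map_shared("/dev/mem", SHM_PHYS_ADDR, SHM_SIZE), which needs root privileges, and the returned pointer is then used like a normal buffer.
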
However, code on the A72 that operates on this shared memory runs much slower than the same code operating on local data.
My questions: Is this related to caching? How can I work around it? Is /dev/mem a bad choice here? Do I perhaps need the ION memory allocator instead?
Thanks!
BR, Jan