Hello,
We are building a framework for distributed image processing and would like to implement communication between the cores on top of Hyperlink using the IPC peripheral (by memory mapping the IPC periperal registers of the other DSP) and by mapping the other core`s memory.
I am currently a bit worried about latency. By running the memoryMappedExample project I can see Hyperlink has round-trip latency of 500-1000 cycles, so a one-way latency somewhere between 250-500 cycles.
One one-way latency is caused by accessing the IPC peripheral of the remote core, after that the remote core will start to access memory of the local core.
What I wonder - does every cache-line fetch (64 byte) introduce a round-trip latency?
If so, would using EDMA3 for larger-than-cache-line access, allow the data transfer to happen at once without additional round-trips in between?
Thx