I remember reading in some documentation before that the DSPLink transfer rate from the GPP->DSP is faster than the DSP->GPP because the GPP-> DSP transfer is cacheable but the DSP-> is not cacheable, I don't remeber where that was?? Is that because the DSP program probably resides on the DSP's cache and the GPP is able to see the DSP's memory so the transfer can occur between the GPP and DSP cache while from the DSP side the DSP's MMU can't see the GPP's memory so it has to write it back to the DSPLink memory region and the GPP then needs to read the data from the external memory?
If my previous summary is correct, is it possible to map the GPP buffers so the DSP MMU can see them and if yes would that resule in DSP->GPP buffer transfer performance smilar to that of the GPP->DSP?