Dear All,
Iam working on the H264 Decoder Optimization, in which after Inverse transform the data is added with the Residual data and is written in to the Output Buffer which is in external memory.
What I did is, I have taken a dummy buffer in the Internal memory say 1 Macroblock size ie "short IntBuffer[16][16];" (Considering only Luma) after Inverse Transform instead of writing the data in to the external memory Iam writing to the "IntBuffer" (Internal Memory) thinking that I can do DMA for copying that buffer to the external memory. But Iam seeing performance degradation. Can anyone put some light on the issue Iam facing?
One more thing is, I have set L1D Cache as 32KB, L1P Cache as 32KB and L2 Cache as 64KB.
Thanks in Advance,
Prashant