I'm working on an image processing application that operates on VGA images (640x480). The on-chip memory (up to 176 KB in our case) isn't large enough to hold a complete image. What is the preferred scheme for getting the best performance out of VLIB using the L1 and L2 caches? The options include:
- Placing all of the image data in external memory and depending on automatic caching
- Breaking the image data into small chunks and manually placing them in on-chip memory (L1/L2 configured as memory-mapped SRAM rather than cache)
- Something in between
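
For concreteness, the chunked option could look roughly like the sketch below. It is only an illustration under several assumptions that are not stated above: 8-bit grayscale pixels (so a full frame is 640 * 480 = 307,200 bytes, about 300 KB), a strip height of 16 rows (10 KB per strip), and a hypothetical `process_strip` kernel that stands in for whatever VLIB function is actually called. The two strip buffers are ordinary static arrays here; on a real target they would be placed in on-chip SRAM by the linker, and the `memcpy` calls would be DMA transfers.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Assumed image geometry: VGA, 8 bits per pixel (the pixel format is an assumption). */
enum { IMG_W = 640, IMG_H = 480, STRIP_ROWS = 16 };

/* Two working buffers of one strip each (2 x 10 KB). In this sketch they are
   plain static arrays; placing them in on-chip memory is target-specific and
   not shown. */
static uint8_t strip_buf[2][IMG_W * STRIP_ROWS];

/* Hypothetical per-strip kernel (simple inversion). It is a placeholder for
   a real library call and is NOT a VLIB function. */
static void process_strip(uint8_t *buf, size_t n)
{
    for (size_t i = 0; i < n; ++i) {
        buf[i] = (uint8_t)(255u - buf[i]);
    }
}

/* Process a full frame held in (external) memory one strip at a time. */
void process_image_in_strips(const uint8_t *src, uint8_t *dst)
{
    const size_t strip_bytes = (size_t)IMG_W * STRIP_ROWS;
    const size_t n_strips = IMG_H / STRIP_ROWS;

    /* The sketch only handles heights that divide evenly into strips. */
    assert(IMG_H % STRIP_ROWS == 0);

    for (size_t s = 0; s < n_strips; ++s) {
        /* Alternate between the two buffers. The copies below are synchronous,
           so nothing overlaps here; with an asynchronous DMA engine the next
           strip could be fetched into the other buffer during processing. */
        uint8_t *buf = strip_buf[s & 1];

        memcpy(buf, src + s * strip_bytes, strip_bytes);   /* external -> working buffer */
        process_strip(buf, strip_bytes);                   /* compute on the small buffer */
        memcpy(dst + s * strip_bytes, buf, strip_bytes);   /* working buffer -> external */
    }
}
```

Kernels that need neighbouring rows (filters, for example) would also need overlapping strips, which this sketch leaves out.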
Thanks
Michael