Does anyone know or any example code that is using the VLIW? I am stuck with the lack of processing power with my current project. I am already using the L1 and L2 cache which has increased the performance of my maths function 5x but its still not enough. I need to do 50 16-bit summation in less than 5us and right now can only achieve about 20. So I need to at least double the processing speed.