Hello,
In TMS320c6748 manual it is written that
– 2 SP × SP → SP Per Clock
– 2 SP × SP → DP Every Two Clocks
– 2 SP × DP → DP Every Three Clocks
– 2 DP × DP → DP Every Four Clocks
But when I do assembly programming I see that MPYSP to multiply floating point numbers takes 4 clock cycles. Then why is it written that we can multiply two single precision numbers in every clock cycle. How can I acheive that? Please help. Thanks in advance.
My requirement is to get dotproduct of two floating point arrays where one is byte aligned and other not byte alighned.
With Regards
Shalini