Hi,
Now I am developing Correlator for software GPS on c6678 EVM. And I have a question.
What kind of assembler code do write to realize 32(16x16) multiplies?
I understand 8 multiplies, sample as follow
DMPY2 .M1 A1:A0, A3:A2, A7:A6:A5:A4
|| DMPY2 .M2 B1:B0,B3:B2,B7:B6: B5:B4
Do I need to study special coding method?
in Instruction Set Reference-
1.1.1 4x Multiply
core can now execute up to 32 (16x16-bit) multiplies per cycle.