TMS320C6000 Optimizing Compiler v7.4: MPY32 instruction on C674x (C code)

Marat Shchuchinsky

Expert 2595 points

Hello friends,

I implement algorithm fo C64x subcore under DM8148 device.

I noticed that MPY32 instruction has two versions corresponds to destination type:

- when destination is 32 bits wide - MPY32 save only lower 32 bits of result (one register)

- when destination is 64 bits wide - MPY32 save full 64 bits result of multiplication into register pair.

I do not clearly understood what kind of strategy in choosing this MPY32 instruction uses the compiler if the size of the multiplication result is not defined uniquely. For example (let say that all 'a', 'b', 'c', 'd' and 'e' variables in this polynom are signed integer 32 bits wide):

Int32 result = ( (a1 * a2) >> a3 + (b1 * b2) >> b3 + d + (e1 * e2) ) >> e3;

How can I get the compiler (without re-writing code in assembler) always use the second version of the MPY32 instructions - where the result is always stored in a register pair.

ThanX alot and best regards

over 11 years ago

0 Alberto Chessa over 11 years ago

Mastermind 6670 points

Hi,

You can rewrite the expression in C using intrinsics (see _mpy32ll in compiler manual) or write the multiplication as:

((long long)a1*s2)

These generates somthing like MPY32 B4,A4,A5:A4.

Note that,as far as I know, when you use Int64, the shift operation became a call to the C runtime library since the CPU is not able to shift on 64bits type.

0 George Mock over 11 years ago in reply to Alberto Chessa

TI__Guru**** 251300 points

Mr. Chessa is correct. For more background, please see this application note.

Thanks and regards,

-George

Code Composer Studio™︎

Code Composer Studio forum

TMS320C6000 Optimizing Compiler v7.4: MPY32 instruction on C674x (C code)