Hello,
In document : TMS320x2803x Piccolo Control Law Accelerator(CLA) Reference Guide (Page 49) as follow:
;-----------------------------------------------------------------------
; Given A, B and C are 32-bit floating-point numbers
; Calculate Y2 = (A * B)
; Y3 = (A * B) + C
;
_Cla1Task2:
MMOV32 MR0, @_A ; Load MR0 with A
MMOV32 MR1, @_B ; Load MR1 with B
MMPYF32 MR1, MR1, MR0 ; Multiply A*B
|| MMOV32 MR0, @_C ; and in parallel load MR0 with C
MADDF32 MR1, MR1, MR0 ; Add (A*B) to C
|| MMOV32 @_Y2, MR1 ; and in parallel store A*B
MMOV32 @_Y3, MR1 ; Store the A*B + C
MSTOP ; end of task
;-----------------------------------------------------------------------
I made diagramming Pipeline Activity for 8 instructions as following:
I don't sure that _Y3 = A*B + C. Can you tell me about this matter ?
Regards