Does it really take on the order of 18-21 clock cycles to perform one single-precision floating-point operation on the VFP (VFPv3 lite)?
I had assumed it was more like 1-2 clock cycles. According to the ARM site there are several versions of the VFP, some of which can do more than 1 FLOP per clock cycle. Is the one in the AM3703 not among them?
Can you please confirm how many clock cycles a single-precision floating-point operation takes on the VFP in the AM3703?