The compiler documentation (SPRU514) says the following about the "--fp_mode" option:
"Note that there are algorithmic differences between the TMU hardware instructions and the library routines, so the results of operations may differ slightly."
What does "may differ slightly" mean?
We need a clear specification of the TMU's numerical accuracy that we can safely rely on.
Is there a precise description of these "slight differences" anywhere?
Is there a way to measure the differences?
An exhaustive test of all 2^64 input combinations (two 32-bit single-precision operands for the division) is not realistic.
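
For context, the kind of sampled comparison we could run ourselves is sketched below. It measures the worst observed distance, in units in the last place (ULP), between two division routines over pseudo-random operands. Everything here is our own assumption rather than anything taken from SPRU514: the names ulp_distance, max_ulp_error and random_normal_float are hypothetical helpers, and candidate_div is only a stand-in (a reciprocal multiply) for a division that would be compiled to TMU instructions on the target. A sampled run like this can only report the largest error it happened to see; it cannot prove a bound, which is why we are asking for a documented one.

```c
#include <stdint.h>
#include <string.h>

/* Map a float's bit pattern onto an ordered integer line so that
   adjacent representable floats differ by exactly 1 (+0 and -0 map to 0). */
static int64_t ordered_bits(float x)
{
    uint32_t u;
    memcpy(&u, &x, sizeof u);
    return (u & 0x80000000u) ? -(int64_t)(u & 0x7FFFFFFFu) : (int64_t)u;
}

/* Distance between two finite floats in ULP. */
static int64_t ulp_distance(float a, float b)
{
    int64_t d = ordered_bits(a) - ordered_bits(b);
    return d < 0 ? -d : d;
}

/* Small deterministic LCG so that runs are repeatable. */
static uint32_t lcg_next(uint32_t *state)
{
    *state = *state * 1664525u + 1013904223u;
    return *state;
}

/* Random normal float with magnitude in [2^-31, 2^33), either sign.
   The exponent range is restricted (an assumption of this sketch) so that
   quotients and reciprocals never overflow or become subnormal. */
static float random_normal_float(uint32_t *state)
{
    uint32_t mant = lcg_next(state) >> 9;        /* top 23 bits */
    uint32_t s    = lcg_next(state);
    uint32_t expo = 96u + ((s >> 26) & 63u);     /* biased exponent 96..159 */
    uint32_t sign = (s >> 25) & 1u;
    uint32_t bits = (sign << 31) | (expo << 23) | mant;
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}

typedef float (*div_fn)(float, float);

/* Reference: the plain C division as compiled for the platform. */
static float reference_div(float n, float d) { return n / d; }

/* HYPOTHETICAL stand-in for the routine under test. On the target this
   would be the same division built with the TMU-enabling options. */
static float candidate_div(float n, float d) { return n * (1.0f / d); }

/* Largest ULP distance observed between ref and cand over `samples` pairs. */
static int64_t max_ulp_error(div_fn ref, div_fn cand,
                             uint32_t seed, uint32_t samples)
{
    int64_t worst = 0;
    uint32_t state = seed;
    uint32_t i;
    for (i = 0; i < samples; ++i) {
        float n = random_normal_float(&state);
        float d = random_normal_float(&state);
        int64_t e = ulp_distance(ref(n, d), cand(n, d));
        if (e > worst)
            worst = e;
    }
    return worst;
}
```

Calling max_ulp_error(reference_div, candidate_div, 1u, 1000000u) would return the worst ULP difference seen in one million samples. Subnormals, infinities, NaNs and operands near the exponent limits are deliberately left out of this sketch and would need separate, targeted tests.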
Best regards