TDA4VM: Is it normal that the mathlib powdp() function runs slower on the TDA4VM C66 than on the CPU?

Lin AC

Part Number: TDA4VM
Other Parts Discussed in Thread: MATHLIB

My test code

1
2
3
4
5
6
double a = 142.224389823827;
double b = 0;
for (long i = 0; i < 100000000; i++)
{
    b = powdp(a,2); // a^2
}
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

double a = 142.224389823827;
double b = 0;
for (long i = 0; i < 100000000; i++)
{
    b = powdp(a,2); // a^2
}

This code takes 20 seconds to run on the TDA4VM C66 and 4 seconds on the TDA4VM CPU.

is this normal?

10 months ago

0 Asha Bhandarkar 10 months ago

TI__Genius 10170 points

Hi,

Can you specify what processor you are referring to when talking about TDA4VM CPU?

Can you provide more details on how you are measuring performance?

Best,

Asha

0 Lin AC 10 months ago in reply to Asha Bhandarkar

Prodigy 35 points

TDA4VM CPU: Arm Cortex-A72 64bits 2.0GHz

Test code:

Fullscreen

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
#include <math.h>
#include <stdio.h>
int main()
{
    double a = 142.224389823827;
    double b = 0;
    for (long i = 0; i < 100000000; i++)
    {
        b = pow(a,2); // a^2
    }
    
    return 0;
}
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

#include <math.h>
#include <stdio.h>


int main()
{
    double a = 142.224389823827;
    double b = 0;
    for (long i = 0; i < 100000000; i++)
    {
        b = pow(a,2); // a^2
    }
    
    return 0;
}

it takes 4 senconds

Compare with C66x，run code as:

Fullscreen

1
2
3
4
5
6
7
8
9
10
11
12
13
#include <ti/mathlib/mathlib.h>
int main()
{
    double a = 142.224389823827;
    double b = 0;
    for (long i = 0; i < 100000000; i++)
    {
        b = powdp_i(a,2); // a^2
    }
    
    return 0;
}
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

#include <ti/mathlib/mathlib.h>

int main()
{
    double a = 142.224389823827;
    double b = 0;
    for (long i = 0; i < 100000000; i++)
    {
        b = powdp_i(a,2); // a^2
    }
    
    return 0;
}

and it takes 20 seconds

0 Asha Bhandarkar 10 months ago in reply to Lin AC

TI__Genius 10170 points

Hi,

I wouldn't expect such performance on C66x when using the mathlib libraries.

Are you running on a TI TDA4VM EVM with the C66x processor running at 1.35GHz?

Best,

Asha

0 Lin AC 10 months ago in reply to Asha Bhandarkar

Prodigy 35 points

Hi，

I am sure it is running on the TDA4VM EVM with C66x, but I am not sure about the running frequency.

do you mean the frequency of the c66x is configurable?

0 Asha Bhandarkar 10 months ago in reply to Lin AC

TI__Genius 10170 points

Hi,

I just wanted to check what board you are running on, thank you for clarifying that it is a TI EVM.

I would expect you would get values closer to what we publish in the MATHLIB test report (MATHLIB_c66x_TestReport.html). The numbers here are reported in cycles however, not time.

I can get back to your issue and see if I can reproduce the numbers that you are seeing and provide an update by 5/30.

Best,

Asha

0 Lin AC 10 months ago in reply to Asha Bhandarkar

Prodigy 35 points

Hi，

Do you have any progress recently？

0 Asha Bhandarkar 10 months ago in reply to Lin AC

TI__Genius 10170 points

Hi,

I have made progress, however I will still need a few days to investigate further. I will try to provide an update on this by 6/4. I apologize for the delay.

Best,

Asha

0 Brijesh Jadav 9 months ago in reply to Asha Bhandarkar

TI__Guru**** 451155 points

Hi Asha Bhandarkar,

Any further update on this thread?

Regards,

Brijesh

+1 Asha Bhandarkar 9 months ago in reply to Brijesh Jadav

TI__Genius 10170 points

Hi,

I apologize for the delay in concluding the conversation on this issue.

I was able to run the same test code you have provided on C66x and for 100000000 iterations, I am seeing it takes a while for the C66 to complete execution as you have noted. We are currently not performing performance tests between A72 and C66x, so if in this case A72 is performing better, you can continue using the A72 core to run various functions if that is what best meets your performance and accuracy needs. Overall, we are not supporting further improvements on the C66x MATHLIB functions in our SDK releases, so the optimizations and performance at the function level will remain the same.

Best,

Asha

0 Lin AC 9 months ago in reply to Asha Bhandarkar

Prodigy 35 points

OK，Thanks!

Processors

Processors forum

TDA4VM: Is it normal that the mathlib powdp() function runs slower on the TDA4VM C66 than on the CPU?