TDA4VM: Limiting MSMC

Dagan Levy

Part Number: TDA4VM

How do we limit the memory size allocated for the NN accelerator (MSMC) from 8 mb to 2 mb as suggested by Pramod?

Thanks,
- Yuval

over 2 years ago

0 Nikhil Dasan over 2 years ago

TI__Guru* 87481 points

Hi Yuval,

Please let me know if you have cloned and setup the below repo?

GitHub - TexasInstruments/edgeai-tidl-tools: Edgeai TIDL Tools and Examples - This repository contains Tools and example developed for Deep learning runtime (DLRT) offering provided by TI’s edge AI solutions.

If yes, you could do the change in the device config present in the tidl tools folder and reimport the model

Regards,

Nikhil

0 Pramod Kumar Swami over 2 years ago in reply to Nikhil Dasan

TI__Genius 14090 points

Adding more details:

you should change the value MSMCSIZE_KB to 2048?

With this change, you do not need to change anything on the board itself for runtime measurements, and only change required is in the CNN model conversion process. Even though the board has 8 MB MSMC memory size, the inference solution will use only 2 MB from the available memory.

2. In order to simulate AM62A runtime, you need to do below changes along with limiting MSMC size to 2 MB

"advanced_options:high_resolution_optimization":1,
"tensor_bits": 16,

With these changes, you can get very close approximation of runtime to be expected in AM62A using tidl-tools version 9.0 (expected at July). The number what you get with these settings can be slightly poor (~20%) compared to what you will get on AM62A, but it should give you good reference and these settings can also be reliably used for relative comparison between 2 models

Thanks

Pramod

0 Dagan Levy over 2 years ago in reply to Pramod Kumar Swami

Prodigy 40 points

Hi Pramod
We changed
"MSMCSIZE_KB = 7968" to "MSMCSIZE_KB = 2048" in the device_config.cfg in tidl_tools and did 3 conversion runs (tests):

1.
"tensor_bits": 16,
"advanced_options:high_resolution_optimization": 1,
failed compilation,

------------------ Network Compiler Traces -----------------------------
*** stack smashing detected ***: <unknown> terminated
Aborted (core dumped)

2.
"tensor_bits": 16,
"advanced_options:high_resolution_optimization": 0,
failed compilation

------------------ Network Compiler Traces -----------------------------
*** stack smashing detected ***: <unknown> terminated
Aborted (core dumped)

3.
"tensor_bits": 8,
"advanced_options:high_resolution_optimization": 0,
successful compilation.

I will attach the log for experiment (2) with debug level 2 in email

0 Pramod Kumar Swami over 2 years ago in reply to Dagan Levy

TI__Genius 14090 points

Sure please attach the logs or one sample network if you can share.

Thanks,

With Regards,

Pramod

0 Pramod Kumar Swami over 2 years ago in reply to Pramod Kumar Swami

TI__Genius 14090 points

Most likely it is because of network resolution or intermediate tensor size being very high, can you please tell the intermediate tensor size (worst case)?

Pramod

0 Dagan Levy over 2 years ago in reply to Pramod Kumar Swami

Prodigy 40 points

Hi Pramod
Since e2e is open to the general public I cannot share the logs here, I attached them in mail
Also, we tried it on a very simple (one convolution) model and it converted successfully... due to time-constraints it is not ideal to do trial-error runs until we get a sample network that crashes

Thanks,
- Yuval

0 Dagan Levy over 2 years ago in reply to Pramod Kumar Swami

Prodigy 40 points

biggest intermediate tensor dimension size is 8847360 (tensor size would be twice that, since it's 16bit 17,694,720)

0 Pramod Kumar Swami over 2 years ago in reply to Dagan Levy

TI__Genius 14090 points

Next steps:

1. Share some of the logs from the folder of executables (edgeai-tidl-tools). The logs should be with files having extension of .csv, .txt, .log

Thanks

Pramod

0 Pramod Kumar Swami over 2 years ago in reply to Pramod Kumar Swami

TI__Genius 14090 points

Hi Yuval/Hani,

I am sharing a updated ti_cnnperfsim.out (with debug mode and more traces enabled) at below link, you can use this to generate additional logs. Please replace existing file with same in edgeai-tidl-tols with this file

https://tidrive.ext.ti.com/u/3iCgaoKgoGDMIKBn/45342fc8-52ca-4322-a881-88c43f6fc5a9?l

I will be sharing access code via email

Thanks,

with Regards,

Pramod

0 Dagan Levy over 2 years ago in reply to Pramod Kumar Swami

Prodigy 40 points

replied with files via email

Processors

Processors forum

TDA4VM: Limiting MSMC