Tool/software: TI C/C++ Compiler
Hello, everyone! I am currently working on a project that involves a matrix transposition. In detail: given a complex matrix with 32 rows and 16*1024 columns, I tried the function DSPF_sp_mat_trans_cplx() with the data in DDR3, and it took 50 ms to finish. I also tried doing it with DMA, but it seems the row length is larger than the stride length in the A dimension, so I used two DMA transfers to do the transposition. Unfortunately, that doubled the time, to almost 100 ms.
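To make the operation concrete, here is a minimal plain C sketch of the transposition I mean. It assumes single-precision data stored row-major as interleaved real/imaginary pairs. The name ref_mat_trans_cplx is hypothetical and only for illustration; it is not a DSPLIB function and not the code I actually timed.

```c
#include <stddef.h>

/* Hypothetical reference routine (not part of DSPLIB).
 * Transposes a rows x cols complex matrix stored row-major as
 * interleaved (re, im) float pairs, so that y[c][r] = x[r][c].
 * x and y must not overlap. */
void ref_mat_trans_cplx(const float *x, size_t rows, size_t cols, float *y)
{
    for (size_t r = 0; r < rows; r++) {
        for (size_t c = 0; c < cols; c++) {
            size_t src = 2 * (r * cols + c);  /* index of re part of x[r][c] */
            size_t dst = 2 * (c * rows + r);  /* index of re part of y[c][r] */
            y[dst]     = x[src];      /* real part */
            y[dst + 1] = x[src + 1];  /* imaginary part */
        }
    }
}
```

For my case (rows = 32, cols = 16*1024), under this layout each of the input and output buffers holds 32 * 16384 * 2 floats, which is 4 MiB.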
Now I am desperate for any help. Please tell me if there is any way to finish this task faster and more efficiently. I look forward to your advice!