I started from the EDMA3 example code with SYS/BIOS provided by TI, and I use it to transfer a block of 32-bit integers from the MSMCSRAM of one C6678 DSP to the MSMCSRAM of another C6678 DSP over PCIe. Where in the code should I place the cycle counters to measure the number of cycles the transfer consumes?
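To make the question concrete, one possible placement is sketched here. It is only an illustration, not code from the TI example: `configure_transfer`, `trigger_transfer`, `wait_for_completion`, `read_cycles` and `timing_t` are hypothetical placeholders, and the cycle counter is simulated in software so the sketch builds on a host. On the C6678 itself, `read_cycles` would presumably read the `TSCL` time-stamp register instead.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for the hardware cycle counter. On the DSP this
   would presumably be a read of TSCL; here it is a software counter. */
static uint32_t fake_cycles = 0;
static uint32_t read_cycles(void) { return fake_cycles; }

/* Hypothetical placeholders for the three phases of the transfer.
   They are NOT TI's EDMA3 API; each one just advances the fake counter. */
static void configure_transfer(void) { fake_cycles += 10; }

static void trigger_transfer(uint32_t *dst, const uint32_t *src, size_t n)
{
    memcpy(dst, src, n * sizeof *src);   /* simulate the data movement */
    fake_cycles += (uint32_t)n;          /* pretend one cycle per word  */
}

static void wait_for_completion(void) { fake_cycles += 5; }

typedef struct {
    uint32_t setup_cycles;     /* cost of programming the transfer      */
    uint32_t transfer_cycles;  /* trigger until completion is observed  */
} timing_t;

/* Counter reads are placed around each phase so that setup cost and
   transfer cost are reported separately. */
static timing_t timed_transfer(uint32_t *dst, const uint32_t *src, size_t n)
{
    timing_t t;

    uint32_t t0 = read_cycles();         /* before configuration         */
    configure_transfer();
    uint32_t t1 = read_cycles();         /* just before the trigger      */
    trigger_transfer(dst, src, n);
    wait_for_completion();
    uint32_t t2 = read_cycles();         /* right after completion       */

    /* Unsigned subtraction stays correct across one counter wraparound. */
    t.setup_cycles = t1 - t0;
    t.transfer_cycles = t2 - t1;
    return t;
}
```

With this split, `transfer_cycles` covers only the interval from the trigger to the observed completion, while `setup_cycles` isolates the configuration overhead. Whether that is the right boundary for a PCIe transfer between two devices is exactly what I am unsure about.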