TMS570LC4357: Rx Issue When SCI used over DMA

Srihari Manthri

Part Number: TMS570LC4357
Other Parts Discussed in Thread: HALCOGEN

Hi Team,

In our Application for TMS570LC4357, For Serial communication we are using the SCI3 over DMA. the packet configuration of the DMA is similar as per the example provided in the halcogen, with TTYPE configured similar as Frame Transfer. On the Rx side we are seeing the data is getting missed after receiving some number of bytes.

Observation 1:
We have connected the PC to the Rx side of processor and we are transmitting the data from the PC with 100ms rate and bytes which we are sending at a time is 200.
In the DMA packet the configured Rx Frame size is 512 and element size as 1. We have observed that sometime after receiving ~1.6Lakh bytes the data is getting missed. The size of data missed and instance of this occurrence is not fixed every time. it may change with the iterations.

Observation 2:
As the connections mentioned in the Observation 1, if we try to increase the data size to 300 with the same rate, then the occurrence of issue is pretty quick if the configured frame size is same.

Observation 3:
As the size mentioned in the Observation 1, if we keep it same i.e. 200, and if we increase the rate of that transmission from the PC to 200ms then we have observed that the occurrence of that issue gets delayed to some extent.

Currently we configured the SCI to operate with 57600 baud rate with 1 stop bit and no parity.

Request you to please help us in resolving this issue as this is impacting the quality of the SCI operation as the time progresses when we used it over DMA.

Regards.

over 4 years ago

0 QJ Wang over 4 years ago

TI__Guru**** 197126 points

Srihari Manthri said:
The size of data missed

Do you mean the transfer size is overwritten?

0 QJ Wang over 4 years ago in reply to QJ Wang

TI__Guru**** 197126 points

Can you change the DMA frame size to the number of bytes you are sending: 200bytes, or 200 bytes?

0 QJ Wang over 4 years ago in reply to QJ Wang

TI__Guru**** 197126 points

I did a test between SCI2 and SCI3. SCI3 transfers 200 bytes (frames) per transaction, and transfers totally 100 times. SCI2 receives all the data correctly.

Both SCI2 and SCI use DMA.

0 Srihari Manthri over 4 years ago in reply to QJ Wang

Intellectual 280 points

Hi Wang,

Thanks for your quick response. Actually we cannot control the size of the frame in the receive packet, as it is not fixed every time that how many bytes we will be receiving.

In Fact, we also wanted to know the significance of the shared ram. I have attached here the ti ticket which indicate that to use the shared ram memory region for storing the DMA buffers. I would like to understand in detail what would be the impact if we do not use the shared ram concept. If we don't use that still it will work? or any other impacts also it will have.

Currently, in our code we have not configured the shared ram, we are using the RAM sections only for the DMA buffers, where the policies used for the complete RAM is Normal, write through with no write allocate with read/write access in privileged and user mode both Here, we are able to establish the communication but observing this issue of missing the data bytes as time progresses.

0 QJ Wang over 4 years ago in reply to Srihari Manthri

TI__Guru**** 197126 points

Since the memory is configured as write-through with no write allocate, the data is written to main memory (either cache hit, or cache miss). But all the cacheable memory is read-allocate. When CPU reads data from SRAM (or DMA receive buffer), it will read the data present in the cache (on cache hit) and not new data available in the SRAM.

You can perform cache invalidate before reading operation. After cache invalidate, the CPU reads from the cache will then be coherent.

0 Kishore Bolisetti over 4 years ago

Prodigy 185 points

Hi Wang, I believe DMA engine will copy the received UART data into the physical memory space (DMA buffers). User application when accessing if not ensuring cache coherency, can read incorrect data from cached DMA buffers?

Do you suggest if we have to put all DMA buffers into one memory area forced through linker and use MPU to set specific memory protection attributes? Do you think we need to inhibit cache for this region? or we need to keep it enabled? if need to be enabled, then what is the suggested cache policies we need to use?

Could you please provide your feedback on these clarifications.

Thanks,

Kishore

0 QJ Wang over 4 years ago in reply to Kishore Bolisetti

TI__Guru**** 197126 points

Hello Kishore,

Kishore Bolisetti said:
User application when accessing if not ensuring cache coherency, can read incorrect data from cached DMA buffers?

QJ> Yes, you are correct.

Kishore Bolisetti said:
Do you suggest if we have to put all DMA buffers into one memory area forced through linker and use MPU to set specific memory protection attributes? Do you think we need to inhibit cache for this region? or we need to keep it enabled? if need to be enabled, then what is the suggested cache policies we need to use?

QJ> I prefer to allocate a shared memory region for DMA buffers. For example

#pragma SET_DATA_SECTION(".sharedRAM")

uint32 tx_buffer[128]; /* transmit buffer in sys ram */
uint32 rx_buffer[128]= {0}; /* receive buffer in sys ram */

#pragma SET_DATA_SECTION()

Linker cmd:

MEMORY
{

...
SHAREDRAM (RW) : origin=0x0807A000 length=0x0005000
}

SECTIONS
{

...
.sharedRAM : {} > SHAREDRAM
}

HALCoHen MPU setting:

Select a unused MPU region for SHAREDRAM memory region, configure it as Write-through and write-allocate.

But DMA write and before CPU read, cache invalidation is still needed.

1. DMA writes data to rx_buffer[]

2. Invalidate the cache lines for rx_buffer[]

3. CPU reads rx_buffer[]. Since read-allocate policy, a cache line is allocated and rx_buffer[] is copied from SRAM to cache.

0 QJ Wang over 4 years ago in reply to QJ Wang

TI__Guru**** 197126 points

If write-back and write-allocate policy is used, you need to perform a cache clean after CPU write to tx_buffer[] and before DMA transfers.

0 QJ Wang over 4 years ago in reply to QJ Wang

TI__Guru**** 197126 points

TMS570LC43x device has a module called uSCU. This unit ensures data coherency between the d-cache and other masters (e.g. DMA and Ethernet) on write-trough regions.

So the step #2 (invalidate cache) is not needed.

0 Kishore Bolisetti over 4 years ago

Prodigy 185 points

Hi Wang. We have configured the settings as per your guidance. But we have not found the 'Write-through and write-allocate' configuration but we did 'Write-through and no write-allocate' and still saw that the receive bytes were missing. In this case, we have not done explicit flush and invalidate the shared memory area, but it is not making any difference if we do it also.

When we completely disabled the cache for share memory region, we are not seeing any issue. We were still wondering, there is something to do with cache.

You mentioned 'uSCU' will take care making the cache cohering when DMA engine copies the data into RX DMA buffers, if that is the case, why do you think we are still seeing the issue?

Thanks,

Kishore

0 QJ Wang over 4 years ago in reply to Kishore Bolisetti

TI__Guru**** 197126 points

Hi Kishore,

DMB instructions may be needed to ensure that writes to the SRAM is complete before reading the data.

Arm-based microcontrollers

Arm-based microcontrollers forum

TMS570LC4357: Rx Issue When SCI used over DMA