Hello,
Over the past few months we have begun using DMA on the SCI2 port for transmit operations only. Another coworker has also been using DMA for transmit operations on the SCI1 port in a separate project, and both of these DMA implementations have been seemingly stable. We are now at the stage of combining both implementations in order to use DMA on both SCI1 and SCI2 for transmit operations only. This should be possible according to the technical reference manual, as it indicates that the DMA supports concurrent operations on up to two channels.
After turning on the DMA for both SCI ports I began to notice strange behavior; specifically, the SCI1 DMA seemed to stop working after ~1 minute of operation, and the SCI2 DMA would stop after 5-10 minutes or so. This was empirically observed by monitoring the end devices connected to these ports (a PC-side COM port application for SCI1, and a second MCU on SCI2). In an effort to determine what exactly was causing this behavior, I implemented some basic DMA test functions and am seeing unexpected results.
One of the most bizarre things that I have been seeing is a delay in the DMA pending register being set. I have code which initiates the DMA transfer by configuring the control packet and writing the appropriate bit of the SCI SETINT register, and following this enters a loop which waits for the DMA PEND register to indicate that the channel has been serviced. This logic follows the following blurb from the TRM:
The DMA controller cannot recognize two software requests on the same channel if the first
software request is still pending. If such a request occurs, the DMA will discard it. Therefore,
the user software should check the pending register before issuing a new software request.
Strangely, the loop seems to exit after only a single register check. There is no implemented delay or sleep function within the loop, which indicates that the pending register is cleared nearly immediately after starting the transfer (within microseconds). The buffer size + baud rate are configured to take roughly ~350 microseconds to transfer, so this indicates that the pending register is not being properly set. I have the BTC interrupt set a flag (declared globally and volatile) as well upon transfer completion, and this flag is also not set after the pending loop is skipped. What is perhaps most strange is that the transfer actually completes! I can observe that the bytes are sent using my PC COM reader. Using the JTAG debugger on the Launchpad, I have consistently observed the following sequence of events:
- DMA transfer function is called, which configures the control packet and calls dmaSetCtrlPacket() and dmaSetChEnable() HAL functions. dmaSetChEnable() is called to trigger on a HW request
- A SW breakpoint is triggered on the line of code where the SCI SETINT register bit 16 is set, which allows the SCI transfer to occur. At this point the COM reader does not show any data
- I allow the code to continue running and a SW breakpoint triggers on the line of code following the PEND register check loop. The loop counter has a value of '1' which indicates that the loop has exited nearly immediately. The BTC interrupt completion flag has not been set, indicating that the interrupt has not executed. However, I see data transferred using the COM reader at this point, indicating that the transfer has concluded
- At a seemingly arbitrary point in code execution, a breakpoint in the BTC interrupt function occurs and the flag is now set
- While stepping through the code, an additional check of the PEND register now shows that the transfer is pending. This happens after the BTC interrupt occurs, which should be the end of the transfer
A watered-down version of the code is presented below:
void unit_test(void) { //Initialize test variables, other function calls //... /* * Start a transfer on UART0. Loop until the DMA is no longer * pending and verify that the callback flag is incremented. */ dma_ut_uart0_complete = 0U; dma_tf_rtn = DMA_start_transfer(DMA_UT_CH_UART0, DMA_UT_DATA_UART0, DMA_UT_DATA_UART0_LEN, DMA_UT_DATA_UART0_ADDR); if(dma_tf_rtn != DMA_STARTED) { dma_unit_rtn = FAILURE; } else { /* Enable interrupt to kick-off transfer */ DMA_unit_test_enable_int_uart0(); } dma_pend_ctr = 0U; while(DMA_is_pending(DMA_UT_CH_UART0) == DMA_PENDING) { dma_pend_ctr++; if(dma_pend_ctr > DMA_UT_PENDING_CTR_THRESHOLD) { dma_unit_rtn = FAILURE; break; } } //At this point, the value of dma_pend_ctr is either 0 or 1 (not consistent) /* Verify that callback complete flag has incremented */ if(dma_ut_uart0_complete != 1U) { dma_unit_rtn = FAILURE; } /* * Start another transfer and immediately attempt a third. Verify * that the function call returns FAILURE as the DMA is busy. Verify * that the transfer ultimately completes. */ dma_ut_uart0_complete = 0U; //--- NOTE!!! --- //The BTC interrupt breakpoint occurs while stepping through //the following function. Also within this function is a call //to DMA_is_pending() which now shows that the DMA channel //is pending - even though the interrupt has indicated that it //has completed! dma_tf_rtn = DMA_start_transfer(DMA_UT_CH_UART0, DMA_UT_DATA_UART0, DMA_UT_DATA_UART0_LEN, DMA_UT_DATA_UART0_ADDR); if(dma_tf_rtn != DMA_STARTED) { dma_unit_rtn = FAILURE; } else { /* Enable interrupt to kick-off transfer */ DMA_unit_test_enable_int_uart0(); } } //End unit test //DMA_is_pending() DMA_Pending_Status_t DMA_is_pending(uint8_t dma_pend_channel) { DMA_Pending_Status_t dma_pending = DMA_INVALID_ARGS; uint32_t dma_pend_ch = 0U; /* Sanity check */ if(dma_pend_channel < DMA_NUM_CHANNELS) { dma_pend_ch = (dmaREG->PEND) & (uint32_t)(1U << dma_pend_channel); if(dma_pend_ch > 0U) { dma_pending = DMA_PENDING; } else { dma_pending = DMA_NOT_PENDING; } } return dma_pending; } //DMA_start_transfer() DMA_Transfer_Return_t DMA_start_transfer(uint8_t dma_tf_channel, uint8_t* dma_tf_data, uint16_t dma_tf_len, uint32_t dma_tf_dest) { DMA_Transfer_Return_t dma_start_rtn = DMA_ARG_ERROR; DMA_Pending_Status_t dma_pend = DMA_PENDING; uint16_t dma_append_rtn = 0U; uint8_t* dma_dat_ptr = dma_tf_data; uint16_t dma_len = 0U; /* * Sanity check: * - Channel is valid * - Channel is enabled * - Destination memory location is valid */ if((dma_tf_channel < DMA_NUM_CHANNELS) && (dma_channels[dma_tf_channel].dma_ch_en > 0U) && (dma_tf_dest != 0U)) { /* Check DMA pending status */ dma_pend = DMA_is_pending(dma_tf_channel); /* * If buffering is enabled: * 1) Append data if the pointer is not NULL and length > 0 * 2) Check to see if the DMA is busy * 3) Delete old + Fetch new data for transfer if not busy * 4) If fetch return is > 0, send to DMA * * If the pointer/length are NULL/0, the DMA will be * primed without buffering any data. */ if((dma_channels[dma_tf_channel].dma_ch_buff_en > 0U) && (dma_channels[dma_tf_channel].dma_ch_buff_ptr != NULL)) { /* 1) */ if((dma_tf_data != NULL) && (dma_tf_len > 0U)) { dma_append_rtn = DBUF_append(dma_channels[dma_tf_channel].dma_ch_buff_ptr, &dma_dat_ptr, dma_tf_len); /* Return value > 0 indicates that not all data was appended */ if(dma_append_rtn > 0U) { /* TODO Generate WCAN? Anything to flag? */ } } /* 2) */ if(dma_pend == DMA_NOT_PENDING) { /* 3) */ if(dma_channels[dma_tf_channel].dma_ch_buff_tf_len > 0U) { DBUF_delete(dma_channels[dma_tf_channel].dma_ch_buff_ptr, dma_channels[dma_tf_channel].dma_ch_buff_tf_len); } /* * After call to Fetch, dma_dat_ptr will point to the "tail" * of the data in the buffer, e.g. the data to transmit. */ dma_len = DBUF_fetch(dma_channels[dma_tf_channel].dma_ch_buff_ptr, &dma_dat_ptr, dma_channels[dma_tf_channel].dma_ch_buff_ptr->dbuf_size); /* Store length for deletion next cycle */ dma_channels[dma_tf_channel].dma_ch_buff_tf_len = dma_len; /* Check fetch return to set function return */ if(dma_len == 0U) { dma_start_rtn = DMA_BUFFER_EMPTY; } } else { /* DMA is busy - set flag to do nothing */ dma_len = 0U; dma_start_rtn = DMA_BUSY_BUFFERED; } } /* * If buffering is not enabled, verify that the data pointer * and length are valid. */ else if((dma_tf_data != NULL) && (dma_tf_len > 0U)) { /* If the DMA is free, set length of transfer */ if(dma_pend == DMA_NOT_PENDING) { dma_len = dma_tf_len; } /* If the DMA is busy, set length to 0 to prevent transfer */ else { dma_len = 0U; dma_start_rtn = DMA_BUSY_NO_BUFFER; } } } /* * If the above checks have passed, start the transfer */ if(dma_len > 0U) { /* * Populate DMA control packet * * CHCTRL - Channel Control * ELCNT - Element count * ELDOFFSET - Element Destination Offset * ELSOFFSET - Element Source Offset * FRDOFFSET - Frame Destination Offset * FRSOFFSET - Frame Source Offset * PORTASGN - Port Assignment * RDSIZE - Read Size * WRSIZE - Write Size * TTYPE - Transfer Type * ADDMODERD - Address Mode Read * ADDMODEWR - Address Mode Write * AUTOINIT - Auto Initialization Mode */ dma_channels[dma_tf_channel].dma_ctrl.CHCTRL = 0U; dma_channels[dma_tf_channel].dma_ctrl.ELCNT = 1U; dma_channels[dma_tf_channel].dma_ctrl.ELDOFFSET = 0U; dma_channels[dma_tf_channel].dma_ctrl.ELSOFFSET = 0U; dma_channels[dma_tf_channel].dma_ctrl.FRDOFFSET = 0U; dma_channels[dma_tf_channel].dma_ctrl.FRSOFFSET = 0U; dma_channels[dma_tf_channel].dma_ctrl.PORTASGN = PORTA_READ_PORTB_WRITE; dma_channels[dma_tf_channel].dma_ctrl.RDSIZE = ACCESS_8_BIT; dma_channels[dma_tf_channel].dma_ctrl.WRSIZE = ACCESS_8_BIT; dma_channels[dma_tf_channel].dma_ctrl.TTYPE = FRAME_TRANSFER; dma_channels[dma_tf_channel].dma_ctrl.ADDMODERD = ADDR_INC1; dma_channels[dma_tf_channel].dma_ctrl.ADDMODEWR = ADDR_FIXED; dma_channels[dma_tf_channel].dma_ctrl.AUTOINIT = AUTOINIT_OFF; dma_channels[dma_tf_channel].dma_ctrl.SADD = (uint32_t)dma_dat_ptr; dma_channels[dma_tf_channel].dma_ctrl.DADD = dma_tf_dest + DMA_DEST_ADDR_OFFSET; dma_channels[dma_tf_channel].dma_ctrl.FRCNT = (uint32_t)dma_len; /* Set DMA control packet */ dmaSetCtrlPacket((dmaChannel_t)dma_tf_channel, dma_channels[dma_tf_channel].dma_ctrl); /* Set the DMA channel to trigger on HW request */ dmaSetChEnable((dmaChannel_t)dma_tf_channel, DMA_HW); /* Set return */ dma_start_rtn = DMA_STARTED; } return dma_start_rtn; } /* * ---- ISR call sequence */ void dmaBTCAInterrupt(void) { /* * MISRA-C:2004 11.3/A Exception: * The TI HAL code defines structures to represent the peripheral * registers. The physical memory address of the peripheral registers * is cast as a pointer to the defined HAL structure, and this pointer * is used to access the registers. De-referencing the pointer therefore * provides access to the register value. */ uint32_t dma_int_channel = dmaREG->BTCAOFFSET; if(dma_int_channel != 0U) { /* * Offset register is 0 if there are no interrupts pending, and * (channel + 1) otherwise. Decrement the offset by 1 to get the * interrupt channel */ dmaGroupANotification(BTC, dma_int_channel - 1U); } } void dmaGroupANotification(dmaInterrupt_t inttype, uint32_t channel) { /* Execute channel callback based on type */ if(channel < DMA_NUM_CHANNELS) { switch(inttype) { case FTC: { if(dma_channels[channel].dma_cbs.dma_ch_ftc_cb != NULL) { (*dma_channels[channel].dma_cbs.dma_ch_ftc_cb)(); } break; } case LFS: { if(dma_channels[channel].dma_cbs.dma_ch_lfs_cb != NULL) { (*dma_channels[channel].dma_cbs.dma_ch_lfs_cb)(); } break; } case HBC: { if(dma_channels[channel].dma_cbs.dma_ch_hbc_cb != NULL) { (*dma_channels[channel].dma_cbs.dma_ch_hbc_cb)(); } break; } case BTC: { if(dma_channels[channel].dma_cbs.dma_ch_btc_cb != NULL) { //This callback is set during the unit test for //the test channel (UART0) (*dma_channels[channel].dma_cbs.dma_ch_btc_cb)(); } break; } default: { /* Should never reach - do nothing */ break; } } /* End switch() */ } /* End if() */ } static void DMA_unit_test_btc_callback_uart0(void) { sciREG1->CLEARINT = (1U << DMA_UT_UART_DMA_BIT); dma_ut_uart0_complete++; dma_ut_uart0_cend = get_CPU_cycles(); }
I understand that there is likely some delay within the VIM in generating the interrupt, so I can understand that this fires later than expected. What concerns me though is that the PEND register does not indicate that the channel is pending service after initiating the transfer, and only seems to indicate that it is pending after some sort of delay. To summarize, my questions are essentially:
- Why am I observing a delay in the DMA PEND register being set for the channel?
- Is the use of the DMA PEND register to check for the channel being unavailable a reliable method?
- Why does the DMA PEND register continue to indicate that the channel is pending after the BTC interrupt occurs?
- Does the use of JTAG interfere with this testing in any way? E.g. staving off interrupts and/or messing with timing
Any advice or wisdom is much appreciated. If I have misunderstood the operation of the DMA or use of the PEND register, please correct my understanding as well.
Best,
James