HPI on DM365 fails if used simultaneously with Codec Engine and EDMA

HiPer

Other Parts Discussed in Thread: DM3730, OMAP3530

Problem statement: Host Port Interface fails when HPI data transfers interfere with EDMA(dm365mmap module) transfers and Codec Engine (h264dec) activity.

Hardware description: 2 evaluation modules, EVMOMAPL137 and EVMDM365 are connected by daughter card, which creates link between AEMIF expansion connector(P13) on EVMOMAPL137 and AEMIF expansion connector(J14) on EVMDM365. EVMOMAPL137 system boots and controls EVMDM365 via EMIFA/HPI link.

System settings: EVMDM365 is configured to HPI mode by setting SW4 switch to "111". AEMIF on EVMOMAPL137 is set to "Normal Mode" with "Extended wait cycles" enabled. Detailed configuration is here

1) Asynchronous Wait Cycle Configuration Register (WP0=1, MAX_EXT_WAIT=FFh)

2) Asynchronous n Configuration Register (SS = 0, EW = 1, W_SETUP = 0, W_STROBE = 0, W_HOLD = 0, R_SETUP = 0, R_STROBE = 7, R_HOLD = 0, TA = 0, ASIZE = 1)

Software: ti-dvsdk_dm365-evm_4_00_00_22 is used on DM365.

Test case: OMAP starts DM365 by uploading customized UBL, UBoot and Linux Kernel and following HPI Boot flow guidelines for DM365. After being started, DM365 runs using NFS file system. Now it can be controlled by means of linux commands issued via serial console. At the same time, OMAP system can perform data writes/reads to/from DM365's memory space.

I have made a simple program for DM365 which performs following tasks

decodes h264-video file (using DVSDK API);
performs dummy EDMA transfers (using DMAI's "Framecopy_..." routines with acceleration being activated);
allocates buffer which is used for holding data, transferred via HPI, and reports its physical address and size (using DMAI's "Buffer_..." routines).

Omap side application accepts remote buffer physical address and starts non-burst writes, followed by reads to ensure data integrity. After each write-read cycle system verifies "Asynchronous Timeout" bit in "EMIFA Interrupt Raw Register". If timeout occurred, application resets "Asynchronous Timeout" bit and repeats write-read cycle.

Simplified application for writing meaningless data into some buffer on DM365 is provided below:

int BufferDWSize   = 256*1024/4;    // predefined size of the remote buffer on DM365
int BufferPhysAddr = 0x84422000;    // predefined address of the remote buffer on DM365
int cntAemifErr = 0;                // Aemif errors counter
int cntReadErr = 0;                 // read-back errors counter

// data write loop
for (i = 0; i < BufferDWSize; i++) {
    int ref_value = i;
    int test_value = 0;

    do {
        // reset Asynchronous Timeout bit and wait for some time
        if (AEMIF_EIRR & 1) {
            AEMIF_EIRR = 1;
            AEMIF_EIMR = 1;
            taskDelay(sysClkRateGet()/1000);
        }
        HPIA_WRITE = BufferPhysAddr + i*4;    // set destination address
        HPID_WOAI_WRITE = ref_value;          // write some value
        test_value = HPID_WOAI_READ;          // read value back

        cntAemifErr += (AEMIF_EIRR & 1);
        cntReadErr += (test_value != ref_value);

    // repeat if either timeout or read error happened
    } while ((AEMIF_EIRR & 1) || (test_value != ref_value));
}

Issues: In its original state, when running all the above-mentioned tasks, system fails within several seconds from the start. In this condition, HPI module on DM365 seems to go into some unpredictable state, because Host System (OMAP) application stays infinitely in inner loop and AEMIF timeouts occur every time after setting destination address for HPI write/read operation. At the same time, DM365 starts displaying NFS error message ("nfs: server 192.168.1.242 not responding, still trying"), stops video decoding (presumably because it can't read data from file system anymore), does not respond to keyboard input and sometimes after a minute or so displays kernel error message.

Also, read-back errors happen much more often than AEMIF timeouts. It means, it is impossible to tell that data has been delivered properly without reading it back or applying some kind of error correction.

I have performed several tests in different configurations and discovered that system works fine in these cases:

	HPI	EDMA	Decode
1	not active	active	active
2	active	active	not active
3	active	not active	active

It also works fine if all EDMA-transfers are replaced by memcpy (when Framecopy job is created with acceleration deactivated). Neither read-back errors, nor AEMIF timeouts take place in this case. No errors at all. However, this mode is undesirable, because it creates heavy CPU load.

It can be seen from this table, that HPI seems to conflict with EDMA and Decode jobs only when both are active.

Questions:

Could you make suggestion on how EDMA transfers and H264 decoding task on DM365 can be configured in order to avoid conflicting situations with HPI transfers initiated by Host system?
Could you provide some guidelines on how to avoid HPI stalling conditions (how to get information about when HPI fifo is full and cannot deliver data because memory subsystem is busy and when it is empty and can accept new data) ?
Also, could you provide some recommendations on how to design hardware and software for safe HPI-data transfer. Because in some cases data appear to be corrupt and there is no indication about this condition to Host system because "Aemif Timeout" flag remains zero during such transfers ?
I may admit, that AEMIF timeouts might not always be registered by OMAP hardware and therefore read-back errors take place more often. Could you tell me then what can happen to HPI-module if supposed AEMIF timeout has been ignored and data write continues without making a pause ? Can it go to some unpredictable state and corrupt data within some arbitrary memory range ?
If AEMIF timeout happened, how much host application should wait before repeating transfer again ?

over 14 years ago

0 Jeff Zhang over 14 years ago

Intellectual 280 points

Hello HiPer,

It's lucky to notice your post. I'm writing HPI related stuffs, which will configure DM368 to HPI mode. But I'm totally a newbie in this respect. Could you give me any guide? Especially for the following thing:

OMAP starts DM365 by uploading customized UBL, UBoot and Linux Kernel and following HPI Boot flow guidelines for DM365.

How to do that, or where can I get HPI boot flow guideline for DM365, or could you provide me the code directly? Any help will be greatly appreciated!

My email is jeffzh.cn@gmail.com

Thanks in advance,

Jeff

0 HiPer over 14 years ago in reply to Jeff Zhang

Prodigy 200 points

refer to sprufg5a.pdf, paragraph "11.2.7 ARM ROM HPI Mode"

0 Jeff Zhang over 14 years ago in reply to HiPer

Intellectual 280 points

HiPer,

Thank you for your reply.

It's helpful for me to understand HPI boot mode. But how does Host ACK its presence? Could you share the code at the Host side with me. My Host will be OMAP3530/DM3730. I couldn't find out similar HPI driver for Host.

Thanks a lot!!

Jeff

0 HiPer over 14 years ago in reply to Jeff Zhang

Prodigy 200 points

First of all, I assume you are already familiar with HPI User's Guide, you have connected your Host processor to DM368 in the same way as it is described on page 9 "HPI Block Diagram" picture and you can access to HPIC, HPIA, HPID. Please pay attention to HCNTLA and HCNTLB signals, as there might be an error in documentation and real signals can be interchanged (refer to this post). Also, don't be confused about not being able to access DM368's DDR2 memory range, because at the very beginning DDR controller is not initialized. Let's focus on two 16K blocks called ARM0 RAM and ARM1 RAM.

Now, lets jump to boot process. I will give you process description in plain text.

Reset you DM368. I suppose OMAP3530 has at least one unused GPIO pin, so you can route it to RESETN pin of DM368.
Set halfword ordering bit in HPIC.HWOB. "1" worked for me.
Wait until RBL signals the host that the device is ready via the host interrupt / HINTN signal. You may check for "HPIC.HINT==1" condition in while loop.
Acknowledge interrupt by setting HPIC.HINT to "1".
Verify whether DM368 can accept interrupt by polling HPIC.ARMINT for this this condition "HPIC.ARMINT == 0".
Do ACK via interrupt. Set HPIC.ARMINT to "1". Now DM368 is completely under your control.
Upload your UBL to DM368 starting from 0x00010020 (you may customize and rebuild basic UBL from PSP, refer to Building the User Boot Loader for more detailed explanation). I assume you already familiar with HPI Write Bursting, so I don't have to explain how to write something to DM368's memory space.
Write Magic Number and UBL entry point as described in "Table 125. UBL Descriptor for HPI Boot mode". You may try addresses provided in the table. But it didn't work in my case and I used other addresses. To be exact, "0x00017FE0" for magic number and "0x00017FE4" for entry point.
Poll HPIC.ARMINT for this this condition "HPIC.ARMINT == 0".
Notify that copying is done by interrupting DM368. Set HPIC.ARMINT to "1".
RBL must jump to your customized UBL.
I used notification from DM368's UBL to Host CPU via interrupt in order to understand that initialization by UBL is done. You can do the same if you consider it reasonable.
I also thought it would be wise to check whether DDR2 is accessible after UBL is done. So you may write something to 0x80000000(please verify DM368's memory map), read it back and compare. This step will ensure that initialization is properly done and we are ready to copy u-boot, linux kernel and ramfs binaries.
For some reasons it doesn't always work from the first attempt in my case. So if step 13 failes, I jump to step 5 again.

I think it is enough information for now. Good luck with your booting process coding. And please try to follow official guidelines, as you may not encounter the same problems as I because your DM368 may have them fixed.

0 Jeff Zhang over 14 years ago in reply to HiPer

Intellectual 280 points

Many thanks for your so detailed explaination. It's enough for me.

PS: I can't verify answer here. If you would, please post the answer here: http://e2e.ti.com/support/dsp/davinci_digital_media_processors/f/100/p/122515/436769.aspx#436769, so I can Verify Answer.

Thanks again,

Jeff

0 HiPer over 14 years ago in reply to Jeff Zhang

Prodigy 200 points

You are welcome.

I sincerely hope you will not have the same problem with data corruption when HPI data transfers happen simultaneously with stream decoding. I would recommend you to test this issue asap on your system, so that you may consider to use other interfaces for data transmission between two processors, such as ethernet. It is causing a great deal of trouble in our design.

0 Steve Chen1 over 12 years ago in reply to HiPer

Prodigy 190 points

Dear HiPer,

Not sure if you are still able to follow this thread. My question to you is that have you gotten the issue of HPI data transfer simultaneously with stream decoding fixed?

Processors

Processors forum

HPI on DM365 fails if used simultaneously with Codec Engine and EDMA