pa SimpleExample questions

Aamir Husain

I have been looking through the simpleExample that comes with the PA in the pdk subdirectory and I had a few questions that need clarification.

The setupFlow0 function sets up the default return queue as 902 which is what is used for PA cmd reply. However in the case of pkts received by the PA on loopback, the CONFIG_ADDPORT0_ROUTE is used in config.h to set up the queue CONFIG_FIRST_RX_PKT_Q i.e. 904 for routing packets to the host overriding this default setup. The CONFIG_PACOM_REPLY is also setup to use flow 0 and it uses the dest queue as 902 also. I guess my question really is why the need to setup the default return queue number as it is always setup by the PA for all packets being received such as command reply and others packets being received from the loopback. Is it just for completeness sake?

Thanks, Aamir

over 13 years ago

0 Eric Ruei over 13 years ago

TI__Intellectual 2810 points

Hi, Aamir:

It is my pleasure to talk to you again.
Please note that the PA simple example is a simple example, but it may not be the best example for CPPI flow configuration and etc.
Let me try to answer your questions below:

I have been looking through the simpleExample that comes with the PA in the pdk subdirectory and I had a few questions that need clarification.

The example talks of two tables of layer 2/3 configuration information. When setting up the MAC address with the pa_addMac it make changes to the memL2Ram by writing the mac address amongst other things into the memory. Similarly it does this for the IP address into the memL3Ram when doing pa_addIP. However where does the port info get stored or is that because it is just a port number so it does not need to be saved like the IP and Mac addresses? I gather these are just duplicates of the actual tables stored in the PA for the purpose of preparing the actual command to send to the PA to setup the tables and the PA hardware actually contains the LUT table and the configuration is just supposed to write to those tables? If doing custom LUT2 would I need a layer 4 Ram section?

[Eric] PA LLD maintains local L2/L3 tables so that LLD is able to link L3/L4 entry to the previos L2/L3 entry. We can also verify the new entry against the entries in the corresponding table to detect duplicate entry and ordering violation which means the module user tries to add an entry after a more specific entry is already at the table.
The L4 related information is stored at the L4 handle returned by API Pa_addPort() and Pa_addCoustomLut2().

The length of the buffers are setup to be 304 bytes. Now for configuration of PAs what is the maximum size of Tx packets required for configuration only? Is it okay to use 256 byte buffers and use multiple of these when TX during PA config setup just like for regular Tx by PA of Ethernet packets. In other words can the configuration by broken into multiple packets and pushed onto the queue as a multi-packet configuration, the response of which to a command like say Pa_addMac is then forwarded back to the LLD through the Pa_forwardResult.
[Eric] The current LLD implementation requires a single linear buffer. You can call API with a large linear buffer and then copy the output into multiple small buffers, setup CPPI buffer link. However, must of command packets are pretty small and can be fit into a 256-byte buffer. Please refer to the command size definitions at the "Command buffer minimum size requirements" section.

If I want to allocate 256 byte buffers for all incoming packets to the DSP through the PA in external memory. How do I go about ensuring that the first 12 bytes off every packet are not written to? I need to put some proprietary stuff in those bytes when stored in external memory. Would that be through the rx_sop_offset in the flow to be configured? So for example an Ethernet packet greater than 256 bytes would use multiple buffers with the first 12 bytes of each buffer skipped.
[Eric] No, rx_sop_offset is only applicable to the first packet segment. However, you can set the buffer pointer to be 12-byte off from the original buffer pointer and the buffer size to be 12 off from the original buffer length so that the first 12 bytes will not be written by PASS.

If my answer is good enough, could you please click the "verify answer" botton.

Best regards,

Eric

Thanks, Aamir

0 Aamir Husain over 13 years ago in reply to Eric Ruei

Expert 2715 points

Eric Ruei wrote the following post at 02-21-2012 3:20 PM:

Hi, Aamir:

Good to hear from you Eric

I have been looking through the simpleExample that comes with the PA in the pdk subdirectory and I had a few questions that need clarification.

Okay so it has a two-fold usage of linking entries and detecting duplicates. I also take it from your statement above that for port info, just the handles are needed to store the details unlike in the L2 and L3 case which need the RAM tables for linking etc? How though does one deal with duplicate entries and ordering violations if I try to add a duplcaite custom LUT2 entry?

I am not sure what document are you referring to in the section comment above - Can you please elaborate. Okay so it is actually the LLD that has a one buffer requirement and the PA itself can accept multiple buffer command packets. I can use your workaround that you suggested if the maximum size exceeds 256 bytes as I want to use 256 byte buffers for messages to the PA.

If I want to allocate 256 byte buffers for all incoming packets to the DSP through the PA in external memory. How do I go about ensuring that the first 12 bytes off every packet are not written to? I need to put some proprietary stuff in those bytes when stored in external memory. Would that be through the rx_sop_offset in the flow to be configured? So for example an Ethernet packet greater than 256 bytes would use multiple buffers with the first 12 bytes of each buffer skipped.
[Eric] No, rx_sop_offset is only applicable to the first packet segment. However, you can set the buffer pointer to be 12-byte off from the original buffer pointer and the buffer size to be 12 off from the original buffer length so that the first 12 bytes will not be written by PASS.

Okay so I do not need to use rx_sop_offset and can just adjust by 12 bytes each buffer pointer for every buffer I use and the length adjustment that you mention too. Thanks!

Also Eric can you please answer my question on the ALE that I posted seperately today. Thanks.

Aamir

0 Eric Ruei over 13 years ago in reply to Aamir Husain

TI__Intellectual 2810 points

Hi, Aamir:

I also take it from your statement above that for port info, just the handles are needed to store the details unlike in the L2 and L3 case which need the RAM tables for linking etc?
[Eric] Yes.
How though does one deal with duplicate entries and ordering violations if I try to add a duplcaite custom LUT2 entry?
[Eric] There is no ordering violation for LUT2 where all entries are in ascending order for binary search.
The routing information of the new entry will replace the old one. You should set the parameter "replace" to TRUE if you want to replace the existing entry.

I am not sure what document are you referring to in the section comment above - Can you please elaborate.
[Eric] ti/drv/pa/pa.h or ti/drv/pa/docs/paDocs.chm or ti/drv/pa/docs/doxygen/html/index.html

Best regards,

Eric

0 Aamir Husain over 13 years ago in reply to Eric Ruei

Expert 2715 points

Thanks for the doc link. It would appear that only the exception route command could be >256 and I think I will not need it so I am okay.

0 long cui over 13 years ago in reply to Eric Ruei

Prodigy 135 points

Hi,Eric

I got some problems with PA ,too.I am changing the example code of PA_emac example and trying to let both emac0 and emac1 do interal_loopback at the same time. The two SGMII and different mac,so I need to add another mac address to the L2 table using function Pa_addMac. The help document says that the pa would search from lowest entry location to highest entry location until the first matching entry is found, but when I add a new Pa_addMac by make a copy of Add_MACAddress fuction and change the corresponding parameter in Pa_addMac(i am not sure if this is the right way ,or you simply has to add a new Pa_addMac function in the old Add_MACAddress), the pass simple discard the packet intead of continue searching for a match. I tried to change nextRtFail parameter in the added Pa_addMac from discard to pa_DEST_CONTINUE_PARSE_LUT1,and the pa_addMac returned config error. How should I do and a new entry to pa l2? And do you always have to push a descriptor to the command queue in order to send the command to the PA , or just add a new Pa_addMac function can config the pa ?

0 Eric Ruei over 13 years ago in reply to long cui

TI__Intellectual 2810 points

Hi, Long:

You do not need to worry about the ordering unless you are adding two entries where the matching criteria of one is the subset of the other. For example, the first entry requires both source address and destination address matching and the second entry only requires the destination address matching and the destination address is the same. In this case, you need to add the second entry which is more general than the other one at first.

The Pa_addMAC() function only formats the command packet and specifies the desired command destination. It is up to the application to forward the command packet to PASS. You need to follow the example of Add_MACAddress() at PA_emac_example for each MAC entry.

Best regards,

Eric

0 long cui over 13 years ago in reply to Eric Ruei

Prodigy 135 points

Hi,Eric:

You answer is helpful, but i still got stuck. The PA_emac_example use Emac0 and its mac address as default destination settings. Now I add a " Add_MACAddress1()" following Add_MACAddress(), adding Emac1's mac address as a new mac entry. Both entry use only destination mac and input Emac port. Both entry are unique. The debug went well, and When I sent the pack with Emac1's destination address, it succeeded. But then I change the packet back to Emac0's mac address ,I always got "received 0 packets so far" feed back. It seemed that when PA failed to match the packet's mac address(Emac0's) with the new adding entry(with Emac1's address), it simply tossed the packet instead of continue searching for a hit(which is the old entry). I changed Add_MACAddress1() and Add_MACAddress()'s position and sent packet with Emac0, now I could receive the packet well. How shall I configure Pa to let it search all the mac entry before toss the packet? I tried to change the nFailInfo for a fail match from "pa_DEST_DISCARD" to "pa_DEST_CONTINUE_PARSE_LUT1", still didn't work. Looking forwarding to you suggestions.

Best regards

Long Cui

0 Eric Ruei over 13 years ago in reply to long cui

TI__Intellectual 2810 points

Dear Long:

We are glad to learn that you start to use TI product as a college student and will try our test to help you succeed. It is a little tricky to setup the CPSW for loopback operation. The best way is to disable ALE learning at all ports and enable ALE bypass for ingress traffics and enable SGMII internal loopback at both EMAC ports. When you send packets to CPSW CPPI port through queue 648, use psFlags to specify the desired EMAC ports. The loopbacked packets will be delivered to PASS.

Please note that the EMAC port 0 is not connected to the RJ45 so that it can be used in internal loopback mode only.
Besides, the nextFail route will be invoked when the input packet matches this rule (MAC rule) and is forworded to the next stage (IP rule) and no match is found.

To help you debug, please provide the following information:
- CPSW and EMAC port settings
- Two MAC rules (ethInfo, routeInfo and etc.)
- test packets
- use CCS to dump the following memory region (16 32-bit words) before and after the packet is delivered to queue 648
- 0x2090b00
- 0x2090c00
- 0x2000000

Best regards,

Eric

0 long cui over 13 years ago in reply to Eric Ruei

Prodigy 135 points

Dear Eric:

Thanks for your help again! Your TI guys are so cool and helpful! I am using pdk_C6670_1_0_0_17, PA_emac exmaple. The CPSW, ALE table SGMII mode(both internal),Q648, psFlags are all setted just as described above. You answer proved my wondering about the nextFail setting was wrong, and I finally found the problem. For each new mac entry, there is a handle specified for L2 to reference L2 arouting information . Instead of using a new one for a new entry I added, I simply copied the function Pa_addmac and used the old handle. Now I have both Emac0&1 initiated and working on internal loopback mode on EVM board in a single program successfully. I wouldn't have done it without your guys help these days, so thank you again!

Although I reached my goal, there a few points left still confused me. First is about adding new ALE entries in function Init_Cpsw. It seems in loopback mode, the program sets both port0&1 with port1's mac address, why? I quote as following:

if(cpswLpbkMode == CPSW_LOOPBACK_NONE)
Switch_update_addr(0, macAddress0, 0);
else
Switch_update_addr(0, macAddress1, 0);

Switch_update_addr(1, macAddress1, 0);
Switch_update_addr(2, macAddress2, 0);

Now i send a packet with destination mac address 0x 10,11,12,13,14,15(for Emac0),with the settings above , the console window reads:

"Following is the ALE table before transmits.

Port =0, Mac address 10:11:12:13:14:15 ,unicast_type =0