• Join
  • Sign In with my.TI Login
Texas Instruments
  • Products
  • Applications
  • Tools & Software
  • Support & Community
  • Sample & Buy
  • About TI
Sample & Purchase Cart Sample & Purchase Cart
  • Search
  • Advanced
TI E2E™ Community
  • Support Forums
  • Blogs
  • Groups
  • Videos
  • 简体中文
  • More ...
TI Home » TI E2E Community » Support Forums » Embedded Software » WinCE » WinCE forum » TI OMAP35X WinCE BSP 6.15 - Why is OMAP35X RAM Access much slower than RAM Access on IMX31?
Share
WinCE
  • Forum
Options
  • Subscribe via RSS

Forums

TI OMAP35X WinCE BSP 6.15 - Why is OMAP35X RAM Access much slower than RAM Access on IMX31?

  • Madhvi
    Posted by Madhvi
    on Mar 10 2011 17:14 PM
    Expert5080 points

    Eugen

    I ran my experiments on a WinCE 7 compact environment (since that is my current development setup). I will go back to wince 6 and try to reproduce the issue you are seeing. I will get back to you once I find something.

    -Madhvi

     


    Please click Verify Answer button if this response answers your question. For updated information on TI WINCE BSP, please check out the WinCE Handbook.

     

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Richard Hendricks
    Posted by Richard Hendricks
    on Mar 11 2011 11:05 AM
    Prodigy70 points

    Madhvi,

      I am downloading Windows Embedded Compact 7 now and will try doing an EVM build without and with your recommended changes above.   I will let you know of my findings.

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Eugen Feraru
    Posted by Eugen Feraru
    on Mar 15 2011 14:30 PM
    Intellectual375 points

    Hi Madhvi,

    This issue, accessing the File System taking an excessive amount of time, has been identified to be related to the attempt to improve Data Cache flushing, implemented by both 6.X and A8 BSPs. The implementation is not adequate, causing poor RAM Performance As File IO. 
       
    The fix consists in defaulting to Microsoft's Windows CE default ARM Flush Cache Routines (OALFlushDCache and OALFlushDCacheLines). 

    Thank you,

    Eugen

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Madhvi
    Posted by Madhvi
    on Mar 17 2011 15:07 PM
    Expert5080 points

    Eugen

    Using Microsoft routines to flush data cache is not the right workaround since these routines only take care of L1 cache. The routines in BSP take more time since they take care of both L1 and L2 cache. So just using Microsoft routines, could lead to cache coherency issues. We will not be recommending these changes for our BSPs.

    Now for the original problem, Microsoft acknowledges that the excessive data flushing is a know issue of Object Store (RAM based filesystem), it needs to flush the cache constantly to prevent data loss when accidental reset/power failure (assume RAM still maintaining data)

    As per Microsoft suggestion, the possible workaround is to use RAMDISK or RAMFMD to mount as root if Object Store is not a must requriement. The idea of using RAMDISK/RAMFMD is to replace the Object Store and mount it as root. (i.e. set SYSGEN_FSROMONLY=1). Unless you exclude the ObjectStore, you will still suffer from excessive data cache flushing issue. Actually, that is due to the cache size on modern CPU getting larger and larger so the flush all penalty is amplified.

    -Madhvi

     


    Please click Verify Answer button if this response answers your question. For updated information on TI WINCE BSP, please check out the WinCE Handbook.

     

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Richard Hendricks
    Posted by Richard Hendricks
    on Mar 17 2011 15:25 PM
    Prodigy70 points

    Madhvi,

      If that is the case, we could try shutting off the L2 in our BSP and see what the performance tradeoff might be.  Is it possible to setup the L2 cache as write-through, so no modified data is stored there, and instead of flushing the L2 line by line, invalidating the whole L2?  This might be an option.  It would at least give speed improvements when reading data.

     

      Have you duplicated the RAM copy performance we are seeing?

     

    I do understand the need for cache coherency on the L2, but interestingly we have not run into stability problems yet.  I would expect to have problems running applications if the L2 is not being flushed properly.

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Madhvi
    Posted by Madhvi
    on Mar 17 2011 15:57 PM
    Expert5080 points

    Richard Hendricks

    Madhvi,

      If that is the case, we could try shutting off the L2 in our BSP and see what the performance tradeoff might be.

    You could try - I am not sure what the side-effects would be.

    Richard Hendricks

      Have you duplicated the RAM copy performance we are seeing?

    Yes

    Richard Hendricks

    I do understand the need for cache coherency on the L2, but interestingly we have not run into stability problems yet.  I would expect to have problems running applications if the L2 is not being flushed properly.

    well it depends on what applications you are running/testing. Again, its not something we would recommend as a workaround.

     

     


    Please click Verify Answer button if this response answers your question. For updated information on TI WINCE BSP, please check out the WinCE Handbook.

     

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Richard Hendricks
    Posted by Richard Hendricks
    on Mar 17 2011 16:15 PM
    Prodigy70 points

    Madhvi,

      I am wondering, do you know the details of where CE is trying to do that cache flushing?  I am curious, because it seems like the amount of cache flushing would have to be horrendous to cause such a performance degradation.  Are they flushing after every 4 bytes or something?  After all, the flush routine should be able to run through very quickly since after the first pass much of the L2 should be invalid and not result in a write to the system RAM, even if they are attempting to flush the full cache.  Hmmm.  If they are doing a full cache flush instead of just flashing specific memory areas, then that could be a problem I guess since the source area for the read/write would also be invalidated.

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Madhvi
    Posted by Madhvi
    on Mar 17 2011 16:19 PM
    Expert5080 points

    Sorry I dont have the details - but you could post the question to MSDN forums for more information.

    -Madhvi

     


    Please click Verify Answer button if this response answers your question. For updated information on TI WINCE BSP, please check out the WinCE Handbook.

     

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Madhvi
    Posted by Madhvi
    on Mar 17 2011 16:39 PM
    Expert5080 points

    The only information I can help you with is that the "flush all D-cache" is called by the kernel around 25000+ times on my setup while writing a 32 MB file with 256K buffers (and my Storage memory is around 88 MB)

    -Madhvi

     


    Please click Verify Answer button if this response answers your question. For updated information on TI WINCE BSP, please check out the WinCE Handbook.

     

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • DavidVescovi
    Posted by DavidVescovi
    on Mar 17 2011 21:56 PM
    Genius4200 points

    So in this case (file IO) a 500mhz ARM11 beats an ARM A8 @800mhz. ???

     

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Madhvi
    Posted by Madhvi
    on Mar 18 2011 09:57 AM
    Expert5080 points

    You cannot compare 2 processors/products just by the processor speed. The issue here is not that the h/w is in-capable but that the software is not optimized for the newer processors.

    -Madhvi

     


    Please click Verify Answer button if this response answers your question. For updated information on TI WINCE BSP, please check out the WinCE Handbook.

     

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • DavidVescovi
    Posted by DavidVescovi
    on Mar 18 2011 10:14 AM
    Genius4200 points

    I couldn't agree more ... thats why benchmarks are so helpfull. It sure points to an area thats needs optimization.

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • P X
    Posted by P X
    on Jul 29 2011 02:56 AM
    Prodigy30 points

    The nand read performance is poor in wince. Is it related to the L2 cache issue?

    Below is the update from CE 6.0 Monthly Update Feb 2011, can this resolve this issue?

    110211_KB982563 - This update implements L2 cacheable page table support for ARM processor.

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • Eugen Feraru
    Posted by Eugen Feraru
    on Jul 29 2011 06:49 AM
    Intellectual375 points

    Simply applying the monthly update, it did not help.

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
  • a a72605
    Posted by a a72605
    on Oct 18 2012 12:26 PM
    Prodigy30 points

    I'm seeing similar slow file copy on OMAP4430 and WE7.

    Is there any suggested fix to improve file copying performance to match i.MX31?

    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
1234
TI E2E™ Community
  • Support Forums
  • Blogs
  • Videos
  • Groups
  • Site Support & Feedback
  • Settings
TI E2E™ Community Groups
  • TI University Program
  • Make the Switch
  • Microcontroller Projects
  • Motor Drive & Control
Other Communities
  • Deyisupport
  • Designsomething.org
  • beagleboard.org
  • TI on Element 14
  • TI on TechXchangeSM
Other Technical & Support Resources
  • WEBENCH® Design Center
  • Product Information Centers
  • Technical Documents
  • TI Design Network
  • TI Technical Articles
  • TI Training

All content and materials on this site are provided "as is". TI and its respective suppliers and providers of content make no representations about the suitability of these materials for any purpose and disclaim all warranties and conditions with regard to these materials, including but not limited to all implied warranties and conditions of merchantability, fitness for a particular purpose, title and non-infringement of any third party intellectual property right. TI and its respective suppliers and providers of content make no representations about the suitability of these materials for any purpose and disclaim all warranties and conditions with respect to these materials. No license, either express or implied, by estoppel or otherwise, is granted by TI. Use of the information on this site may require a license from a third party, or a license from TI.

Content on this site may contain or be subject to specific guidelines or limitations on use. All postings and use of the content on this site are subject to the Terms of Use of the site; third parties using this content agree to abide by any limitations or guidelines and to comply with the Terms of Use of this site. TI, its suppliers and providers of content reserve the right to make corrections, deletions, modifications, enhancements, improvements and other changes to the content and materials, its products, programs and services at any time or to move or discontinue any content, products, programs, or services without notice.

Follow Us Texas Instruments on Facebook Texas Instruments on Twitter Texas Instruments on LinkedIn Texas Instruments on Google+
TI Worldwide | Contact Us | my.TI Login | Site Map | Corporate Citizenship | mobile m.ti.com (Mobile Version)

TI is a global semiconductor design and manufacturing company. Innovate with 100,000+ analog ICs and
embedded processors, along with software, tools and the industry’s largest sales/support staff.

© Copyright 1995-2013 Texas Instruments Incorporated. All rights reserved.
Trademarks | Privacy Policy | Terms of Use