• Join
  • Sign In with my.TI Login
Texas Instruments
  • Products
  • Applications
  • Tools & Software
  • Support & Community
  • Sample & Buy
  • About TI
Sample & Purchase Cart Sample & Purchase Cart
  • Search
  • Advanced
TI E2E™ Community
  • Support Forums
  • Blogs
  • Groups
  • Videos
  • 简体中文
  • More ...
TI Home » TI E2E Community » Support Forums » Digital Signal Processors (DSP) » C6000 Multicore DSP » Keystone Multicore Forum (C66, 66A, AM5) » C6678 Memory performance
Share
C6000 Multicore DSP
  • Forums
  • Announcements
Options
  • Subscribe via RSS
Training Available
TI provides self-paced online training that introduces the primary components of the KeyStone II family of SoC devices.

  • KeyStone II SoC Overview >
  • KeyStone II Software Overview >
  • KeyStone II ARM Cortex-A15 Corepac Overview >
  • More Information >
  • Check out
    Multicore Mix blog
    • $core_v2_blog.Current.Name

      OpenMP - All aboard!

      Posted 1 day ago
      by Debbie Greenstreet
      With so many end products today relying on multicore DSPs for...
    • $core_v2_blog.Current.Name

      A look back: Two years of Multicore Mix

      Posted 2 days ago
      by Lauren Reed1
      A big thank you to everyone who participated in our contest last...
    • $core_v2_blog.Current.Name

      It’s our second anniversary, but you get the present!

      Posted 9 days ago
      by Lindsey Bare
      It’s hard to believe it’s already been two years...

    Forums

    C6678 Memory performance

    This question is answered
    Ya D
    Posted by Ya D
    on Sep 19 2011 02:22 AM
    Intellectual485 points

    Hi,

    I am looking for information about memory performance in the C6678 DSP.

    More specifically, when L1 and L2 are SRAM only, what is the throughput for each of them ?

    Also, what is the throughput of the shared memory when accessed by all cores simultaneously

    (that is, how many cores can read from the shared memory together) ?

    Same for the DDR3. Does DDR3 access interfere with shared memory access (or anything else)?

    I looked in the documentations and could not find anything related.

     

    Thank you.

    KeyStone C6778
    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    All Replies
    • Chad Courtney
      Posted by Chad Courtney
      on Sep 19 2011 10:04 AM
      Mastermind22595 points

      Ya,

      There is a performance application note that will be released soon, that will address these and more questions regarding memory and peripheral performance.

      Best Regards,

      Chad

      ------------------------------------------------------------------------------------------------------------

      Please click the Verify Answer button on this post if it answers your question.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Ya D
      Posted by Ya D
      on Sep 19 2011 13:36 PM
      Intellectual485 points

      Thank you.

      Do you know when? ( its a little bit urgent)

      Is there any way to get quick answers soon?

      Thank you again.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Chad Courtney
      Posted by Chad Courtney
      on Sep 19 2011 14:04 PM
      Mastermind22595 points

      I expect it will be out next week.  I'll have someone try to follow up with some data points.  That said, you'll want to be more specific as to what the source and destination are specifically and what the transfer mechanism is (EDMA, IDMA, CorePacs, Peripherals)

      Best Regards,

      Chad

      ------------------------------------------------------------------------------------------------------------

      Please click the Verify Answer button on this post if it answers your question.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Ya D
      Posted by Ya D
      on Sep 19 2011 14:14 PM
      Intellectual485 points

      Thank you.

      I am using EDMA for DDR to L2 and shared memory.

      The CorePacs operate directly on L2 and shared memory.

      But I am also interested (for comparasion only) about access directly to DDR from the core.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Derek Brown
      Posted by Derek Brown
      on Sep 20 2011 09:17 AM
      Verified Answer
      Verified by Ya D
      Intellectual1880 points

      Ya,

      Hopefully, the below information will be adequate until the performance application note is published.

      All throughput figures listed below assume that the DSP is operating at 1 GHz (1000 Mhz).

      Raw Memory Throughput:

      • L1D SRAM: L1D operates at DSP/1 frequency, so it can be accessed every cycle. L1D is accessed through a 256 bit interface, so the maximum throughput of L1D is 32,000 megabytes per second. ((256 bit buswidth)/(8 bits/byte)*(1000M)=32000MB/s)
      • L2 SRAM: L2 operates at DSP/2 frequency, so it can be accessed every other cycle. L2 is accessed through a 256 bit interface, so the maximum throughput of L2 is 16,000 megabytes per second. ((256 bit buswidth)/(8 bits/byte)*(1000M/2)=16000MB/s)
      • MSMC SRAM: MSMC operates at DSP/2 frequency, so it can be accessed every other cycle. MSMC has 4 memory banks, each of which is accessed through a 256 bit interface. All 4 banks can be accessed simultaneously by separate cores or other system masters. If the data in MSMC SRAM is allocated so that each of the 4 banks are being fulling utilized simultaneously, the maximum throughput of the shared memory is 64,000 megabytes per second. ((256 bit buswidth)/(8 bits/byte)*(1000M/2)*(4 banks)=64000MB/s). Up to 4 masters can access MSMC simultaneously.
      • DDR3: DDR3 data is accessed through a 64-bit interface. Assuming that DDR3-1333 is operating at the maximum rate, the theoretical maximum throughput 10,664 megabytes per second. ((64bits)/(8bit/byte)*(1333M)=10664MB/s). Only one master can access the DDR3 at a time.

       

      Maximum Throughputs for C66x core and DMA masters:

      • C66x Core: (128 bit buswidth)/(8bit/byte)*(1000M)=16000MB/s
      • IDMA: (256 bit buswidth)/(8bit/byte)*(1000M/2)=16000MB/s
      • EDMA0: (256 bit buswidth)/(8bit/byte)*(1000M/2)=16000MB/s
      • EDMA1: (128 bit buswidth)/(8bit/byte)*(1000M/3)=5333MB/s
      • EDMA2: (128 bit buswidth)/(8bit/byte)*(1000M/3)=5333MB/s

       

      DDR3 Accesses using EDMA:

      If you use EDMA0 to access DDR3, then you should be able to get data at a maximum rate of 10664 megabytes per second, excluding any idle time required for setting up the EDMA transaction. If using EDMA1 or EDMA2 to access DDR3, then the maximim rate that you can get data from DDR3 is 5333 megabytes per second, exculding any idle time required for setting up the EDMA trasaction.

       

      C66x Core Accessed to DDR3:

      Assuming that L1D, L2, and MSMC are all configured as SRAM (caching disabled) then single reads to DDR3 will stall the DSP 89 cycles. Burst reads to DDR3 will stall the DSP 43.2 cycles. Writes to DDR3 will not stall the DSP since the data will be written to DDR3 by the DDR3 controller; however, it will take some time for the data to actually arrive in DDR3.

       

      Regards,

      Derek

      Regards,

      Derek

      -------------------------------------------------------------------------------------------------------------------------------

      If you need more help, please reply back. If this answers the question, please click  Verify Answer , below.

      C6678 Throughput Memory Access
      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Ya D
      Posted by Ya D
      on Sep 20 2011 09:36 AM
      Intellectual485 points

      Thank you very much.

      This is exactly what I need.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Ya D
      Posted by Ya D
      on Sep 25 2011 02:14 AM
      Intellectual485 points

      Hi,

      One last question,

      You said that direct access to DDR3 stalls the DSP for 89 cycles. What causes the stall? DDR3 latency? How do you get this number?

      Thank you

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Derek Brown
      Posted by Derek Brown
      on Sep 27 2011 14:53 PM
      Intellectual1880 points

      Ya,

      Yes, this delay is primarily caused by DDR3 latency. There is also some delay in the bus for accessing DDR3, since the bus that is used to access DDR3 operates at a slower rate than the DSP.

      I will get back with you on how these numbers are being measured.

      Regards,

      Derek

      Regards,

      Derek

      -------------------------------------------------------------------------------------------------------------------------------

      If you need more help, please reply back. If this answers the question, please click  Verify Answer , below.

      C66x DDR3
      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Ya D
      Posted by Ya D
      on Nov 15 2011 04:05 AM
      Intellectual485 points

      Hi,

      any updates?

      Is the performance document published yet?

      I could not find it anywhere.

      Thanks.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Chad Courtney
      Posted by Chad Courtney
      on Nov 15 2011 13:25 PM
      Mastermind22595 points

      Ya,

      We had a number of internal reviews with many changes for clarification and haven't published it yet.  We are really close and it should be out in the next few weeks.

      Best Regards,

      Chad

      ------------------------------------------------------------------------------------------------------------

      Please click the Verify Answer button on this post if it answers your question.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Ivan Krechetov
      Posted by Ivan Krechetov
      on Apr 13 2012 17:32 PM
      Prodigy210 points

      Hi,

      I have a problem with IDMA1 (L2<->L1) performance (only approx. 3,3Gb/s), pls. look thread http://e2e.ti.com/support/dsp/c6000_multi-core_dsps/f/639/t/182014.aspx#656195

      Thanks

      Ivan

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Chad Courtney
      Posted by Chad Courtney
      on Apr 16 2012 14:46 PM
      Mastermind22595 points

      Ivan,

      I have someone who's digging into this and will be replying on the other thread.

      Best Regards,
      Chad

      ------------------------------------------------------------------------------------------------------------

      Please click the Verify Answer button on this post if it answers your question.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Teun van Kuppeveld
      Posted by Teun van Kuppeveld
      on Apr 17 2012 09:27 AM
      Prodigy50 points

      Dear Chad,

      Is the document published at this moment?

      Best regards,

      Teun

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Chad Courtney
      Posted by Chad Courtney
      on Apr 17 2012 10:04 AM
      Mastermind22595 points

      Teun,

      It's documented in the Keystone Throughput Performance Guide http://www.ti.com/lit/an/sprabk5/sprabk5.pdf

      Best Regards,

      Chad

      ------------------------------------------------------------------------------------------------------------

      Please click the Verify Answer button on this post if it answers your question.

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    TI E2E™ Community
    • Support Forums
    • Blogs
    • Videos
    • Groups
    • Site Support & Feedback
    • Settings
    TI E2E™ Community Groups
    • TI University Program
    • Make the Switch
    • Microcontroller Projects
    • Motor Drive & Control
    Other Communities
    • Deyisupport
    • Designsomething.org
    • beagleboard.org
    • TI on Element 14
    • TI on TechXchangeSM
    Other Technical & Support Resources
    • WEBENCH® Design Center
    • Product Information Centers
    • Technical Documents
    • TI Design Network
    • TI Technical Articles
    • TI Training

    All content and materials on this site are provided "as is". TI and its respective suppliers and providers of content make no representations about the suitability of these materials for any purpose and disclaim all warranties and conditions with regard to these materials, including but not limited to all implied warranties and conditions of merchantability, fitness for a particular purpose, title and non-infringement of any third party intellectual property right. TI and its respective suppliers and providers of content make no representations about the suitability of these materials for any purpose and disclaim all warranties and conditions with respect to these materials. No license, either express or implied, by estoppel or otherwise, is granted by TI. Use of the information on this site may require a license from a third party, or a license from TI.

    Content on this site may contain or be subject to specific guidelines or limitations on use. All postings and use of the content on this site are subject to the Terms of Use of the site; third parties using this content agree to abide by any limitations or guidelines and to comply with the Terms of Use of this site. TI, its suppliers and providers of content reserve the right to make corrections, deletions, modifications, enhancements, improvements and other changes to the content and materials, its products, programs and services at any time or to move or discontinue any content, products, programs, or services without notice.

    Follow Us Texas Instruments on Facebook Texas Instruments on Twitter Texas Instruments on LinkedIn Texas Instruments on Google+
    TI Worldwide | Contact Us | my.TI Login | Site Map | Corporate Citizenship | mobile m.ti.com (Mobile Version)

    TI is a global semiconductor design and manufacturing company. Innovate with 100,000+ analog ICs and
    embedded processors, along with software, tools and the industry’s largest sales/support staff.

    © Copyright 1995-2013 Texas Instruments Incorporated. All rights reserved.
    Trademarks | Privacy Policy | Terms of Use