• Join
  • Sign In with my.TI Login
Texas Instruments
  • Products
  • Applications
  • Tools & Software
  • Support & Community
  • Sample & Buy
  • About TI
Sample & Purchase Cart Sample & Purchase Cart
  • Search
  • Advanced
TI E2E™ Community
  • Support Forums
  • Blogs
  • Groups
  • Videos
  • 简体中文
  • More ...
TI Home » TI E2E Community » Support Forums » Digital Signal Processors (DSP) » OMAP™ Processors » OMAP35x Processors Forum » C6Run Benchmark DSP much Slower than ARM even for different memory maps
Share
OMAP™ Processors
  • Forums
  • Announcements
Options
  • Subscribe via RSS
Resources
  • OMAP-L1x DSP+ARM9™-based Processors Product Folder
  • OMAP3525/30 DSP+ARM Cortex™-A8-based SOCs Product Folder

  • Top OMAPL Wiki Links
  • OMAPL3x Schematic Review Checklist
  • OMAPL13x Boot resources

  • OMAPL Document Resources
  • OMAPL137 Technical reference manual
  • OMAPL138 Technical reference manual
  • OMAPL Boot loader App Notes
  • C6Run Benchmark DSP much Slower than ARM even for different memory maps

    C6Run Benchmark DSP much Slower than ARM even for different memory maps

    This question is not answered
    Steve Haviland
    Posted by Steve Haviland
    on May 04 2012 22:28 PM
    Prodigy90 points
    c6run-memorymap-compare.txt

    Hello

    New to the TI/DSP stuff, I have been trying to figure out why I can't get DSP code to run faster. I have read through the forums as much as I could but have not figured out what my issue is. I have tried adjusting the DSP mapping but that does not effect anything at all I think I am missing a key point.

    I have attached a text file that contains two different memory map runs with the benchmark runs

    Here are two different memory maps I did for C6RUN:

    First I had DSP_REGION_CMEM_SIZE at 16 MB and
    DSP_REGION_CODE_SIZE  at 13 MB

    then I tried DSP_REGION_CMEM_SIZE at 28 MB and
    DSP_REGION_CODE_SIZE  at 28 MB

    Beside that only not making any difference. When I checked lsmod for both there was no difference even though I suspect there should be...

    I do all the steps that the readme for c6run says to do in setting up platform etc..and I do not run into any run time errors

    I read that I am NOT suppose to adjust DSPLINK memory map as the platform config does that for me...

    How do I get the DSP to run faster? What I'm I missing?

    Any recommendations would be great.

    C6Run
    Report Abuse
    • Reply
    You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    All Replies
    • Gagan Maur
      Posted by Gagan Maur
      on May 15 2012 16:41 PM
      Expert4435 points

      Performance depends on:

      1. What is the code you are trying to run? Is the code suitable for DSP? Generally DSP is good at running code that does same stuff over and over again. So code that has loops will do better on DSP
      2. What are the options you are using to compile the code? http://processors.wiki.ti.com/index.php/C6RunLib_Documentation#Common_Command-line_Options Make sure you are using options that help improve the performance
      3. How much work are you asking DSP to do? Note, every time ARM offloads processing to DSP, there is some overhead involved. So make sure when you are asking DSP to do certain processing, you are giving enough work to DSP.. else, just your calling overheads will dominate. I think the ballpark for overhead is ~200usecs
      4. Are you helping DSP codegen tools to get best performance by providing them the relevant information? See the document here: http://www.ti.com/lit/pdf/sprabf2 by helping the DSP codegen to not make worst case assumptions about your code, you can gain significant performance

       

      ----------------------------------------------------------------------------------------------------------
      Please click the
      Verify Answer button on this post if it answers your question.
      ----------------------------------------------------------------------------------------------------------

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Steve Haviland
      Posted by Steve Haviland
      on May 17 2012 16:50 PM
      Prodigy90 points

      Thank you for your response Gagan. I was running all the included C6Run Example code ( bench_dsp and cfft_dsp ) and comparing them with ( bench_arm and cfft_arm)

      Both Arm version's run much faster. I used the standard memory map and proper u-boot environment variable setup. I search the forum and found someone else:

      http://e2e.ti.com/support/dsp/omap_applications_processors/f/447/p/70317/255208.aspx#255208

      who wrote some Matrix calculation sample and ran that too. The DSP version was running close to the same speed as the ARM.

      I also ran the C6Accel sample code(c6accel_app)

      that goes through each function and logs time it takes to go through it. Comparing mine to the pdf TI provided mine is slower as well. 

      Running on DM3730. 

      I will look more into what you wrote (2,3 and 4) and will report back if anything helps.

      Thanks

      Steve

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    • Gagan Maur
      Posted by Gagan Maur
      on May 17 2012 17:11 PM
      Expert4435 points

      Steve, one other thing that I didn't mention is the impact of running floating point code. Note for DM3730, DSP is fixed point whereas the A8 supports floating point. So if the benchmark you are running is natively floating point, the performance of the DSP will not be great. There are fixed point version of FFTs provided in the DSPLIB.

      The other thing to note is the CPU freq for the two cores. You should account for that when comparing performance

      Cheers,
      Gagan 

      ----------------------------------------------------------------------------------------------------------
      Please click the
      Verify Answer button on this post if it answers your question.
      ----------------------------------------------------------------------------------------------------------

      Report Abuse
      • Reply
      You have posted to a forum that requires a moderator to approve posts before they are publicly available.
    TI E2E™ Community
    • Support Forums
    • Blogs
    • Videos
    • Groups
    • Site Support & Feedback
    • Settings
    TI E2E™ Community Groups
    • TI University Program
    • Make the Switch
    • Microcontroller Projects
    • Motor Drive & Control
    Other Communities
    • Deyisupport
    • Designsomething.org
    • beagleboard.org
    • TI on Element 14
    • TI on TechXchangeSM
    Other Technical & Support Resources
    • WEBENCH® Design Center
    • Product Information Centers
    • Technical Documents
    • TI Design Network
    • TI Technical Articles
    • TI Training

    All content and materials on this site are provided "as is". TI and its respective suppliers and providers of content make no representations about the suitability of these materials for any purpose and disclaim all warranties and conditions with regard to these materials, including but not limited to all implied warranties and conditions of merchantability, fitness for a particular purpose, title and non-infringement of any third party intellectual property right. TI and its respective suppliers and providers of content make no representations about the suitability of these materials for any purpose and disclaim all warranties and conditions with respect to these materials. No license, either express or implied, by estoppel or otherwise, is granted by TI. Use of the information on this site may require a license from a third party, or a license from TI.

    Content on this site may contain or be subject to specific guidelines or limitations on use. All postings and use of the content on this site are subject to the Terms of Use of the site; third parties using this content agree to abide by any limitations or guidelines and to comply with the Terms of Use of this site. TI, its suppliers and providers of content reserve the right to make corrections, deletions, modifications, enhancements, improvements and other changes to the content and materials, its products, programs and services at any time or to move or discontinue any content, products, programs, or services without notice.

    Follow Us Texas Instruments on Facebook Texas Instruments on Twitter Texas Instruments on LinkedIn Texas Instruments on Google+
    TI Worldwide | Contact Us | my.TI Login | Site Map | Corporate Citizenship | mobile m.ti.com (Mobile Version)

    TI is a global semiconductor design and manufacturing company. Innovate with 100,000+ analog ICs and
    embedded processors, along with software, tools and the industry’s largest sales/support staff.

    © Copyright 1995-2013 Texas Instruments Incorporated. All rights reserved.
    Trademarks | Privacy Policy | Terms of Use