Tool/software:
Hi,
I am running a bunch of software on an AM5729 custom board under Linux SDK 6.03. I am having intermittent issues (and seemingly random issues) that manifests in errors as such:
[ 104.776326] Alignment trap: not handling instruction ed927b00 at [<b4f57638>]
[ 104.784006] Unhandled fault: alignment exception (0x001) at 0xea2edebf
[ 104.790913] pgd = ceb88000
[ 104.793778] [ea2edebf] *pgd=00000000
These errors come from different applications at different times and do not seem to follow any logical trend as to when they occur. Sometimes I can run the software stack for hours without seeing the issue and other times it happens almost immediately when the applications begin.
I am a bit stuck on how to proceed and was hoping to get some advice. Some things I have done so far:
- Run applications using gdb to ensure is not a software fault in terms of memory management causing these issues. The fact that different applicatons cause the problem (it is not always the same one) seems to rule out this
- Run the am57xx-ddr.dss to verify that the changes I have made from the EMIF Tool have successfully gone through via u-boot - they seem to be as expected
- Run memtester for an extended period of time - no problems
- Used stressapptest to test the system under duress. Everytime it is run I see issues such as - this is the red flag that I keep coming back to which seems to suggest there is a problem with my memory setup, however, I cannot seem to find any issues
Log: CrcCopyPage CRC mismatch aaaaaaaa01aaaaaaaa01aaffffff5700aaffffff57f4 != aaaaaaaa01aaaaaaaa01aaffffff5700aaffffff5700, but no miscompares found. Retrying with fresh data.
Process Error: CPU 0(0x2) CrcCopyPage CRC mismatch aaaaaaaa01aaaaaaaa01aaffffff5700aaffffff57f4 != aaaaaaaa01aaaaaaaa01aaffffff5700aaffffff5700, but no miscompares found on second pass.
Report Error: miscompare : DIMM Unknown : 1 : 4s
Hardware Error: miscompare on CPU 0(0x2) at 0x968e0000(0x2e60d0:DIMM Unknown): read:0x5555555555555555, reread:0x5555555555555555 expected:0x0000000000000000
I am currently running out of things to try so any advice / recommendations on how to proceed to track down this issue would be greatly appreciated.
Thanks,
Jared