This thread has been locked.

If you have a related question, please click the "Ask a related question" button in the top right corner. The newly created question will be automatically linked to this question.

system crashed

Hi all,

I'm using a usb 3G dongle on a dm6446 platform,  the kernel version is 2.6.10.

After I modprobed musb_hdrc, pl2303,  and then "ls" to see whether the /dev/tts/USB1 device node was created, the system crashed like the followings, please give some hints to know where to find the faults.

Thanks.

Richard

hub_port_init:2075, retry=0
hub_port_init:2075, retry=0
hub_port_init:2075, retry=0
ls: /dev/tts/USB1: No such file or directory
ls: /dev/tts/USB1: No such file or directory
BUG: dead task env:31335 back from the grave!
.... flags: 0000810c, count: 0, state: 00000040
.... trying again ...
Unable to handle kernel NULL pointer dereference at virtual address 00000004
pgd = c0004000
[00000004] *pgd=00000000
Internal error: Oops: 17 [#1]
Modules linked in: musb_hdrc pl2303 usbserial cdc_acm usbcore dm350mmap dsplinkk cmemk
CPU: 0
PC is at dequeue_task+0xc/0x78
LR is at deactivate_task+0x28/0x34
pc : [<c0041ea8>]    lr : [<c00421e4>]    Tainted: PF   
sp : c4c67ef8  ip : c4c67f08  fp : c4c67f04
r10: c4c66000  r9 : c02e8390  r8 : c73e0d00
r7 : c4c67f78  r6 : c73e0d00  r5 : c838b820  r4 : c73e0d00
r3 : 00000001  r2 : 00000000  r1 : 00000000  r0 : c73e0d00
Flags: nzCv  IRQs off  FIQs on  Mode SVC_32  Segment user
Control: 5317F  Table: 89A9C000  DAC: 00000015
Process env (pid: 31335, stack limit = 0xc4c661a0)
Stack: (0xc4c67ef8 to 0xc4c68000)
7ee0:                                                       c4c67f1c c4c67f08
7f00: c00421e4 c0041eac c004a144 c004a144 c4c67f6c c4c67f20 c0243984 c00421cc
7f20: c4c67f4c c4c67f30 c4c66000 00989680 ea92ff80 00002181 00000020 c73e0e5c
7f40: c4c67f5c c4c66000 c838b820 c73e0d00 c4c67f78 00000020 c4c66000 c73e0d5c
7f60: c4c67f8c c4c67f70 c004a144 c02437b0 c73e0da0 0090005b c4c67f78 c4c67f78
7f80: c4c67fa4 c4c67f90 c004a2c4 c0049418 00085cb0 00085cd4 00000000 c4c67fa8
7fa0: c00319c0 c004a1f4 00085cb0 00085cd4 00000000 00000000 fbad2088 ffffffff
7fc0: 00085cb0 00085cd4 00000000 00000000 00000000 000a963c 4015e000 0000001c
7fe0: 40017000 befffae0 4007eec8 400d846c 60000010 00000000 d9ecdfd1 eb4747f8
Backtrace:
[<c0041e9c>] (dequeue_task+0x0/0x78) from [<c00421e4>] (deactivate_task+0x28/0x34)
[<c00421bc>] (deactivate_task+0x0/0x34) from [<c0243984>] (__schedule+0x1e4/0x5ac)
 r4 = C004A144
[<c02437a0>] (__schedule+0x0/0x5ac) from [<c004a144>] (do_exit+0xd3c/0xda0)
[<c0049408>] (do_exit+0x0/0xda0) from [<c004a2c4>] (sys_exit_group+0x0/0x1c)
[<c004a1e4>] (do_group_exit+0x0/0xe0) from [<c00319c0>] (ret_fast_syscall+0x0/0x2c)
 r5 = 00085CD4  r4 = 00085CB0
Code: e89da800 e1a0c00d e92dd800 e24cb004 (e5913004)
 <6>note: env[31335] exited with preempt_count 2
Unable to handle kernel NULL pointer dereference at virtual address 00000004
pgd = c0004000
[00000004] *pgd=00000000
Internal error: Oops: 17 [#2]
Modules linked in: musb_hdrc pl2303 usbserial cdc_acm usbcore dm350mmap dsplinkk cmemk
CPU: 0
PC is at do_exit+0xdc/0xda0
LR is at vprintk+0x23c/0x270
pc : [<c00494e4>]    lr : [<c0047a4c>]    Tainted: PF   
sp : c4c67df0  ip : c4c67da0  fp : c4c67e0c
r10: 0000000b  r9 : 00000008  r8 : bf000000
r7 : c0041ea8  r6 : c73e0d00  r5 : c4c66000  r4 : c4c66000
r3 : 40000093  r2 : 00000000  r1 : 40000013  r0 : 00000030
Flags: nZcv  IRQs off  FIQs on  Mode SVC_32  Segment user
Control: 5317F  Table: 89A9C000  DAC: 00000015
Process env (pid: 31335, stack limit = 0xc4c661a0)
Stack: (0xc4c67df0 to 0xc4c68000)
7de0:                                     c4c67eb0 00000000 c4c66000 00000001
7e00: c4c67e24 c4c67e10 c0037508 c0049418 c0039cdc bf000000 c4c67e44 c4c67e28
7e20: c0039cec c00371c8 00000017 c4c67ee4 c73e0d00 00000000 c4c67e84 c4c67e48
7e40: c0039f88 c0039c90 c4c66000 c02e5a70 c4c67e7c c4c67eb0 c00330e8 00000017
7e60: c4c67ee4 00000004 c028d414 c4c67eb0 60000093 c4c66000 c4c67eac c4c67e88
7e80: c003a0ec c0039dc4 c4c67eb4 ffffffff c4c67ee4 c73e0d00 c4c67f78 c73e0d00
7ea0: c4c67f04 c4c67eb0 c0031588 c003a0bc c73e0d00 00000000 00000000 00000001
7ec0: c73e0d00 c838b820 c73e0d00 c4c67f78 c73e0d00 c02e8390 c4c66000 c4c67f04
7ee0: c4c67f08 c4c67ef8 c00421e4 c0041ea8 20000093 ffffffff c4c67f1c c4c67f08
7f00: c00421e4 c0041eac c004a144 c004a144 c4c67f6c c4c67f20 c0243984 c00421cc
7f20: c4c67f4c c4c67f30 c4c66000 00989680 ea92ff80 00002181 00000020 c73e0e5c
7f40: c4c67f5c c4c66000 c838b820 c73e0d00 c4c67f78 00000020 c4c66000 c73e0d5c
7f60: c4c67f8c c4c67f70 c004a144 c02437b0 c73e0da0 0090005b c4c67f78 c4c67f78
7f80: c4c67fa4 c4c67f90 c004a2c4 c0049418 00085cb0 00085cd4 00000000 c4c67fa8
7fa0: c00319c0 c004a1f4 00085cb0 00085cd4 00000000 00000000 fbad2088 ffffffff
7fc0: 00085cb0 00085cd4 00000000 00000000 00000000 000a963c 4015e000 0000001c
7fe0: 40017000 befffae0 4007eec8 400d846c 60000010 00000000 d9ecdfd1 eb4747f8
Backtrace:
[<c0049408>] (do_exit+0x0/0xda0) from [<c0037508>] (die+0x350/0x3a0)
[<c00371b8>] (die+0x0/0x3a0) from [<c0039cec>] (__do_kernel_fault+0x6c/0x7c)
[<c0039c80>] (__do_kernel_fault+0x0/0x7c) from [<c0039f88>] (do_page_fault+0x1d4/0x1ec)
 r7 = 00000000  r6 = C73E0D00  r5 = C4C67EE4  r4 = 00000017
[<c0039db4>] (do_page_fault+0x0/0x1ec) from [<c003a0ec>] (do_DataAbort+0x40/0xa4)
[<c003a0ac>] (do_DataAbort+0x0/0xa4) from [<c0031588>] (__dabt_svc+0x48/0x60)
 r8 = C73E0D00  r7 = C4C67F78  r6 = C73E0D00  r5 = C4C67EE4
 r4 = FFFFFFFF
[<c0041e9c>] (dequeue_task+0x0/0x78) from [<c00421e4>] (deactivate_task+0x28/0x34)
[<c00421bc>] (deactivate_task+0x0/0x34) from [<c0243984>] (__schedule+0x1e4/0x5ac)
 r4 = C004A144
[<c02437a0>] (__schedule+0x0/0x5ac) from [<c004a144>] (do_exit+0xd3c/0xda0)
[<c0049408>] (do_exit+0x0/0xda0) from [<c004a2c4>] (sys_exit_group+0x0/0x1c)
[<c004a1e4>] (do_group_exit+0x0/0xe0) from [<c00319c0>] (ret_fast_syscall+0x0/0x2c)
 r5 = 00085CD4  r4 = 00085CB0
Code: e59621f8 e10f1000 e3813080 e121f003 (e5923004)
 <6>note: env[31335] exited with preempt_count 2
Unable to handle kernel NULL pointer dereference at virtual address 00000004
pgd = c0004000
[00000004] *pgd=00000000
Internal error: Oops: 17 [#3]
Modules linked in: musb_hdrc pl2303 usbserial cdc_acm usbcore dm350mmap dsplinkk cmemk
CPU: 0
PC is at do_exit+0xdc/0xda0
LR is at vprintk+0x23c/0x270
pc : [<c00494e4>]    lr : [<c0047a4c>]    Tainted: PF   
sp : c4c67ce8  ip : c4c67c98  fp : c4c67d04
r10: 0000000b  r9 : 00000008  r8 : bf000000
r7 : c00494e4  r6 : c73e0d00  r5 : c4c66000  r4 : c4c66000
r3 : 40000093  r2 : 00000000  r1 : 40000013  r0 : 00000030
Flags: nZcv  IRQs off  FIQs on  Mode SVC_32  Segment user
Control: 5317F  Table: 89A9C000  DAC: 00000015
Process env (pid: 31335, stack limit = 0xc4c661a0)
Stack: (0xc4c67ce8 to 0xc4c68000)

.......

 

  • Hello Richard,

     

    I realize you made this post about a month ago, but I was wondering if you've made any progress?

    We are also using the old 2.6.10 kernel and we've been experiencing this issue (under different circumstances though).

    Unfortunately, the CMEM and DSPLINK modules we won't load when kgdb is enabled so we haven't been able to capture the specific causes but we've noticed that the kernel is much more vulnerable when we are communicating inter-process with named pipes (fifo) or when we are under load and are creating a number of processes (via bash scripting).

     

    Could you share some additional specs of your design to see if we're doing something similar?

     

    We are using DM6441 + CodecEngine 2.10.01.

    We have 128MB ram with 96MB allocated to linux (the rest goes to CMEM).

    We're pretty close to utilizing all of our linux ram and we have swap disabled so maybe that has something to do with it.

     

    Regards,

    David

     

    Legrand

    Home System Division

     

  • Hello -

     

    I'm seeing a very similar kernel NULL pointer deref (in dequeue_task).

     

    Have either of you found a solution to the problem yet?

     

    Thanks,

     

    - Paul

  • Please see this forum post with a more detailed explanation of the problem we were seeing, along with a *solution* from MontaVista:

     

    http://e2e.ti.com/support/embedded/f/354/p/90471/374312.aspx#374312

     

    Hopefully, it will be helpful to others.

     

    - Paul