SK-TDA4VM fails unexpectedly: Oops: Fatal exception in interrupt

Other Parts Discussed in Thread: J721EXCPXEVM

Hello,

I am using a TDA4 (SK-TDA4VM) both on my desk and in our system, and it does not work reliably in either setup.

Case 1) While the board is running, whether I am connected over SSH, over UART, or not at all, it eventually hits a fatal kernel fault, regardless of processor load. After the fault, SSH becomes unreachable and the UART stops responding; only a power cycle recovers the board. Below are two fault reports that I captured over UART just before the hang. The board still appears to be powered afterwards, and the processor stays hot.
 

Arago 2021.09 tda4vm-sk ttyS2

tda4vm-sk login: [   31.073403] Initializing XFRM netlink socket
[   32.954414] bridge: filtering via arp/ip/ip6tables is no longer available by default. Update your scripts to load br_netfilter if you need this.
[   32.973202] Bridge firewalling registered
[   33.872256] process 'docker/tmp/qemu-check530355309/check' started with executable stack
[   54.458704] IPv6: ADDRCONF(NETDEV_CHANGE): can0: link becomes ready
[   55.474769] IPv6: ADDRCONF(NETDEV_CHANGE): can1: link becomes ready
[   55.481108] IPv6: ADDRCONF(NETDEV_CHANGE): can2: link becomes ready
[   55.487627] IPv6: ADDRCONF(NETDEV_CHANGE): can3: link becomes ready
[  143.162787] Unable to handle kernel paging request at virtual address ffbf80001152d6f0
[  143.170700] Mem abort info:
[  143.173495]   ESR = 0x96000004
[  143.173745] Unable to handle kernel paging request at virtual address ffff800010fcff00
[  143.176548]   EC = 0x25: DABT (current EL), IL = 32 bits
[  143.184440] Mem abort info:
[  143.184441]   ESR = 0x96000047
[  143.184443]   EC = 0x25: DABT (current EL), IL = 32 bits
[  143.184445] Unable to handle kernel paging request at virtual address ffbf00087fde8be0
[  143.184447]   SET = 0, FnV = 0
[  143.184448] Mem abort info:
[  143.184449]   EA = 0, S1PTW = 0
[  143.184450]   ESR = 0x96000044
[  143.184451] Data abort info:
[  143.184453]   EC = 0x25: DABT (current EL), IL = 32 bits
[  143.184454]   ISV = 0, ISS = 0x00000047
[  143.184455]   SET = 0, FnV = 0
[  143.184456]   CM = 0, WnR = 1
[  143.184457]   EA = 0, S1PTW = 0
[  143.184459] swapper pgtable: 4k pages, 48-bit VAs, pgdp=0000000082f38000
[  143.184459] Data abort info:
[  143.184461] [ffff800010fcff00] pgd=00000008fffff003
[  143.184462]   ISV = 0, ISS = 0x00000044
[  143.184463] , p4d=00000008fffff003
[  143.184464]   CM = 0, WnR = 1
[  143.184465] , pud=00000008ffffe003
[  143.184466] [ffbf00087fde8be0] address between user and kernel address ranges
[  143.184467] , pmd=00000008ffffc003, pte=0000000000000000
[  143.184470] Internal error: Oops: 96000044 [#1] PREEMPT SMP
[  143.184471]
[  143.184472] Modules linked in: xt_conntrack xt_MASQUERADE xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ip_tables x_tables br_netfilter bridge stp llc overlay xfrm_user xfrm_algo md5 ecb aes_neon_bs aes_neon_blk des_generic libdes cbc bluetooth ecdh_generic ecc rfkill xhci_plat_hcd xhci_hcd omap_rng cdns3 udc_core roles irq_pruss_intc pru_rproc rpmsg_char icss_iep usbcore usb_common crct10dif_ce phy_can_transceiver sa2ul pruss ti_k3_r5_remoteproc pvrsrvkm(O) vxd_dec sha512_generic vxe_enc authenc videobuf2_dma_contig ti_k3_dsp_remoteproc videobuf2_dma_sg v4l2_mem2mem pci_endpoint_test virtio_rpmsg_bus cdns_dphy videobuf2_memops videobuf2_v4l2 videobuf2_common cdns3_ti m_can_platform m_can can_dev rti_wdt optee_rng rng_core sch_fq_codel rpmsg_kdrv_switch cryptodev(O) ipv6
[  143.184568] CPU: 1 PID: 1503 Comm: node Tainted: G           O      5.10.162-g76b3e88d56 #1
[  143.184570] Hardware name: Texas Instruments J721E SK (DT)
[  143.184573] pstate: 60000085 (nZCv daIf -PAN -UAO -TCO BTYPE=--)
[  143.184584] pc : insert_work+0x2c/0xb0
[  143.184587] lr : __queue_work+0x104/0x390
[  143.184588] sp : ffff8000112c3d50
[  143.184589] x29: ffff8000112c3d50 x28: ffff0008473f9590
[  143.184593] x27: 0000000000000100 x26: ffff800010fc4008
[  143.184596] x25: ffff800011119c58 x24: ffff800010fc4008
[  143.184599] x23: 0000000000000034 x22: 0000000000000001
[  143.184602] x21: ffff000847148a00 x20: ffff00087f9e8bc0
[  143.184605] x19: ffff00087f9ee400 x18: 0000000000000000
[  143.184609] x17: 0000000000000000 x16: 0000000000000000
[  143.184611] x15: 0000268a3cd5a83d x14: 00000000000002e2
[  143.184615] x13: 00000000000000fb x12: 0000000000000001
[  143.184618] x11: 0000000000000040 x10: ffff80001112d9e8
[  143.184621] x9 : ffff80001112d9e0 x8 : ffff000847400028
[  143.184624] x7 : 0000000000000000 x6 : 0000000000000000
[  143.184627] x5 : ffff00087f9e8bc0 x4 : ffff0008473f9598
[  143.184630] x3 : ffbf00087fde8be0 x2 : ffff00087f9e8be0
[  143.184633] x1 : ffff0008473f9590 x0 : ffff00087f9ee400
[  143.184636] Call trace:
[  143.184639]  insert_work+0x2c/0xb0
[  143.184642]  queue_work_on+0x5c/0x80
[  143.184646]  rpm_suspend+0x400/0x460
[  143.184648]  pm_suspend_timer_fn+0x7c/0x80
[  143.184652]  __hrtimer_run_queues+0x114/0x1b8
[  143.184655]  hrtimer_interrupt+0xe8/0x248
[  143.184659]  arch_timer_handler_phys+0x34/0x48
[  143.184664]  handle_percpu_devid_irq+0x84/0x148
[  143.184667]  generic_handle_irq+0x30/0x48
[  143.184669]  __handle_domain_irq+0x64/0xc0
[  143.184673]  gic_handle_irq+0x58/0x128
[  143.184676]  el1_irq+0xcc/0x180
[  143.184679]  vprintk_emit+0x1f8/0x268
[  143.184682]  vprintk_default+0x38/0x48
[  143.184684]  vprintk_func+0xf4/0x2a8
[  143.184689]  printk+0x60/0x84
[  143.184691]  mem_abort_decode+0x64/0x124
[  143.184694]  die_kernel_fault+0x48/0x78
[  143.184697]  __do_kernel_fault+0x74/0x148
[  143.184699]  do_bad_area+0x5c/0x68
[  143.184701]  do_translation_fault+0x38/0x68
[  143.184703]  do_mem_abort+0x40/0xa0
[  143.184706]  el1_abort+0x48/0x70
[  143.184708]  el1_sync_handler+0xac/0xc8
[  143.184710]  el1_sync+0x88/0x140
[  143.184714]  sysctl_max_threads+0x198/0x4b0
[  143.184718]  finish_task_switch+0x170/0x200
[  143.184722]  __schedule+0x278/0x720
[  143.184724]  schedule+0x70/0x108
[  143.184728]  futex_wait_queue_me+0xb0/0x138
[  143.184730]  futex_wait+0xd8/0x1f0
[  143.184732]  do_futex+0x134/0xb60
[  143.184735]  __arm64_sys_futex+0x118/0x190
[  143.184738]  el0_svc_common.constprop.0+0x78/0x1c8
[  143.184741]  do_el0_svc+0x24/0x90
[  143.184743]  el0_svc+0x14/0x20
[  143.184746]  el0_sync_handler+0xb0/0xb8
[  143.184748]  el0_sync+0x180/0x1c0
[  143.184753] Code: f8008483 f9400443 f9000444 a9008c22 (f9000064)
[  143.184760] ---[ end trace 23669914b49ec935 ]---
[  143.184763] Kernel panic - not syncing: Oops: Fatal exception in interrupt
[  143.184766] SMP: stopping secondary CPUs
[  144.252269] SMP: failed to stop secondary CPUs 0-1
[  144.252279] Kernel Offset: disabled
[  144.252281] CPU features: 0x28040022,20006008
[  144.252282] Memory Limit: none
[  144.721176] ---[ end Kernel panic - not syncing: Oops: Fatal exception in interrupt ]---
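
The ESR values in the report above can be decoded by hand. As a minimal sketch, assuming the standard ARMv8-A ESR_EL1 layout for data aborts (EC in bits 31:26, IL in bit 25, WnR in bit 6, fault status code in bits 5:0), the following Python helper reproduces the fields that the kernel prints. The function name `decode_esr` is only illustrative and is not part of any TI or kernel tool.

```python
def decode_esr(esr):
    """Split a 32-bit ESR_EL1 value into the fields shown in the oops.

    Assumes the ARMv8-A data abort encoding; other exception classes
    use a different ISS layout and are not handled here.
    """
    ec = (esr >> 26) & 0x3F    # exception class, 0x25 = data abort at current EL
    il = (esr >> 25) & 0x1     # 1 = 32-bit instruction
    iss = esr & 0x1FFFFFF      # instruction specific syndrome
    wnr = (iss >> 6) & 0x1     # 1 = fault caused by a write, 0 = by a read
    dfsc = iss & 0x3F          # data fault status code
    if 0x04 <= dfsc <= 0x07:
        fault = "translation fault, level %d" % (dfsc - 0x04)
    else:
        fault = "other fault status 0x%02x" % dfsc
    return {"EC": ec, "IL": il, "ISS": iss, "WnR": wnr, "DFSC": dfsc, "fault": fault}


# ESR values taken from the two fault reports in this post.
for value in (0x96000004, 0x96000044, 0x96000047):
    fields = decode_esr(value)
    access = "write" if fields["WnR"] else "read"
    print("ESR 0x%08x: EC=0x%02x, %s, %s" % (value, fields["EC"], access, fields["fault"]))
```

Under that layout, all three values are data aborts taken at the current exception level (EC 0x25) and all are translation faults: 0x96000044 and 0x96000047 are writes, 0x96000004 is a read.
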
[   54.502477] IPv6: ADDRCONF(NETDEV_CHANGE): can0: link becomes ready
[   55.515935] IPv6: ADDRCONF(NETDEV_CHANGE): can1: link becomes ready
[   55.522270] IPv6: ADDRCONF(NETDEV_CHANGE): can2: link becomes ready
[   55.528935] IPv6: ADDRCONF(NETDEV_CHANGE): can3: link becomes ready
[ 4451.491918] Unable to handle kernel paging request at virtual address ffbf000853c30f40
[ 4451.495663] Unable to handle kernel paging request at virtual address ffbf000853c4ed70
[ 4451.499818] Mem abort info:
[ 4451.499820]   ESR = 0x96000004
[ 4451.499822]   EC = 0x25: DABT (current EL), IL = 32 bits
[ 4451.499825]   SET = 0, FnV = 0
[ 4451.507720] Mem abort info:
[ 4451.510496]   EA = 0, S1PTW = 0
[ 4451.510497] Data abort info:
[ 4451.510498]   ISV = 0, ISS = 0x00000004
[ 4451.510499]   CM = 0, WnR = 0
[ 4451.510501] [ffbf000853c30f40] address between user and kernel address ranges
[ 4451.510505] Internal error: Oops: 96000004 [#1] PREEMPT SMP
[ 4451.510507] Modules linked in: xt_conntrack
[ 4451.513553]   ESR = 0x96000004
[ 4451.518842]  xt_MASQUERADE xt_addrtype iptable_filter
[ 4451.521888]   EC = 0x25: DABT (current EL), IL = 32 bits
[ 4451.524663]  iptable_nat nf_nat nf_conntrack
[ 4451.527792]   SET = 0, FnV = 0
[ 4451.530655]  nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ip_tables
[ 4451.534481]   EA = 0, S1PTW = 0
[ 4451.537429]  x_tables br_netfilter bridge
[ 4451.544547] Data abort info:
[ 4451.550096]  stp llc overlay xfrm_user xfrm_algo md5
[ 4451.554272]   ISV = 0, ISS = 0x00000004
[ 4451.557307]  ecb aes_neon_bs aes_neon_blk
[ 4451.562345]   CM = 0, WnR = 0
[ 4451.567635]  des_generic libdes cbc
[ 4451.571893] [ffbf000853c4ed70] address between user and kernel address ranges
[ 4451.574928]  bluetooth ecdh_generic ecc rfkill xhci_plat_hcd xhci_hcd omap_rng rpmsg_char cdns3 udc_core irq_pruss_intc icss_iep roles pru_rproc usbcore usb_common crct10dif_ce phy_can_transceiver ti_k3_r5_remoteproc pci_endpoint_test pruss sa2ul pvrsrvkm(O) vxe_enc ti_k3_dsp_remoteproc vxd_dec cdns_dphy sha512_generic authenc cdns3_ti virtio_rpmsg_bus videobuf2_dma_sg v4l2_mem2mem videobuf2_dma_contig videobuf2_memops videobuf2_v4l2 videobuf2_common m_can_platform m_can can_dev rti_wdt optee_rng rng_core sch_fq_codel rpmsg_kdrv_switch cryptodev(O) ipv6
[ 4451.665974] CPU: 1 PID: 1538 Comm: node Tainted: G           O      5.10.162-g76b3e88d56 #1
[ 4451.674301] Hardware name: Texas Instruments J721E SK (DT)
[ 4451.679770] pstate: 20000085 (nzCv daIf -PAN -UAO -TCO BTYPE=--)
[ 4451.685766] pc : update_curr+0x34/0x1a8
[ 4451.689588] lr : task_tick_fair+0x6c/0x680
[ 4451.693668] sp : ffff8000112c3d00
[ 4451.696968] x29: ffff8000112c3d00 x28: ffff8000100eddc0
[ 4451.702264] x27: ffff00087f9e6340 x26: 0000000000000006
[ 4451.707560] x25: ffff80001121d000 x24: ffff800011119000
[ 4451.712857] x23: ffff800010fc5a68 x22: ffff000853830e40
[ 4451.718152] x21: ffff00087f9e9240 x20: ffff00084e289400
[ 4451.723448] x19: ffbf000853c30f00 x18: 0000000000000000
[ 4451.728744] x17: 0000000000000000 x16: 0000000000000000
[ 4451.734041] x15: 0000000000000000 x14: 0000000000000000
[ 4451.739336] x13: 0000000000000000 x12: 0000000000000000
[ 4451.744633] x11: 0000000000000301 x10: 0000000000000004
[ 4451.749929] x9 : 0000000000000000 x8 : 0000000000000301
[ 4451.755225] x7 : ffff00087f9e9d80 x6 : ffff00087f9e9240
[ 4451.760521] x5 : 000000000000b97f x4 : 0000000000000000
[ 4451.765816] x3 : ffff80001008b970 x2 : 0000000000000000
[ 4451.771111] x1 : ffff000853830e40 x0 : 00000406929c0508
[ 4451.776408] Call trace:
[ 4451.778842]  update_curr+0x34/0x1a8
[ 4451.782317]  task_tick_fair+0x6c/0x680
[ 4451.786052]  scheduler_tick+0xb8/0x158
[ 4451.789789]  update_process_times+0x8c/0xa0
[ 4451.793957]  tick_sched_handle.isra.0+0x34/0x50
[ 4451.798471]  tick_sched_timer+0x4c/0xa8
[ 4451.802291]  __hrtimer_run_queues+0x114/0x1b8
[ 4451.806633]  hrtimer_interrupt+0xe8/0x248
[ 4451.810630]  arch_timer_handler_phys+0x34/0x48
[ 4451.815060]  handle_percpu_devid_irq+0x84/0x148
[ 4451.819576]  generic_handle_irq+0x30/0x48
[ 4451.823569]  __handle_domain_irq+0x64/0xc0
[ 4451.827651]  gic_handle_irq+0x58/0x128
[ 4451.831386]  el0_irq_naked+0x4c/0x54
[ 4451.834950] Code: 7100041f 54000969 f944e2a0 b40002f3 (f9402275)
[ 4451.841029] ---[ end trace df15f55af966bda3 ]---
[ 4451.845630] Kernel panic - not syncing: Oops: Fatal exception in interrupt
[ 4451.852485] SMP: stopping secondary CPUs
[ 4452.923892] SMP: failed to stop secondary CPUs 0-1
[ 4452.928675] Kernel Offset: disabled
[ 4452.932149] CPU features: 0x28040022,20006008
[ 4452.936490] Memory Limit: none
[ 4452.939534] ---[ end Kernel panic - not syncing: Oops: Fatal exception in interrupt ]---
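
One detail that stands out in both reports: four of the five faulting addresses start with ffbf instead of ffff. With 48-bit VAs (as the log states), a kernel address is expected to have bits 63:48 all set, which is why the kernel reports these as "between user and kernel address ranges". The sketch below, with the illustrative helper name `cleared_upper_bits`, lists which of those upper bits are cleared in each address. This is only an observation drawn from the logs, not a diagnosis; a single cleared bit is what a corrupted pointer would look like, but the logs alone do not show where the corruption comes from.

```python
VA_BITS = 48  # from "48-bit VAs" in the oops output


def cleared_upper_bits(addr, va_bits=VA_BITS):
    """Return the bit positions in addr[63:va_bits] that are 0.

    An empty list means the upper bits are all ones, as expected
    for a kernel virtual address.
    """
    return [bit for bit in range(va_bits, 64) if not (addr >> bit) & 1]


# Faulting addresses copied from the two fault reports in this post.
FAULT_ADDRESSES = (
    0xFFBF80001152D6F0,
    0xFFFF800010FCFF00,
    0xFFBF00087FDE8BE0,
    0xFFBF000853C30F40,
    0xFFBF000853C4ED70,
)

for addr in FAULT_ADDRESSES:
    print("0x%016x cleared upper bits: %s" % (addr, cleared_upper_bits(addr)))
```

For the four ffbf addresses the only cleared upper bit is bit 54; ffff800010fcff00 has none cleared.
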


Case 2) The TDA4 receives and sends a large number of UDP packets; for example, it handles a stream from the camera. After some error, which I cannot identify from the Ethernet port, the TDA4 stops communicating. When we reduce the number of UDP packets sent, the problem disappears, but that is not an acceptable solution for us.
Perhaps you have already seen this issue and know of a solution.
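
To narrow down Case 2, one option is to compare the kernel's UDP counters before and after the traffic stops. The sketch below parses the `Udp:` rows of `/proc/net/snmp`, a standard Linux file, and reports which counters changed. The helper names and the sample numbers are made up for illustration; they are not measurements from this board, and whether these counters move at all in this failure is an open question.

```python
def parse_udp_counters(snmp_text):
    """Parse the two 'Udp:' rows (names, then values) of /proc/net/snmp."""
    rows = [line.split() for line in snmp_text.splitlines()
            if line.startswith("Udp:")]  # excludes the 'UdpLite:' rows
    if len(rows) < 2:
        raise ValueError("no Udp header/value pair found")
    names, values = rows[0][1:], rows[1][1:]
    return dict(zip(names, (int(v) for v in values)))


def counter_delta(before, after):
    """Return only the counters that changed between two snapshots."""
    return {name: after[name] - before[name]
            for name in after
            if name in before and after[name] != before[name]}


def read_udp_counters(path="/proc/net/snmp"):
    """Read the live counters on the target (not called in this sketch)."""
    with open(path) as handle:
        return parse_udp_counters(handle.read())


# Illustrative snapshots with invented numbers, NOT data from the board.
SAMPLE_BEFORE = (
    "Udp: InDatagrams NoPorts InErrors OutDatagrams RcvbufErrors SndbufErrors\n"
    "Udp: 1000 2 0 900 0 0\n"
)
SAMPLE_AFTER = (
    "Udp: InDatagrams NoPorts InErrors OutDatagrams RcvbufErrors SndbufErrors\n"
    "Udp: 5000 2 40 4800 40 0\n"
)

print(counter_delta(parse_udp_counters(SAMPLE_BEFORE),
                    parse_udp_counters(SAMPLE_AFTER)))
```

On the target, `read_udp_counters()` could be called once while traffic is flowing and again after communication stops; a growing `RcvbufErrors` or `InErrors` would point at dropped datagrams in the socket layer rather than at the link itself.
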

The boot log may also be useful.

tda4_boot_log.txt

Mikita