Kernel panic when use USB wireless netcard zd1211rw 1-1.1:1.0: phy1 #57

atzlinux · 2022-06-01T06:42:51Z

When I plugin SAGEM USB wireless netcard and boot the board, it will get Kernel panic.

The following info copy from serial port:

[ 23.485555] usb 1-1.4: reset high-speed USB device number 4 using xhci-hcd
[ 23.659675] zd1211rw 1-1.4:1.0: phy1
[ 23.668942] usbcore: registered new interface driver zd1211rw

Debian GNU/Linux bookworm/sid Debian-StarFive ttyS0

Debian-StarFive login: [ 23.746895] zd1211rw 1-1.4:1.0 wlx0060b3e722e9: renamed from wlan1
[ 23.897606] zd1211rw 1-1.4:1.0: firmware version 4605
[ 23.911613] zd1211rw 1-1.4:1.0: zd1211 chip 079b:004a v4330 high 00-60-b3 AL2230_RF pa0 g----
[ 24.161500] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
[ 24.385505] Oops - load access fault [#1]
[ 24.389542] Modules linked in: zd1211rw mac80211 libarc4 joydev snd_soc_starfive_pwmdac snd_soc_spdif_tx brcmfmac snd_soc_spdif_rx so
[ 24.424995] dw_mmc 10000000.mmc: Unexpected interrupt latency
[ 24.433832] CPU: 0 PID: 389 Comm: NetworkManager Not tainted 5.18.0-starfive-5.18 #1
[ 24.447259] Hardware name: StarFive VisionFive V1 (DT)
[ 24.452376] epc : zd_mac_rx+0xfc/0x398 [zd1211rw]
[ 24.457264] ra : zd_mac_rx+0xc0/0x398 [zd1211rw]
[ 24.462013] epc : ffffffff0202f94a ra : ffffffff0202f90e sp : ffffffc8044f2b90
[ 24.469205] gp : ffffffff81a4eda8 tp : ffffffd8841ad340 t0 : 000000000000000d
[ 24.476397] t1 : 2603654800000000 t2 : 00000000000012c0 s0 : ffffffc8044f2c50
[ 24.483590] s1 : 000000000000007f a0 : 0000000000000000 a1 : ffffffc80459607a
[ 24.490782] a2 : 000000000000007f a3 : 000000000000001c a4 : ffffffd881ed4e4e
[ 24.497974] a5 : 000000000000000c a6 : 0000000000000000 a7 : 000000000000000c
[ 24.505165] s2 : ffffffd881ed07c0 s3 : 0000000000000075 s4 : 0000000000000075
[ 24.512358] s5 : ffffffc804596005 s6 : ffffffc804596000 s7 : 0000000200000122
[ 24.519551] s8 : ffffffd8846061ac s9 : ffffffffffffffff s10: ffffffd881ed1fc0
[ 24.526742] s11: 0000000000000000 t3 : 0000000038e46e74 t4 : 0000000000000031
[ 24.533934] t5 : ffffffc80431f050 t6 : 0000000000001241
[ 24.539223] status: 0000000200000120 badaddr: ffffffc804596005 cause: 0000000000000005
[ 24.547113] [] handle_rx_packet+0x5e/0x104 [zd1211rw]
[ 24.553766] [] rx_urb_complete+0x124/0x178 [zd1211rw]
[ 24.560413] [] __usb_hcd_giveback_urb+0x78/0x11c
[ 24.566591] [] usb_giveback_urb_bh+0xd8/0x150
[ 24.572492] [] tasklet_action_common.isra.25+0xac/0xe2
[ 24.579173] [] tasklet_action+0x40/0x48
[ 24.584552] [] __do_softirq+0x140/0x310
[ 24.589939] [] irq_exit+0x134/0x16e
[ 24.594973] [] generic_handle_arch_irq+0x66/0x76
[ 24.601140] [] ret_from_exception+0x0/0xc
[ 24.606813] ---[ end trace 0000000000000000 ]---
[ 24.611421] Kernel panic - not syncing: Fatal exception in interrupt
[ 24.617751] SMP: stopping secondary CPUs
[ 24.621679] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---

esmil · 2022-06-01T12:08:51Z

Unfortunately there a problems with USB and CONFIG_PM=y, so make sure that is disabled.

atzlinux · 2022-06-01T12:58:40Z

In my kernel, it's use the default config as your kernel. # CONFIG_PM is not set 在 2022/6/1 20:09, Emil Renner Berthing 写道:

…

Unfortunately there a problems with USB and CONFIG_PM=y, so make sure that is disabled.

[ Upstream commit 0ee7828 ] Since priv->rx_mapping[i] is maped in moxart_mac_open(), we should unmap it from moxart_mac_stop(). Fixes 2 warnings. 1. During error unwinding in moxart_mac_probe(): "goto init_fail;", then moxart_mac_free_memory() calls dma_unmap_single() with priv->rx_mapping[i] pointers zeroed. WARNING: CPU: 0 PID: 1 at kernel/dma/debug.c:963 check_unmap+0x704/0x980 DMA-API: moxart-ethernet 92000000.mac: device driver tries to free DMA memory it has not allocated [device address=0x0000000000000000] [size=1600 bytes] CPU: 0 PID: 1 Comm: swapper Not tainted 5.19.0+ #60 Hardware name: Generic DT based system unwind_backtrace from show_stack+0x10/0x14 show_stack from dump_stack_lvl+0x34/0x44 dump_stack_lvl from __warn+0xbc/0x1f0 __warn from warn_slowpath_fmt+0x94/0xc8 warn_slowpath_fmt from check_unmap+0x704/0x980 check_unmap from debug_dma_unmap_page+0x8c/0x9c debug_dma_unmap_page from moxart_mac_free_memory+0x3c/0xa8 moxart_mac_free_memory from moxart_mac_probe+0x190/0x218 moxart_mac_probe from platform_probe+0x48/0x88 platform_probe from really_probe+0xc0/0x2e4 2. After commands: ip link set dev eth0 down ip link set dev eth0 up WARNING: CPU: 0 PID: 55 at kernel/dma/debug.c:570 add_dma_entry+0x204/0x2ec DMA-API: moxart-ethernet 92000000.mac: cacheline tracking EEXIST, overlapping mappings aren't supported CPU: 0 PID: 55 Comm: ip Not tainted 5.19.0+ #57 Hardware name: Generic DT based system unwind_backtrace from show_stack+0x10/0x14 show_stack from dump_stack_lvl+0x34/0x44 dump_stack_lvl from __warn+0xbc/0x1f0 __warn from warn_slowpath_fmt+0x94/0xc8 warn_slowpath_fmt from add_dma_entry+0x204/0x2ec add_dma_entry from dma_map_page_attrs+0x110/0x328 dma_map_page_attrs from moxart_mac_open+0x134/0x320 moxart_mac_open from __dev_open+0x11c/0x1ec __dev_open from __dev_change_flags+0x194/0x22c __dev_change_flags from dev_change_flags+0x14/0x44 dev_change_flags from devinet_ioctl+0x6d4/0x93c devinet_ioctl from inet_ioctl+0x1ac/0x25c v1 -> v2: Extraneous change removed. Fixes: 6c821bd ("net: Add MOXA ART SoCs ethernet driver") Signed-off-by: Sergei Antonov <saproj@gmail.com> Reviewed-by: Andrew Lunn <andrew@lunn.ch> Link: https://lore.kernel.org/r/20220819110519.1230877-1-saproj@gmail.com Signed-off-by: Jakub Kicinski <kuba@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>

…) to avoid crash [ Upstream commit 68b99e9 ] When CPU 0 is offline and intel_powerclamp is used to inject idle, it generates kernel BUG: BUG: using smp_processor_id() in preemptible [00000000] code: bash/15687 caller is debug_smp_processor_id+0x17/0x20 CPU: 4 PID: 15687 Comm: bash Not tainted 5.19.0-rc7+ #57 Call Trace: <TASK> dump_stack_lvl+0x49/0x63 dump_stack+0x10/0x16 check_preemption_disabled+0xdd/0xe0 debug_smp_processor_id+0x17/0x20 powerclamp_set_cur_state+0x7f/0xf9 [intel_powerclamp] ... ... Here CPU 0 is the control CPU by default and changed to the current CPU, if CPU 0 offlined. This check has to be performed under cpus_read_lock(), hence the above warning. Use get_cpu() instead of smp_processor_id() to avoid this BUG. Suggested-by: Chen Yu <yu.c.chen@intel.com> Signed-off-by: Srinivas Pandruvada <srinivas.pandruvada@linux.intel.com> [ rjw: Subject edits ] Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Sasha Levin <sashal@kernel.org>

We need to probe for IOCP only once during boot stage, as we were probing for IOCP for all the stages this caused the below issue during module-init stage, [9.019104] Unable to handle kernel paging request at virtual address ffffffff8100d3a0 [9.027153] Oops [#1] [9.029421] Modules linked in: rcar_canfd renesas_usbhs i2c_riic can_dev spi_rspi i2c_core [9.037686] CPU: 0 PID: 90 Comm: udevd Not tainted 6.7.0-rc1+ #57 [9.043756] Hardware name: Renesas SMARC EVK based on r9a07g043f01 (DT) [9.050339] epc : riscv_noncoherent_supported+0x10/0x3e [9.055558] ra : andes_errata_patch_func+0x4a/0x52 [9.060418] epc : ffffffff8000d8c2 ra : ffffffff8000d95c sp : ffffffc8003abb00 [9.067607] gp : ffffffff814e25a0 tp : ffffffd80361e540 t0 : 0000000000000000 [9.074795] t1 : 000000000900031e t2 : 0000000000000001 s0 : ffffffc8003abb20 [9.081984] s1 : ffffffff015b57c7 a0 : 0000000000000000 a1 : 0000000000000001 [9.089172] a2 : 0000000000000000 a3 : 0000000000000000 a4 : ffffffff8100d8be [9.096360] a5 : 0000000000000001 a6 : 0000000000000001 a7 : 000000000900031e [9.103548] s2 : ffffffff015b57d7 s3 : 0000000000000001 s4 : 000000000000031e [9.110736] s5 : 8000000000008a45 s6 : 0000000000000500 s7 : 000000000000003f [9.117924] s8 : ffffffc8003abd48 s9 : ffffffff015b1140 s10: ffffffff8151a1b0 [9.125113] s11: ffffffff015b1000 t3 : 0000000000000001 t4 : fefefefefefefeff [9.132301] t5 : ffffffff015b57c7 t6 : ffffffd8b63a6000 [9.137587] status: 0000000200000120 badaddr: ffffffff8100d3a0 cause: 000000000000000f [9.145468] [<ffffffff8000d8c2>] riscv_noncoherent_supported+0x10/0x3e [9.151972] [<ffffffff800027e8>] _apply_alternatives+0x84/0x86 [9.157784] [<ffffffff800029be>] apply_module_alternatives+0x10/0x1a [9.164113] [<ffffffff80008fcc>] module_finalize+0x5e/0x7a [9.169583] [<ffffffff80085cd6>] load_module+0xfd8/0x179c [9.174965] [<ffffffff80086630>] init_module_from_file+0x76/0xaa [9.180948] [<ffffffff800867f6>] __riscv_sys_finit_module+0x176/0x2a8 [9.187365] [<ffffffff80889862>] do_trap_ecall_u+0xbe/0x130 [9.192922] [<ffffffff808920bc>] ret_from_exception+0x0/0x64 [9.198573] Code: 0009 b7e9 6797 014d a783 85a7 c799 4785 0717 0100 (0123) aef7 [9.205994] ---[ end trace 0000000000000000 ]--- This is because we called riscv_noncoherent_supported() for all the stages during IOCP probe. riscv_noncoherent_supported() function sets noncoherent_supported variable to true which has an annotation set to "__ro_after_init" due to which we were seeing the above splat. Fix this by probing for IOCP only once in boot stage by having a boolean variable "done" which will be set to true upon IOCP probe in errata_probe_iocp() and we bail out early if "done" is set to true. While at it make return type of errata_probe_iocp() to void as we were not checking the return value in andes_errata_patch_func(). Fixes: e021ae7 ("riscv: errata: Add Andes alternative ports") Signed-off-by: Lad Prabhakar <prabhakar.mahadev-lad.rj@bp.renesas.com> Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be> Reviewed-by: Yu Chien Peter Lin <peterlin@andestech.com> Link: https://lore.kernel.org/r/20231130212647.108746-1-prabhakar.mahadev-lad.rj@bp.renesas.com Signed-off-by: Palmer Dabbelt <palmer@rivosinc.com>

[ Upstream commit f221033 ] During the removal of the idxd driver, registered offline callback is invoked as part of the clean up process. However, on systems with only one CPU online, no valid target is available to migrate the perf context, resulting in a kernel oops: BUG: unable to handle page fault for address: 000000000002a2b8 #PF: supervisor write access in kernel mode #PF: error_code(0x0002) - not-present page PGD 1470e1067 P4D 0 Oops: 0002 [#1] PREEMPT SMP NOPTI CPU: 0 PID: 20 Comm: cpuhp/0 Not tainted 6.8.0-rc6-dsa+ starfive-tech#57 Hardware name: Intel Corporation AvenueCity/AvenueCity, BIOS BHSDCRB1.86B.2492.D03.2307181620 07/18/2023 RIP: 0010:mutex_lock+0x2e/0x50 ... Call Trace: <TASK> __die+0x24/0x70 page_fault_oops+0x82/0x160 do_user_addr_fault+0x65/0x6b0 __pfx___rdmsr_safe_on_cpu+0x10/0x10 exc_page_fault+0x7d/0x170 asm_exc_page_fault+0x26/0x30 mutex_lock+0x2e/0x50 mutex_lock+0x1e/0x50 perf_pmu_migrate_context+0x87/0x1f0 perf_event_cpu_offline+0x76/0x90 [idxd] cpuhp_invoke_callback+0xa2/0x4f0 __pfx_perf_event_cpu_offline+0x10/0x10 [idxd] cpuhp_thread_fun+0x98/0x150 smpboot_thread_fn+0x27/0x260 smpboot_thread_fn+0x1af/0x260 __pfx_smpboot_thread_fn+0x10/0x10 kthread+0x103/0x140 __pfx_kthread+0x10/0x10 ret_from_fork+0x31/0x50 __pfx_kthread+0x10/0x10 ret_from_fork_asm+0x1b/0x30 <TASK> Fix the issue by preventing the migration of the perf context to an invalid target. Fixes: 81dd4d4 ("dmaengine: idxd: Add IDXD performance monitor support") Reported-by: Terrence Xu <terrence.xu@intel.com> Tested-by: Terrence Xu <terrence.xu@intel.com> Signed-off-by: Fenghua Yu <fenghua.yu@intel.com> Link: https://lore.kernel.org/r/20240313214031.1658045-1-fenghua.yu@intel.com Signed-off-by: Vinod Koul <vkoul@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kernel panic when use USB wireless netcard zd1211rw 1-1.1:1.0: phy1 #57

Kernel panic when use USB wireless netcard zd1211rw 1-1.1:1.0: phy1 #57

atzlinux commented Jun 1, 2022

esmil commented Jun 1, 2022

atzlinux commented Jun 1, 2022 via email

Kernel panic when use USB wireless netcard zd1211rw 1-1.1:1.0: phy1 #57

Kernel panic when use USB wireless netcard zd1211rw 1-1.1:1.0: phy1 #57

Comments

atzlinux commented Jun 1, 2022

esmil commented Jun 1, 2022

atzlinux commented Jun 1, 2022 via email