Description of problem: Quite frequently specially when the wifi network interface is loaded (a few web pages loading is often enough) The crash symptoms are - no more network - errors in system logs - see here after It happens for some reason * that the wifi refuses to start after such a crash and a reboot. Several linux reboots does not help but rebooting under windows and the under mageia solves the issue... * might it be due to a partial hw initialisation ? I don't know how to investigate further It looks like windows is working fine with it. So i don't think there's an hardware issue I have this issue for several monthes, maybe 6. I did not find anything on the web which looks relevant. I have been running uptodate cauldron during this period, hopping that an upgrade would eventually fix it. It did not :( Does someone also meet this issue and/or has an idea how to investigate it ? Version-Release number of selected component (if applicable): # uname -a Linux uranus 6.3.2-desktop-3.mga9 #1 SMP PREEMPT_DYNAMIC Sun May 14 13:59:04 UTC 2023 x86_64 GNU/Linux # rpm -q -a |grep kernel kernel-firmware-20230210-1.mga9 kernel-firmware-nonfree-20230427-1.mga9.nonfree kernel-desktop-6.3.2-2.mga9 kernel-desktop-devel-6.3.2-2.mga9 kernel-desktop-6.3.2-3.mga9 kernel-desktop-latest-6.3.2-3.mga9 kernel-userspace-headers-6.3.2-3.mga9 kernel-desktop-devel-6.3.2-3.mga9 kernel-desktop-devel-latest-6.3.2-3.mga9 # rpm -q -a |grep wifi|sort iwlwifi-3945-ucode-15.32.2.9-11.mga9.nonfree iwlwifi-4965-ucode-228.61.2.24-13.mga9.nonfree iwlwifi-firmware-20230427-1.mga9.nonfree networkmanager-wifi-1.40.18-2.mga9 rtlwifi-firmware-20230427-1.mga9.nonfree wmwifi-0.6-12.mga9 # lshw description: Notebook product: ASUS TUF Gaming F17 FX706HCB_TUF766HCB vendor: ASUSTeK COMPUTER INC. version: 1.0 serial: N1NRCX00J643012 width: 4294967295 bits capabilities: smbios-3.3 dmi-3.3 smp vsyscall32 configuration: boot=normal chassis=notebook family=ASUS TUF Gaming F17 uuid=1F84672F-1E63-4B48-85FC-339548AD0E3A *-core description: Motherboard product: FX706HCB vendor: ASUSTeK COMPUTER INC. physical id: 0 version: 1.0 serial: N101NRCX000WZ3MB slot: Default string *-firmware description: BIOS vendor: American Megatrends International, LLC. physical id: 0 version: FX706HCB.313 date: 03/03/2023 size: 64KiB capacity: 15MiB capabilities: pci upgrade shadowing cdboot bootselect socketedrom edd int13floppynec int13floppytoshiba int13floppy360 int13floppy1200 int13floppy720 int13floppy2880 int5printscreen int9keyboard int14serial int17printer int10video acpi usb biosbootspecification uefi [...] *-pci:1 description: PCI bridge product: Intel Corporation vendor: Intel Corporation physical id: 1c bus info: pci@0000:00:1c.0 version: 11 width: 32 bits clock: 33MHz capabilities: pci pciexpress msi pm normal_decode bus_master cap_list configuration: driver=pcieport resources: irq:123 ioport:6102100000(size=2097152) *-network description: Wireless interface product: MT7921 802.11ax PCI Express Wireless Network Adapter vendor: MEDIATEK Corp. physical id: 0 bus info: pci@0000:02:00.0 logical name: wlp2s0 version: 00 serial: 14:13:33:09:b2:1b width: 64 bits clock: 33MHz capabilities: pciexpress msi pm bus_master cap_list ethernet physical wireless configuration: broadcast=yes driver=mt7921e driverversion=6.3.2-desktop-3.mga9 firmware=____010000-20230331110939 ip=192.168.1.19 latency=0 link=yes multicast=yes wireless=IEEE 802.11 resources: iomemory:610-60f iomemory:610-60f iomemory:610-60f irq:162 memory:6102100000-61021fffff memory:6102200000-6102203fff memory:6102204000-6102204fff # lspci 0000:02:00.0 Network controller: MEDIATEK Corp. MT7921 802.11ax PCI Express Wireless Network Adapter Subsystem: AzureWave Device 4680 Flags: bus master, fast devsel, latency 0, IRQ 162 Memory at 6102100000 (64-bit, prefetchable) [size=1M] Memory at 6102200000 (64-bit, prefetchable) [size=16K] Memory at 6102204000 (64-bit, prefetchable) [size=4K] Capabilities: [80] Express Endpoint, MSI 00 Capabilities: [e0] MSI: Enable+ Count=1/32 Maskable+ 64bit+ Capabilities: [f8] Power Management version 3 Capabilities: [100] Vendor Specific Information: ID=1556 Rev=1 Len=008 <?> Capabilities: [108] Latency Tolerance Reporting Capabilities: [110] L1 PM Substates Capabilities: [200] Advanced Error Reporting Kernel driver in use: mt7921e Kernel modules: mt7921e May 15 19:13:00 uranus dbus-daemon[958]: [system] Successfully activated service 'org.freedesktop.hostname1' May 15 19:14:06 uranus kernel: [29727.674116] wlp2s0: deauthenticating from e8:d2:ff:d6:ac:b5 by local choice (Reason: 3=DEAUTH_LEAVING) May 15 19:14:09 uranus kernel: [29730.717258] mt7921e 0000:02:00.0: Message 00020002 (seq 15) timeout May 15 19:14:10 uranus kernel: [29731.737204] mt7921e 0000:02:00.0: driver own failed May 15 19:14:12 uranus kernel: [29733.012118] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:13 uranus kernel: [29734.032105] mt7921e 0000:02:00.0: driver own failed May 15 19:14:14 uranus kernel: [29735.307091] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:15 uranus kernel: [29736.327078] mt7921e 0000:02:00.0: driver own failed May 15 19:14:16 uranus kernel: [29737.602063] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:19 uranus kernel: [29740.123626] BUG: kernel NULL pointer dereference, address: 0000000000000008 May 15 19:14:19 uranus kernel: [29740.123632] #PF: supervisor read access in kernel mode May 15 19:14:19 uranus kernel: [29740.123634] #PF: error_code(0x0000) - not-present page May 15 19:14:19 uranus kernel: [29740.123636] PGD 166176067 P4D 166176067 PUD 0 May 15 19:14:19 uranus kernel: [29740.123647] Oops: 0000 [#1] PREEMPT SMP NOPTI May 15 19:14:19 uranus kernel: [29740.123649] CPU: 9 PID: 11205 Comm: HTML5 Parser Not tainted 6.3.2-desktop-3.mga9 #1 May 15 19:14:19 uranus kernel: [29740.123651] Hardware name: ASUSTeK COMPUTER INC. ASUS TUF Gaming F17 FX706HCB_TUF766HCB/FX706HCB, BIOS FX706HCB.313 03/03/2023 May 15 19:14:19 uranus kernel: [29740.123653] RIP: 0010:__mod_lruvec_page_state+0xf/0x140 May 15 19:14:19 uranus kernel: [29740.123659] Code: 41 5c c3 cc cc cc cc 0f 1f 40 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 0f 1f 44 00 00 41 56 41 55 41 54 55 89 f5 53 <4c> 8b 67 08 89 d3 41 f6 c4 01 0f 85 cd 00 00 00 0f 1f 44 00 00 49 May 15 19:14:19 uranus kernel: [29740.123661] RSP: 0018:ffffb286cca37d30 EFLAGS: 00010002 May 15 19:14:19 uranus kernel: [29740.123663] RAX: 0000000000000202 RBX: 0000000000000202 RCX: 0000000000000000 May 15 19:14:19 uranus kernel: [29740.123665] RDX: 00000000fffffffc RSI: 0000000000000025 RDI: 0000000000000000 May 15 19:14:19 uranus kernel: [29740.123666] RBP: 0000000000000025 R08: ffff992c7896f280 R09: ffffffffbe84a8e0 May 15 19:14:19 uranus kernel: [29740.123667] R10: ffffb286cca37cc8 R11: ffffb286cca37cd0 R12: ffff992c7d6cb480 May 15 19:14:19 uranus kernel: [29740.123668] R13: ffff992ca5f3a100 R14: ffffffffbe84a8e0 R15: ffff992c44266470 May 15 19:14:19 uranus kernel: [29740.123669] FS: 0000000000000000(0000) GS:ffff992fbb640000(0000) knlGS:0000000000000000 May 15 19:14:19 uranus kernel: [29740.123671] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 15 19:14:19 uranus kernel: [29740.123672] CR2: 0000000000000008 CR3: 000000017fa94003 CR4: 0000000000770ee0 May 15 19:14:19 uranus kernel: [29740.123673] PKRU: 55555554 May 15 19:14:19 uranus kernel: [29740.123674] Call Trace: May 15 19:14:19 uranus kernel: [29740.123676] <TASK> May 15 19:14:19 uranus kernel: [29740.123679] mod_lruvec_page_state.constprop.0+0x1c/0x30 May 15 19:14:19 uranus kernel: [29740.123683] exit_task_stack_account+0x2c/0x60 May 15 19:14:19 uranus kernel: [29740.123685] do_exit+0x792/0xa70 May 15 19:14:19 uranus kernel: [29740.123689] do_group_exit+0x2d/0x80 May 15 19:14:19 uranus kernel: [29740.123691] get_signal+0x996/0x9a0 May 15 19:14:19 uranus kernel: [29740.123695] arch_do_signal_or_restart+0x2a/0x250 May 15 19:14:19 uranus kernel: [29740.123700] exit_to_user_mode_prepare+0x19a/0x1f0 May 15 19:14:19 uranus kernel: [29740.123704] syscall_exit_to_user_mode+0x17/0x40 May 15 19:14:19 uranus kernel: [29740.123708] do_syscall_64+0x46/0x90 May 15 19:14:19 uranus kernel: [29740.123710] entry_SYSCALL_64_after_hwframe+0x72/0xdc May 15 19:14:19 uranus kernel: [29740.123716] RIP: 0033:0x7fe45788e736 May 15 19:14:19 uranus kernel: [29740.123718] Code: Unable to access opcode bytes at 0x7fe45788e70c. May 15 19:14:19 uranus kernel: [29740.123719] RSP: 002b:00007fe443891910 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca May 15 19:14:19 uranus kernel: [29740.123721] RAX: fffffffffffffe00 RBX: 0000000000000000 RCX: 00007fe45788e736 May 15 19:14:19 uranus kernel: [29740.123722] RDX: 0000000000000000 RSI: 0000000000000189 RDI: 00007fe44321f37c May 15 19:14:19 uranus kernel: [29740.123723] RBP: 0000000000000000 R08: 0000000000000000 R09: 00000000ffffffff May 15 19:14:19 uranus kernel: [29740.123724] R10: 0000000000000000 R11: 0000000000000246 R12: 00007fe44321f320 May 15 19:14:19 uranus kernel: [29740.123725] R13: 0000000000000000 R14: 0000000000000001 R15: 00007fe44321f37c May 15 19:14:19 uranus kernel: [29740.123728] </TASK> May 15 19:14:19 uranus kernel: [29740.123729] Modules linked in: rfcomm ip6t_REJECT nf_reject_ipv6 xt_comment ip6table_mangle ip6table_nat ip6table_raw ip6table_filter ip6_tables xt_recent ipt_REJECT n f_reject_ipv4 xt_conntrack xt_hashlimit xt_addrtype xt_mark iptable_mangle iptable_nat xt_CT xt_tcpudp iptable_raw xt_NFLOG nfnetlink_log xt_LOG nf_log_syslog nf_nat_tftp nf_nat_snmp_basic nf_conntrack _snmp nf_nat_sip nf_nat_pptp nf_nat_irc nf_nat_h323 nf_nat_ftp nf_nat_amanda ts_kmp nf_conntrack_amanda nf_nat nf_conntrack_sane nf_conntrack_tftp nf_conntrack_sip nf_conntrack_pptp nf_conntrack_netlin k tls nfnetlink nf_conntrack_netbios_ns nf_conntrack_broadcast nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter ccm af_packet qrtr cmac algi f_hash algif_skcipher af_alg bnep cuse nls_utf8 nls_cp437 vfat fat dm_mirror dm_region_hash dm_log snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_ cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp May 15 19:14:19 uranus kernel: [29740.123773] joydev snd_sof uvcvideo snd_sof_utils snd_soc_hdac_hda snd_hda_codec_realtek uvc snd_hda_ext_core videobuf2_vmalloc videobuf2_memops snd_soc_acpi_intel_ma tch x86_pkg_temp_thermal videobuf2_v4l2 snd_soc_acpi intel_powerclamp mt7921e snd_hda_codec_generic coretemp snd_soc_core videodev mt7921_common mt76_connac_lib snd_hda_codec_hdmi kvm_intel mt76 videob uf2_common snd_compress ac97_bus snd_pcm_dmaengine snd_hda_intel mc snd_intel_dspcfg mac80211 snd_intel_sdw_acpi kvm hid_multitouch snd_hda_codec snd_hda_core btusb processor_thermal_device_pci_legacy processor_thermal_device snd_hwdep r8169 snd_pcm irqbypass processor_thermal_rfim spi_nor btmtk mei_hdcp mei_pxp btrtl iTCO_wdt asus_nb_wmi processor_thermal_mbox realtek btbcm intel_pmc_bxt rapl mtd c fg80211 ee1004 asus_wmi iTCO_vendor_support snd_timer pmt_telemetry btintel processor_thermal_rapl intel_lpss_pci intel_cstate mdio_devres intel_rapl_msr pmt_class mei_me snd intel_lpss ledtrig_audio i 2c_i801 intel_rapl_common bluetooth spi_intel_pci intel_uncore May 15 19:14:19 uranus kernel: [29740.123822] platform_profile mei libphy wmi_bmof libarc4 soundcore idma64 spi_intel i2c_smbus intel_soc_dts_iosf intel_vsec tpm_crb ecdh_generic int3403_thermal int34 0x_thermal_zone tpm_tis tpm_tis_core rfkill tpm ecc intel_hid int3400_thermal acpi_pad asus_wireless sparse_keymap acpi_thermal_rel evdev input_leds sch_fq_codel fuse dm_mod loop configfs efivarfs dmi_ sysfs ip_tables x_tables ipv6 crc_ccitt autofs4 xhci_pci xhci_pci_renesas crc32_pclmul crc32c_intel polyval_clmulni polyval_generic gf128mul ghash_clmulni_intel sha512_ssse3 aesni_intel xhci_hcd crypto _simd atkbd cryptd vivaldi_fmap serio_raw vmd i915 nouveau drm_ttm_helper mxm_wmi drm_buddy i2c_algo_bit drm_display_helper drm_kms_helper cec ucsi_acpi typec_ucsi roles ttm typec video i2c_hid_acpi i2 c_hid wmi drm [...] May 15 19:14:19 uranus kernel: [29740.638031] mt7921e 0000:02:00.0: Message 00020003 (seq 1) timeout May 15 19:14:20 uranus kernel: [29741.658014] mt7921e 0000:02:00.0: driver own failed May 15 19:14:22 uranus kernel: [29742.932000] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:23 uranus kernel: [29743.951987] mt7921e 0000:02:00.0: driver own failed May 15 19:14:24 uranus kernel: [29745.225973] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:27 uranus kernel: [29748.252941] mt7921e 0000:02:00.0: Message 00020003 (seq 2) timeout May 15 19:14:28 uranus kernel: [29749.272928] mt7921e 0000:02:00.0: driver own failed May 15 19:14:29 uranus kernel: [29750.547910] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:30 uranus kernel: [29751.567898] mt7921e 0000:02:00.0: driver own failed May 15 19:14:32 uranus kernel: [29752.841885] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:35 uranus kernel: [29755.868852] mt7921e 0000:02:00.0: Message 00020003 (seq 3) timeout May 15 19:14:38 uranus kernel: [29758.876818] mt7921e 0000:02:00.0: Message 00020003 (seq 4) timeout May 15 19:14:39 uranus kernel: [29759.896804] mt7921e 0000:02:00.0: driver own failed May 15 19:14:40 uranus kernel: [29761.171789] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:41 uranus kernel: [29762.191776] mt7921e 0000:02:00.0: driver own failed May 15 19:14:42 uranus kernel: [29763.466764] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:45 uranus kernel: [29766.492732] mt7921e 0000:02:00.0: Message 00020003 (seq 5) timeout May 15 19:14:46 uranus kernel: [29767.512716] mt7921e 0000:02:00.0: driver own failed May 15 19:14:47 uranus kernel: [29768.787703] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:47 uranus kernel: [29768.787714] mt7921e 0000:02:00.0: chip reset failed May 15 19:14:47 uranus kernel: [29768.787721] wlp2s0: Driver requested disconnection from AP 00:00:00:00:00:00 May 15 19:14:51 uranus kernel: [29771.804669] mt7921e 0000:02:00.0: Message 00020001 (seq 6) timeout May 15 19:14:52 uranus kernel: [29772.824657] mt7921e 0000:02:00.0: driver own failed May 15 19:14:53 uranus kernel: [29774.099641] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:54 uranus kernel: [29775.119629] mt7921e 0000:02:00.0: driver own failed May 15 19:14:55 uranus kernel: [29776.393615] mt7921e 0000:02:00.0: Timeout for driver own May 15 19:14:56 uranus kernel: [29777.413605] mt7921e 0000:02:00.0: driver own failed Those last messages repeat forever.
CC: (none) => boulshet
Actually a very similar issue was found upstream mid 2022 : https://github.com/openwrt/mt76/issues/681 If i read it correctly, a fix has been merged and is included in mageia kernel: $ md5sum /usr/src/kernel-*/drivers/net/wireless/mediatek/mt76/mt7921/pci_mac.c 26094e03661ca3d400edda9b99f1ff13 /usr/src/kernel-6.3.2-3.mga9/drivers/net/wireless/mediatek/mt76/mt7921/pci_mac.c 26094e03661ca3d400edda9b99f1ff13 /usr/src/kernel-linus-6.3.2-1.mga9/drivers/net/wireless/mediatek/mt76/mt7921/pci_mac.c It looks like i should report it upstream. Do you think of additional checks to be made before ? Regards.
That reference "mt7921e driver crash in kernel 5.18" looks exactly right, but refers to kernels 5.xx. Would the fact that our kernels are 6.x matter? CC'ing tmb. The topic ends ends: ---------------------------------- objelf commented on Sep 15, 2022 @Lasoloz @LoreleiPenn I got help from the wireless maintainer and pushed the fixup patches to the Linus tree in [1]. I will backport the patch to k5.18 and k5.19 sooner to stop the kernel hang then the distribution will get the benefit from the newer stable kernel. [1]https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/drivers/net/wireless/mediatek/mt76?id=fa3fbe64037839f448dc569212bafc5a495d8219 dhewg commented on Nov 13, 2022 It sounds like this is resolved and can be closed?  kaienkira closed this as completed on Feb 1 --------------------------------------------- So is this a new problem? Can you say the origin of your Mageia 9/Cauldron system? Was the problem evident from the start, or has it appeared following a recent update? If the latter, can you try an earlier kernel (normally 3 in all are kept)?
CC: (none) => lewyssmith, tmb
My system is a mageia cauldron from years, updated daily. Wifi has been workin smoothly in the past. It is possible that the issue started around mid 2022 when the linked discussion was held. At first, i was hopping that it would be just a transient glitch and started using the ethernet interface as a workaround since then. So reverting to a not so older kernel would not help. Thanks
(In reply to GG HH from comment #3) > Wifi has been working smoothly in the past. > It is possible that the issue started around mid 2022 when the linked > discussion was held. Makes the link more relevant. > At first, i was hoping that it would be just a transient glitch and started > using the ethernet interface as a workaround since then. Know what you mean... > So reverting to a not so older kernel would not help. Point taken. Assigning to kernel/drivers.
Assignee: bugsquad => kernelCC: lewyssmith, tmb => (none)
here it is : https://github.com/openwrt/mt76/issues/681 the fix is the last post of the discussion is the one you were refering to in comment 2 thanks
I gave a a try to wifi again from 1 week. Till now it goes smooth even under heavy wifi load while crashes happened several times a day in the past. So, closing (as resolved/fixed event if not explicit fix seems related) I will reopen if needed. $ uname -a Linux uranus 6.3.9-desktop-2.mga9 #1 SMP PREEMPT_DYNAMIC Fri Jun 23 07:46:59 UTC 2023 x86_64 GNU/Linux [09/07/2023 11:40:19 - jerome@uranus ~] - 0 $ rpm -q -a |grep kernel kernel-desktop-6.3.9-2.mga9 kernel-desktop-devel-6.3.9-2.mga9 kernel-desktop-devel-latest-6.3.9-2.mga9 kernel-desktop-latest-6.3.9-2.mga9 kernel-userspace-headers-6.3.9-2.mga9 kernel-firmware-nonfree-20230625-1.mga9.nonfree kernel-firmware-20230625-1.mga9
Resolution: (none) => FIXEDStatus: NEW => RESOLVED
Unfortunately it just crashed again
Resolution: FIXED => (none)Status: RESOLVED => REOPENED
does kernel 6.4.3-1 work any better ?
I saw > - wifi: mt76: mt7921e: fix init command fail with enabled device in the changelog. So, I hope. For now, I have just installed it. It will need some time to proove good. I will update the ticket. thanks
unfortunately it crashed again in the same way. i just noticed that bluetooth is now enabled - it was not in the past.
Have you tried the linus kernels? Also, there are now desktop and server kernels 6.6.33 in testing repos - not ready for QA yet but i use that -desktop.
CC: (none) => fri
No i did not. I will and get back here to report. Thanks
i tried the current linus kernel and it failed shortly and the same way. > Also, there are now desktop and server kernels 6.6.33 in testing repos - not ready for QA yet but i use that -desktop. Where could i find them ? they are not in those directories at least on the mmirror i use backports/ backports_testing/ release/ updates/ updates_testing/ thanks