Bug 32837 - Hard freeze with kernel-server-6.6.16-1.mga9
Status: RESOLVED FIXED
Alias: None
Product: Mageia
Classification: Unclassified
Component: RPM Packages
Version: 9
Hardware: All Linux
Priority: Normal normal
Target Milestone: ---
Assignee: Giuseppe Ghibò
Reported: 2024-02-12 17:11 CET by Dave Hodgins
Modified: 2024-02-14 07:42 CET
Source RPM: kernel-6.6.16-1.mga9.src.rpm

Description Dave Hodgins 2024-02-12 17:11:21 CET
After installing kernel-server-6.6.16-1.mga9 and booting into it, everything
proceeded normally, including logging in and starting a few applications, until
roughly 2 minutes after the boot started, at which point the system froze.

No response to the keyboard. Unable to connect via ssh from another system on my LAN.

The only entry in the journal that appears relevant is:
Feb 12 09:06:56 kernel: EEVDF scheduling fail, picking leftmost

That error message is repeated 16 times.

The system is working normally after rebooting into 6.6.14-server-2.mga9.
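
For anyone triaging a similar freeze, here is a minimal sketch of how the repeated scheduler message could be counted in the log of the boot that froze. It assumes the journal is persistent, so that the frozen boot's kernel log survives the reset. The function name count_eevdf_warnings is hypothetical and not part of any tool.

```shell
#!/bin/sh
# Count log lines containing the scheduler warning quoted in this report.
# Reads log text on stdin and prints the number of matching lines.
count_eevdf_warnings() {
    # grep -c prints the count; it exits 1 when the count is 0, so mask that.
    grep -c 'EEVDF scheduling fail' || true
}

# Typical use after rebooting into a working kernel
# (assumption: persistent journal, so "-b -1" is the boot that froze):
#   journalctl -k -b -1 --no-pager | count_eevdf_warnings
```

A count of zero for the previous boot would suggest the log was not flushed to disk before the freeze, not that the message never appeared.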
Dave Hodgins 2024-02-12 17:11:37 CET

CC: (none) => kernel

Comment 1 Dave Hodgins 2024-02-12 17:29:02 CET
]# inxi -v 3
System:
  Host: x3.hodgins.homeip.net Kernel: 6.6.14-server-2.mga9 arch: x86_64
    bits: 64 compiler: gcc v: 12.3.0 Desktop: KDE v: 4 Distro: Mageia 9
Machine:
  Type: Desktop Mobo: ASUSTeK model: SABERTOOTH 990FX v: Rev 1.xx
    serial: 110394070001417 BIOS: American Megatrends v: 1604 date: 10/16/2012
Battery:
  Device-1: hidpp_battery_0 model: Logitech K520 charge: 0%
    status: discharging
  Device-2: hidpp_battery_1 model: Logitech Wireless Mouse charge: 0%
    status: discharging
CPU:
  Info: quad core model: AMD FX-4170 bits: 64 type: MT MCP arch: Bulldozer
    rev: 2 cache: L1: 192 KiB L2: 4 MiB L3: 8 MiB
  Speed (MHz): avg: 2100 high: 4200 min/max: 1400/4200 boost: enabled cores:
    1: 4200 2: 1400 3: 1400 4: 1400 bogomips: 33710
  Flags: avx ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: AMD Cedar [Radeon HD 5000/6000/7350/8350 Series] vendor: ASUSTeK
    driver: radeon v: kernel arch: TeraScale-2 bus-ID: 05:00.0 temp: 64.0 C
  Display: server: X.org v: 1.21.1.8 with: Xwayland v: 22.1.9 driver: X:
    loaded: radeon unloaded: fbdev,modesetting,vesa dri: r600 gpu: radeon
    resolution: 1920x1080~60Hz
  API: OpenGL v: 4.5 Mesa 23.3.5 renderer: AMD CEDAR (DRM 2.50.0 /
    6.6.14-server-2.mga9 LLVM 15.0.6) direct-render: Yes
Network:
  Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: TP-LINK TG-3468 driver: r8169 v: kernel port: a000 bus-ID: 07:00.0
  IF: eth0 state: up speed: 1000 Mbps duplex: full mac: 10:fe:ed:03:8d:b3
Drives:
  Local Storage: total: 3.11 TiB used: 396.35 GiB (12.5%)
Info:
  Processes: 261 Uptime: 2h 15m Memory: 15.58 GiB used: 5.01 GiB (32.2%)
  Init: systemd target: multi-user (3) Compilers: gcc: 12.3.0 clang: 15.0.6
  Packages: N/A note: see --rpm Shell: Bash v: 5.2.15 inxi: 3.3.26
Comment 2 katnatek 2024-02-12 20:29:48 CET
In the working kernel, as root, rename /etc/X11/xorg.conf to /etc/X11/xorg.conf.works and reboot into the problematic kernel, or add noxconf to the kernel options in the GRUB menu.

If that works, compare the inxi output; perhaps it is forcing a different graphics driver.
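
The suggestion above could be sketched as two small shell helpers. This is only an illustration: the function names and the report file names are hypothetical, and `inxi -G` (graphics section only) is used here as an assumption to keep the comparison free of unrelated differences such as uptime.

```shell
#!/bin/sh
# Move an X configuration file aside so that X starts without it.
# $1 is the config path; it defaults to the path named in the comment above.
set_aside_xorg_conf() {
    conf="${1:-/etc/X11/xorg.conf}"
    if [ -f "$conf" ]; then
        mv -- "$conf" "$conf.works" && echo "moved $conf to $conf.works"
    else
        echo "no $conf present, nothing to do"
    fi
}

# Print the differences between two saved inxi reports.
# diff exits 1 when the files differ, which is the expected case here.
compare_inxi() {
    diff -- "$1" "$2" || true
}

# Usage (hypothetical file names), as root:
#   inxi -G > /root/inxi-working.txt     # in the working kernel
#   set_aside_xorg_conf
#   ...reboot into the problematic kernel...
#   inxi -G > /root/inxi-problem.txt
#   compare_inxi /root/inxi-working.txt /root/inxi-problem.txt
```

If the "driver:" values differ between the two reports, that would point at the X configuration and not at the kernel itself.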
Comment 3 Morgan Leijström 2024-02-12 22:12:02 CET
FWIW, kernel-desktop 6.6.16 is OK for me after an hour of use in Plasma, including suspend-resume.
Intel i7-870, P55 chipset, nvidia545

CC: (none) => fri

Comment 4 Giuseppe Ghibò 2024-02-12 22:20:42 CET
(In reply to Dave Hodgins from comment #0)

> After installing kernel-server-6.6.16-1.mga9 and booting into it, everything
> proceeded normally, including logging in and starting a few applications until
> roughly 2 minutes after the boot started at which point the system froze.
> 
> No response to keyboard. Unable to connect via ssh from another system on my
> lan.
> 
> Only entry that appears relevant from the journal is
> Feb 12 09:06:56 kernel: EEVDF scheduling fail, picking leftmost
> 

OK, I got something similar, with -server only. I pushed a 6.6.16-2.mga9 build, which is a little better, but not by much. Let's see.
Comment 5 Dave Hodgins 2024-02-12 22:49:11 CET
(In reply to katnatek from comment #2)
> In the working kernel, as root, rename /etc/X11/xorg.conf to
> /etc/X11/xorg.conf.works and reboot into the problematic kernel, or add
> noxconf to the kernel options in the GRUB menu.
> 
> If that works, compare the inxi output; perhaps it is forcing a different
> graphics driver.

It too locks up. I don't have an xorg.conf file, and haven't had one for years.
I normally boot to run level 3 and then run startx.

For this test, I booted to run level 3, logged in as root and let the system
run htop for about 10 minutes. After that, I logged out, logged in as my
regular user and ran startx. The system froze while playing the KDE startup
sound, with one tone repeating over and over until I pressed the reset
button, after confirming that the magic SysRq keys had no effect.

I did capture the inxi output before running htop.
System:
  Host: x3.hodgins.homeip.net Kernel: 6.6.16-server-1.mga9 arch: x86_64 bits: 64 compiler: gcc
    v: 12.3.0 Console: tty 1 Distro: Mageia 9
Machine:
  Type: Desktop Mobo: ASUSTeK model: SABERTOOTH 990FX v: Rev 1.xx serial: 110394070001417
    BIOS: American Megatrends v: 1604 date: 10/16/2012
CPU:
  Info: quad core model: AMD FX-4170 bits: 64 type: MT MCP arch: Bulldozer rev: 2 cache:
    L1: 192 KiB L2: 4 MiB L3: 8 MiB
  Speed (MHz): avg: 1400 min/max: 1400/4200 boost: enabled cores: 1: 1400 2: 1400 3: 1400
    4: 1400 bogomips: 33710
  Flags: avx ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: AMD Cedar [Radeon HD 5000/6000/7350/8350 Series] vendor: ASUSTeK driver: radeon
    v: kernel arch: TeraScale-2 bus-ID: 05:00.0 temp: 67.5 C
  Display: server: X.org v: 1.21.1.8 with: Xwayland v: 22.1.9 driver: X: loaded: radeon
    unloaded: fbdev,modesetting,vesa dri: r600 gpu: radeon tty: 240x67 resolution: 1920x1080
  API: OpenGL Message: GL data unavailable in console for root.
Network:
  Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: TP-LINK TG-3468
    driver: r8169 v: kernel port: a000 bus-ID: 07:00.0
  IF: eth0 state: up speed: 1000 Mbps duplex: full mac: 10:fe:ed:03:8d:b3
Drives:
  Local Storage: total: 3.11 TiB used: 396.34 GiB (12.5%)
Info:
  Processes: 191 Uptime: 0m Memory: 15.58 GiB used: 658.6 MiB (4.1%) Init: systemd
  target: multi-user (3) Compilers: gcc: 12.3.0 clang: 15.0.6 Packages: N/A note: see --rpm
  Shell: Bash v: 5.2.15 inxi: 3.3.26

I'll try 6.6.16-2 when it shows up.
Comment 6 Giuseppe Ghibò 2024-02-12 23:53:41 CET
(In reply to Dave Hodgins from comment #5)

> I'll try 6.6.16-2 when it shows up.

6.6.16-3 should be the good one.
Comment 7 Dave Hodgins 2024-02-14 07:42:11 CET
I've been running 6.6.16-server-3.mga9 for around 2 hours now. No lockups.
Thanks.

As the kernel update will be in another bug report, closing this as fixed.

Resolution: (none) => FIXED
Status: NEW => RESOLVED

