Bug 24844

Summary: Installation of the nvidia kernel module hangs until os-prober is killed
Product: Mageia Reporter: Heiko Stark <heiko>
Component: RPM Packages Assignee: Kernel and Drivers maintainers <kernel>
Status: RESOLVED OLD QA Contact:
Severity: normal    
Priority: Normal CC: marja11
Version: 7   
Target Milestone: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Source RPM: util-linux? CVE:
Status comment:
Attachments: Ksysguard screenshot
journalctl >log.txt
today log file

Description Heiko Stark 2019-05-20 08:30:33 CEST
Description of problem:

Installation of the nvidia driver hangs:

nvidia-current (430.14-1.mga7.nonfree): Installing module.
...........................
.......
Creating: target|kernel|dracut args|basicmodules 

See also the attached screenshot. The process that blocks appears to be blkid.

Version-Release number of selected component (if applicable):

430.14-1.mga7.nonfree

How reproducible:

Install latest updates.

How to fix:

Killing os-prober:

nvidia-current (430.14-1.mga7.nonfree): Installing module.
...........................
.......
Creating: target|kernel|dracut args|basicmodules 
update-grub2 failed: Generating grub configuration file ...
Found theme: /boot/grub2/themes/maggy/theme.txt
Found linux image: /boot/vmlinuz-5.1.3-server-4.mga7
Found initrd image: /boot/initrd-5.1.3-server-4.mga7.img
Found linux image: /boot/vmlinuz-5.1.2-server-2.mga7
Found initrd image: /boot/initrd-5.1.2-server-2.mga7.img
Found linux image: /boot/vmlinuz-5.1.1-server-1.mga7
Found initrd image: /boot/initrd-5.1.1-server-1.mga7.img
Found linux image: /boot/vmlinuz-server
Found initrd image: /boot/initrd-server.img
May 20 08:18:26 | DM multipath kernel driver not loaded
/usr/sbin/grub2-mkconfig: line 261:  5936 Killed                "$i"
Comment 1 Heiko Stark 2019-05-20 08:31:58 CEST
Created attachment 11004 [details]
Ksysguard screenshot
Comment 2 Marja Van Waes 2019-05-21 11:37:24 CEST
Please attach log.txt that is the result of running, as root:

   journalctl -a --since="YYYY-MM-DD hh:mm" --until="YYYY-MM-DD hh:mm" > log.txt

and adjust the --since time to:
    right before you started to install nvidia-current-430.14-1.mga7.nonfree 
and the --until time to:
    shortly after you succeeded in installing it after killing os-prober.
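
As a rough sketch, the command above could be wrapped in a small helper so that the two timestamps are passed as arguments. The function name collect_journal and the default output file are only illustrative, not part of any Mageia tool:

```shell
# Hypothetical helper: save the journal between two timestamps to a file.
# Run as root so that system-wide messages are included.
collect_journal() {
    start=$1                 # e.g. "2019-05-20 08:00"
    end=$2                   # e.g. "2019-05-20 08:40"
    out=${3:-log.txt}        # output file, defaults to log.txt
    journalctl -a --since="$start" --until="$end" > "$out"
}
```

For example: collect_journal "2019-05-20 08:00" "2019-05-20 08:40" (the times here are placeholders; use the window around your own installation attempt).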

CC: (none) => marja11
Keywords: (none) => NEEDINFO
Summary: Installation of the nvidia driver hangs => Installation of the nvidia driver hangs until os-prober is killed
Source RPM: (none) => util-linux? os-prober?

Comment 3 Heiko Stark 2019-05-21 12:12:28 CEST
Created attachment 11016 [details]
journalctl >log.txt

journalctl -a --since="2019-05-20 06:00" --until="2019-05-20 18:00" -q >log.txt

The plasmashell errors have been removed from the log.
Comment 4 Heiko Stark 2019-05-21 14:27:58 CEST
Created attachment 11018 [details]
today log file
Comment 5 Marja Van Waes 2019-05-24 17:11:34 CEST
Thanks for the logs. There is nothing about installing dkms-nvidia-current in them, so it appears I had misunderstood your bug report (I should have known, because "dkms" is part of the package name).

(In reply to Heiko Stark from comment #4)
> Created attachment 11018 [details]
> today log file

There were 11 kernel Call Traces:
The first one was related to nouveau.
The other ten each came right after "kernel: INFO: task blkid:30204 blocked for more than X¹) seconds."

¹) the number of seconds, X, increases from 122 to 1228
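
For anyone who wants to pull these warnings out of a saved log, here is a minimal sketch. It assumes the log lines have the form quoted above; the function name hung_tasks is made up for illustration:

```shell
# Print "<task>:<pid> <seconds>" for every hung-task warning in a log file.
# Assumes lines such as:
#   kernel: INFO: task blkid:30204 blocked for more than 122 seconds.
hung_tasks() {
    sed -n 's/.*INFO: task \([^ ]*\) blocked for more than \([0-9][0-9]*\) seconds.*/\1 \2/p' "$1"
}
```

Running hung_tasks log.txt should then list one line per warning, which makes it easy to see whether it is always the same blkid process that stays blocked.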

About nvidia, the first lines came right after the nouveau Call Trace:

(I removed "Mai 21 12:35:05 localhost.localdomain kernel: " from the beginning of the lines)

kernel: ---[ end trace f033c253613f6568 ]---
kernel: nouveau 0000:02:00.0: secboot: error during falcon reset: -110
kernel: nouveau 0000:02:00.0: gr: init failed, -110
kernel: nouveau 0000:02:00.0: DRM: allocated 1680x1050 fb: 0x200000, bo (____ptrval____)
kernel: fbcon: nouveaufb (fb0) is primary device
kernel: Console: switching to colour frame buffer device 210x65
kernel: nouveau 0000:02:00.0: fb0: nouveaufb frame buffer device
kernel: [drm] Initialized nouveau 1.3.1 20120801 for 0000:02:00.0 on minor 0
kernel: IPMI message handler: version 39.2
kernel: nvidia: loading out-of-tree module taints kernel.
kernel: nvidia: module license 'NVIDIA' taints kernel.
kernel: Disabling lock debugging due to kernel taint
kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 245
kernel: NVRM: The NVIDIA probe routine was not called for 1 device(s).
kernel: NVRM: This can occur when a driver such as: 
        NVRM: nouveau, rivafb, nvidiafb or rivatv 
        NVRM: was loaded and obtained ownership of the NVIDIA device(s).
kernel: NVRM: Try unloading the conflicting kernel module (and/or
        NVRM: reconfigure your kernel without the conflicting
        NVRM: driver(s)), then try loading the NVIDIA kernel module
        NVRM: again.
kernel: NVRM: No NVIDIA devices probed.
kernel: nvidia-nvlink: Unregistered the Nvlink Core, major device number 245

I don't understand how the blkid issue can be related, nor why nouveau was tried if you don't want to use it.
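
The NVRM message in the log says that another driver, such as nouveau, may have claimed the card first. A minimal sketch for checking whether a given module is currently loaded, assuming the usual /proc/modules layout where each line starts with the module name followed by a space (the helper name module_loaded is hypothetical):

```shell
# Return success if the named kernel module appears in the modules list.
# $1 = module name, $2 = modules list (defaults to /proc/modules)
module_loaded() {
    grep -q "^$1 " "${2:-/proc/modules}"
}
```

For example: module_loaded nouveau && echo "nouveau is loaded"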

Anyway, assigning to our kernel and drivers maintainers, who can provide better help.

Keywords: NEEDINFO => (none)
Source RPM: util-linux? os-prober? => util-linux?
Assignee: bugsquad => kernel
Summary: Installation of the nvidia driver hangs until os-prober is killed => Installation of the nvidia kernel module hangs until os-prober is killed

Comment 6 Aurelien Oudelet 2021-07-06 13:15:31 CEST
Mageia 7 has been EOL since July 1st 2021.
There will be no further bugfixes for this release.

You are encouraged to upgrade to Mageia 8 as soon as possible.

@reporter, if this bug still applies to Mageia 8, please let us know.

@packager, if you work on the Mageia 7 version of your package, please check whether the issue is also present in the Mageia 8 package. If it is, please fix the Mageia 8 version instead.

This bug report will be closed as OLD if there is no further notice by September 1st 2021.
Comment 7 Marja Van Waes 2021-09-07 14:09:37 CEST
Hi bug reporter and hi assignee and others involved,

Please reopen this bug report if it is still valid for Mageia 8 or 9 (cauldron), and change "Version:" in the upper left of this report accordingly.

This report is being closed as OLD because it was filed against Mageia 7, for which support ended on June 30th 2021.

Thanks,
Marja

Status: NEW => RESOLVED
Resolution: (none) => OLD