Bug 33578 - Update request: nvidia-newfeature-560.35.03-1.mga9.nonfree
Summary: Update request: nvidia-newfeature-560.35.03-1.mga9.nonfree
Status: NEW
Alias: None
Product: Mageia
Classification: Unclassified
Component: RPM Packages (show other bugs)
Version: 9
Hardware: All Linux
Priority: Normal normal
Target Milestone: ---
Assignee: QA Team
QA Contact:
URL:
Whiteboard:
Keywords: advisory
Depends on: 33571
Blocks:
  Show dependency treegraph
 
Reported: 2024-09-20 16:55 CEST by Giuseppe Ghibò
Modified: 2024-09-27 20:21 CEST (History)
1 user (show)

See Also:
Source RPM: nvidia-newfeature-560.35.03-1.mga9.nonfree
CVE:
Status comment:


Attachments
files list (742 bytes, text/plain)
2024-09-22 16:24 CEST, Giuseppe Ghibò
Details
journal with autologin disabled (29.03 KB, application/gzip)
2024-09-27 02:07 CEST, Thomas Andrews
Details

Description Giuseppe Ghibò 2024-09-20 16:55:50 CEST
This is a new and bug release in the 560.xx series. Bug fixes:

https://www.nvidia.com/en-us/drivers/details/230918/
Comment 1 Giuseppe Ghibò 2024-09-20 17:00:22 CEST
(In reply to Giuseppe Ghibò from comment #0)

> This is a new and bug release in the 560.xx series. Bug fixes:

s/bug release/bug fix release/
Comment 2 Giuseppe Ghibò 2024-09-22 16:24:28 CEST
Created attachment 14675 [details]
files list

files list
katnatek 2024-09-22 19:46:33 CEST

Keywords: (none) => advisory

Comment 3 Thomas Andrews 2024-09-23 20:15:24 CEST
MGA9-64 Plasma, i5-7500, Nvidia Quadro K620 graphics, server and desktop kernels.

Updated nvidia-current first, and that was OK with both kernels, but...

When I attempted to use MCC to switch to the newfeature driver, I get a message that x11-driver-video-nvidia-newfeature cannot be installed due to unsatisfied libgbm.so.1.

So I didn't get very far.

CC: (none) => andrewsfarm

Comment 4 katnatek 2024-09-23 20:36:11 CEST
(In reply to Thomas Andrews from comment #3)
> MGA9-64 Plasma, i5-7500, Nvidia Quadro K620 graphics, server and desktop
> kernels.
> 
> Updated nvidia-current first, and that was OK with both kernels, but...
> 
> When I attempted to use MCC to switch to the newfeature driver, I get a
> message that x11-driver-video-nvidia-newfeature cannot be installed due to
> unsatisfied libgbm.so.1.
> 
> So I didn't get very far.

Looks like this need mesa packages in testing

Depends on: (none) => 33571

Comment 5 Thomas Andrews 2024-09-23 20:47:33 CEST
I see that as a possibility, after a bit of investigation.

But should it be needing that specific libgbm.so.1? The current tainted version of the package is installed, and the nvidia-current candidate was OK with it.
Comment 6 Giuseppe Ghibò 2024-09-24 00:37:58 CEST
that libgbm.so.1 required should be in the 32bit repo (either core or tainted).
Comment 7 Thomas Andrews 2024-09-24 02:29:25 CEST
So in order to install the newfeature driver on a 64-bit system, you have to activate the 32-bit repos? It can't use the libgbm.so.1 that's a file in the lib64gbm1 (tainted or core) package?

That doesn't sound right.
Comment 8 Thomas Andrews 2024-09-24 02:47:45 CEST
I updated to the 64-bit core mesa packages, and tried installing the newfeature driver again without activating the 32-bit repos. Now it says it needs libdrm.so.2. There's one of them in the lib64drm2 package, installed in /usr/lib64, as is libgbm.so.1.

Could it be looking for these libraries in the wrong spot?
Comment 9 Giuseppe Ghibò 2024-09-24 09:56:03 CEST
Have to figure out why it triggered a new dep of a 32bit library, at least for the non-full package, and eventually filter out the dependency from the main package, also because *-newfeature would be, at some point, the next *-current series. In the meanwhile see if adding 32bit repos would fix the problem.
Comment 10 Thomas Andrews 2024-09-24 17:56:57 CEST
After adding the 32-bit repos, XFdrake downloaded some 32-bit packages, but it happened too fast to see what they were. The newfeature driver installed successfully, and looks OK on a quick check after the reboot, except that autologin was again disabled, like it was the last time we tried a newfeature driver.
Comment 11 Giuseppe Ghibò 2024-09-24 18:05:57 CEST
It's weird the autologin missed. Maybe crashed some process. We might try to collect some core, enabling core generation with:

sysctl -w kernel.core_uses_pid=0
sysctl -w kernel.core_pattern="/var/tmp/%e_%t_%s_uid%u.core"

[this is volatile, so next boot is disabled]

then try to see whether with autologin, some core, in /var/tmp/ is generated.
Comment 12 Giuseppe Ghibò 2024-09-24 18:10:31 CEST
(In reply to Thomas Andrews from comment #10)

> it happened too fast to see what they were. The newfeature driver installed

journalctl -b | grep -e \\[RPM\\]

should show what happening on package installing (but I'm pretty sure there is some other "proper" journalctl option to see that info in a more fashioned way...).
Comment 13 Giuseppe Ghibò 2024-09-24 18:57:34 CEST
(In reply to Giuseppe Ghibò from comment #11)
> It's weird the autologin missed. Maybe crashed some process. We might try to
> collect some core, enabling core generation with:
> 
> sysctl -w kernel.core_uses_pid=0
> sysctl -w kernel.core_pattern="/var/tmp/%e_%t_%s_uid%u.core"
> 
> [this is volatile, so next boot is disabled]
> 
> then try to see whether with autologin, some core, in /var/tmp/ is generated.

Oops, I forgot that at the autologin you won't have the console prompt where to type such commands, so we have to find another way...
Comment 14 Thomas Andrews 2024-09-27 02:07:05 CEST
Created attachment 14682 [details]
journal with autologin disabled

For what it's worth, a journal created within a few minutes of booting with autologin having been disabled
Comment 15 Giuseppe Ghibò 2024-09-27 20:21:17 CEST
(In reply to Giuseppe Ghibò from comment #11)
> It's weird the autologin missed. Maybe crashed some process. We might try to
> collect some core, enabling core generation with:
> 
> sysctl -w kernel.core_uses_pid=0
> sysctl -w kernel.core_pattern="/var/tmp/%e_%t_%s_uid%u.core"
> 
> [this is volatile, so next boot is disabled]
> 
> then try to see whether with autologin, some core, in /var/tmp/ is generated.

While digging into this I found a sort of bug in our systemd package that made our core dump experience "incomplete". We had this line:

https://svnweb.mageia.org/packages/updates/9/systemd/current/SPECS/systemd.spec?revision=2087162&view=markup#l405

(ditto for cauldron) which we disabled proper systemd core dumps at some point (probably because at that time that feature was incomplete in systemd) and then remained as is our next releases. This feature has been perfectioned in systemd upstream so I think we should re-enable it in next systemd builds.

Anyway, for now to get coredumps just add a file /etc/sysctl.d/99-coredump.conf containing the lines:

kernel.core_pattern=|/usr/lib/systemd/systemd-coredump %P %u %g %s %t %c %h
kernel.core_pipe_limit=16
fs.suid_dumpable=2

then reboot. If some program crashes it creates a core, and you can catch with

coredumpctl list

Note You need to log in before you can comment on or make changes to this bug.