On Thursday 2024-06-20 13:39, Stephan Hemeier via openSUSE Users wrote:
Date: Thu, 20 Jun 2024 13:39:19 From: Stephan Hemeier via openSUSE Users <users@lists.opensuse.org> Reply-To: Stephan Hemeier <Sauerlandlinux@gmx.de> To: users@lists.opensuse.org Subject: Re: troubleshooting nvidia
Am Donnerstag, 20. Juni 2024, 13:25:38 CEST schrieb Paul Neuwirth via openSUSE Users:
On Thursday 2024-06-20 12:57, Masaru Nomiya wrote:
Date: Thu, 20 Jun 2024 12:57:32 From: Masaru Nomiya <nomiya@lake.dti.ne.jp> Reply-To: m.nomiya+suse@gmail.com To: users@lists.opensuse.org Subject: Re: troubleshooting nvidia
Hello,
Sorry, by mistake, I sent a direct mail.
In the Message;
Subject : Re: troubleshooting nvidia Message-ID : <alpine.LSU.2.21.2406201147400.3177@alpha.swabian.net> Date & Time: Thu, 20 Jun 2024 11:52:44 +0200 (CEST)
[PN] == Paul Neuwirth via openSUSE Users <users@lists.opensuse.org> has written:
PN> On Thursday 2024-06-20 11:23, Paul Neuwirth via openSUSE Users wrote:
PN> > Date: Thu, 20 Jun 2024 11:23:18 PN> > From: Paul Neuwirth via openSUSE Users <users@lists.opensuse.org> PN> > Reply-To: Paul Neuwirth <mail@paul-neuwirth.nl> PN> > To: Stephan Hemeier <Sauerlandlinux@gmx.de> PN> > Cc: users@lists.opensuse.org PN> > Subject: Re: troubleshooting nvidia [...] PN> some hints maybe. Reinstalled the nvidia drivers (`zypper rm [any nvidia PN> packages]`, then `zypper inr --repo NVIDIA:repo-non-free`) and noticed some PN> suspicious looking lines while building the modules and during dracut: PN> # depmod: ERROR: fstatat(5, nvidia-drm.ko): No such file or directory PN> # depmod: ERROR: fstatat(5, nvidia-modeset.ko): No such file or directory PN> # depmod: ERROR: fstatat(5, nvidia-uvm.ko): No such file or directory PN> # depmod: ERROR: fstatat(5, nvidia.ko): No such file or directory PN> # depmod: ERROR: fstatat(5, nvidia-drm.ko): No such file or directory PN> # depmod: ERROR: fstatat(5, nvidia-modeset.ko): No such file or directory PN> # depmod: ERROR: fstatat(5, nvidia-uvm.ko): No such file or directory PN> # depmod: ERROR: fstatat(5, nvidia.ko): No such file or directory PN> # depmod: WARNING: could not open modules.order at /lib/modules/5.14.21-150500.53-default: No such file or directory PN> # depmod: WARNING: could not open modules.builtin at /lib/modules/5.14.21-150500.53-default: No such file or directory PN> # dracut-install: Failed to find module 'nvidia_drm' PN> # dracut: FAILED: /usr/lib/dracut/dracut-install -D /var/tmp/dracut.T96nUl/initramfs -N i2o_scsi --kerneldir /lib/modules/6.8.8-lp155.9-default/ -m nvidia nvidia_drm nvidia-modeset nvidia-uvm
PN> also noticed, that `modprobe nvidia` doesn't find the module (don't know if PN> that's normal behaviour?): PN> modprobe: ERROR: could not find module by name='nvidia' PN> modprobe: ERROR: could not insert 'nvidia': Unknown symbol in module, or unknown parameter (see dmesg)
I think this might be of interest to you.
https://forums.opensuse.org/t/nvidia-driver-550-90-broken-plus-no-boot-optio...
Best Regards.
Indeed, there are similarities. I meanwhile discovered, that the nvidia modules are not built for the current kernel 6.8.8-lp155.9-default - and it keeps using some files of an older kernel 5.14.21.... and the rpms indeed require that kernel-dev package. I tried to uninstall these old kernel packages (and ran zypper -n purge-kernels), and it suggested downgrade of the nvidia drivers to another repository (obs://build.opensuse.org/home:regataos) - but still the build seems to fail as dracut still cannot find nvidia_drm.
Paul Neuwirth
The nvidia packages for Leap are build against the first kernel of Leap, for Leap 15.5 that is kernel 5.14.21-150500.53.2. So they are working with all new kernels from the SLE Update Repo.
When you install kernel 6.8.8 from somewhere, the nvidia driver will not work anymore because the driver for Leap 15.5 is not using that kernel when building.
For kernel:stable:backports you need to deinstall the nvidia Packages and install the driver by Hand.
Stephan
ok this was misleading then. but (current) kernel is still from official repo. nevertheless tried to reinstall the nvidia drivers from that regataos repo, and build fails in another way: # make[4]: *** [/usr/src/linux-6.8.8-lp155.9/scripts/Makefile.build:244: /usr/src/kernel-modules/nvidia-550.78-default/nvidia/nv-dmabuf.o] Error 1 # cc: error: unrecognized command line option ‘-mharden-sls=all’; did you mean ‘-mhard-float’? # make[4]: *** [/usr/src/linux-6.8.8-lp155.9/scripts/Makefile.build:244: /usr/src/kernel-modules/nvidia-550.78-default/nvidia/nv-nano-timer.o] Error 1 maybe I should try and distribution upgrade to 15.6 - would have to do that anyway sometime later. Paul