[Bug 1215981] New: Black Screen during boot on both internal and external screen in kernel 6.5.4-1 on Thinkpad P16 (Discrete Graphics mode)
https://bugzilla.suse.com/show_bug.cgi?id=1215981

            Bug ID: 1215981
           Summary: Black Screen during boot on both internal and external screen in kernel 6.5.4-1 on Thinkpad P16 (Discrete Graphics mode)
    Classification: openSUSE
           Product: openSUSE Tumbleweed
           Version: Current
          Hardware: Other
                OS: Other
            Status: NEW
          Severity: Normal
          Priority: P5 - None
         Component: Kernel
          Assignee: kernel-bugs@opensuse.org
          Reporter: petr.vorel@suse.com
        QA Contact: qa-bugs@suse.de
  Target Milestone: ---
          Found By: ---
           Blocker: ---

I have a problem similar to #1213693, but on the newer kernel 6.5.4-1, which should contain the fix.

#1213693 was caused by commit ca62297b2085 ("drm/edid: Fix csync detailed mode parsing") in v6.4-rc1, which was fixed by reverting it in 50b6f2c82977 ("Revert "drm/edid: Fix csync detailed mode parsing"") in v6.5-rc7.

In my case I cannot see anything after the kernel has been loaded. I have Tumbleweed kernels 6.5.4-1 and 6.5.2-1.

The problem is on a Thinkpad P16 with 2 GPUs:

00:02.0 VGA compatible controller: Intel Corporation Alder Lake-HX GT1 [UHD Graphics 770] (rev 0c)
01:00.0 VGA compatible controller: NVIDIA Corporation GA107GLM [RTX A1000 Laptop GPU] (rev a1)

The problem occurs in "Discrete Graphics" (Nvidia only) mode. "Hybrid Graphics" (Intel + Nvidia) works, but I need "Discrete Graphics" because it is the only way to get external screens working (the external output is wired only to the Nvidia card): i.e. on Discrete Graphics only the Intel card is being used:

$ drm_info |grep -i node: -A1
Node: /dev/dri/card0
Driver: i915 (Intel Graphics) version 1.6.0 (20201103)

I tested with the internal screen only and with the internal screen + 2 external screens.

I tried disabling plymouth with the rd.plymouth=0 plymouth.enable=0 plymouth=0 cmdline args, tried fbcon=map:1, and also booted to runlevels 1 and 3 instead of the default. None of these helped.
$ rpm -qa |grep -i -e nouveau -e intel -e ^kernel
kernel-firmware-nvidia-gsp-G06-525.116.04-2.1.x86_64
kernel-firmware-nvidia-gspx-G06-535.113.01-1.1.x86_64
kernel-firmware-serial-20230829-1.1.noarch
libdrm_nouveau2-2.4.116-2.1.x86_64
intel-vaapi-driver-2.4.1-5.11.x86_64
kernel-firmware-mwifiex-20230829-1.1.noarch
xf86-video-intel-2.99.917.916_g31486f40-3.6.x86_64
kernel-firmware-platform-20230829-1.1.noarch
kernel-firmware-intel-20230829-1.1.noarch
kernel-firmware-iwlwifi-20230829-1.1.noarch
kernel-firmware-all-20230829-1.1.noarch
intel-media-driver-23.3.3-1.1.x86_64
ucode-intel-20230808-1.1.x86_64
kernel-firmware-nvidia-gsp-G06-535.54.03-1.1.x86_64
kernel-firmware-amdgpu-20230829-1.1.noarch
kernel-firmware-usb-network-20230829-1.1.noarch
kernel-firmware-i915-20230829-1.1.noarch
kernel-macros-6.5.4-1.1.noarch
kernel-firmware-qcom-20230829-1.1.noarch
libvulkan_intel-23.2.0-1699.360.pm.1.x86_64
intel-gpu-tools-1.27.1-2.3.x86_64
kernel-firmware-sound-20230829-1.1.noarch
kernel-firmware-ath10k-20230829-1.1.noarch
libvdpau_nouveau-23.2.0-1699.360.pm.1.x86_64
kernel-firmware-bnx2-20230829-1.1.noarch
Mesa-dri-nouveau-23.2.0-1699.360.pm.1.x86_64
kernel-firmware-dpaa2-20230829-1.1.noarch
kernel-firmware-atheros-20230829-1.1.noarch
kernel-firmware-radeon-20230829-1.1.noarch
kernel-firmware-ueagle-20230829-1.1.noarch
kernel-firmware-brcm-20230829-1.1.noarch
kernel-firmware-chelsio-20230829-1.1.noarch
kernel-firmware-nvidia-20230829-1.1.noarch
kernel-firmware-ti-20230829-1.1.noarch
kernel-firmware-media-20230829-1.1.noarch
kernel-firmware-realtek-20230829-1.1.noarch
kernel-firmware-mellanox-20230829-1.1.noarch
libdrm_intel1-2.4.116-2.1.x86_64
kernel-firmware-network-20230829-1.1.noarch
kernel-firmware-ath11k-20230829-1.1.noarch
kernel-firmware-mediatek-20230829-1.1.noarch
kernel-firmware-bluetooth-20230829-1.1.noarch
kernel-firmware-prestera-20230829-1.1.noarch
kernel-firmware-liquidio-20230829-1.1.noarch
kernel-firmware-marvell-20230829-1.1.noarch
kernel-default-6.5.2-1.1.x86_64
kernel-firmware-nfp-20230829-1.1.noarch
kernel-default-devel-6.5.2-1.1.x86_64
kernel-devel-6.5.4-1.1.noarch
kernel-firmware-qlogic-20230829-1.1.noarch
kernel-default-devel-6.5.4-1.1.x86_64
kernel-default-6.5.4-1.1.x86_64
kernel-devel-6.5.2-1.1.noarch

$ lsmod |grep -i -e i915 -e nvidia -e nouveau
nvidia_drm             94208  0
nvidia_modeset       1794048  1 nvidia_drm
nvidia_uvm           3608576  0
i915                 4087808  5
drm_buddy              20480  1 i915
i2c_algo_bit           20480  1 i915
drm_display_helper    237568  1 i915
ttm                   102400  1 i915
cec                    90112  2 drm_display_helper,i915
nvidia               8843264  2 nvidia_uvm,nvidia_modeset
video                  77824  3 thinkpad_acpi,i915,nvidia_modeset

$ modinfo nvidia |grep -i version
version:        535.113.01
srcversion:     81566B70A70B0B19F40FD1A
vermagic:       6.5.4-1-default SMP preempt mod_unload modversions

$ cat /proc/cmdline # but I tested with others, see above
BOOT_IMAGE=/boot/vmlinuz-6.5.4-1-default root=/dev/mapper/system-root splash=silent resume=/dev/system/swap mitigations=auto quiet security=apparmor modprobe.blacklist=i915 nosimplefb=1

I use these non-factory repos:
https://download.opensuse.org/repositories/X11:/Drivers:/Video:/Redesign/ope...
https://download.opensuse.org/repositories/X11:/XOrg/openSUSE_Tumbleweed/
https://download.nvidia.com/opensuse/tumbleweed

--
You are receiving this mail because:
You are the assignee for the bug.
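The report uses drm_info to see which driver is bound to the DRM node. If drm_info is not installed, the same information can usually be read from sysfs. The following is only a sketch: the function name list_drm_drivers and its optional directory argument are made up for illustration, and it assumes the usual /sys/class/drm layout where each cardN/device/driver is a symlink to the bound driver.

```shell
# Print "<card>: <driver>" for each DRM card node, read from sysfs.
# $1 is the DRM class directory (assumed default: /sys/class/drm).
list_drm_drivers() {
    drm_dir=${1:-/sys/class/drm}
    for card in "$drm_dir"/card*; do
        # Skip connector entries such as card0-eDP-1.
        case ${card##*/} in *-*) continue ;; esac
        # Skip cards without a bound driver (and an unmatched glob).
        [ -e "$card/device/driver" ] || continue
        driver=$(readlink "$card/device/driver")
        echo "${card##*/}: ${driver##*/}"
    done
}
```

Running list_drm_drivers without arguments on the affected machine should show whether only i915 or also the nvidia driver has registered a card node.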
Petr Vorel <petr.vorel@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           See Also|                            |https://bugzilla.suse.com/show_bug.cgi?id=1213693
                 CC|                            |tiwai@suse.com
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c1

--- Comment #1 from Patrik Jakobsson <patrik.jakobsson@suse.com> ---
Can you access the system remotely? If so, please provide dmesg and hwinfo output.
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c2

--- Comment #2 from Petr Vorel <petr.vorel@suse.com> ---
(In reply to Patrik Jakobsson from comment #1)
> Can you access the system remotely? If so, please provide dmesg and hwinfo output.

Unfortunately the system does not reply to ping. I can get to a working system if I switch the BIOS to "Hybrid Graphics". I'm not sure whether the system crashes or whether the network requires nm-applet to start. I'll try to set up networking over a LAN cable and set up SSH so that I can get some logs.
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c3

--- Comment #3 from Petr Vorel <petr.vorel@suse.com> ---
Created attachment 869967
  --> https://bugzilla.suse.com/attachment.cgi?id=869967&action=edit
dmesg of the affected system
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c4

--- Comment #4 from Petr Vorel <petr.vorel@suse.com> ---
Created attachment 869968
  --> https://bugzilla.suse.com/attachment.cgi?id=869968&action=edit
hwinfo of the affected system
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c5

Petr Vorel <petr.vorel@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
 Attachment #869967|0                           |1
        is obsolete|                            |

--- Comment #5 from Petr Vorel <petr.vorel@suse.com> ---
Created attachment 869969
  --> https://bugzilla.suse.com/attachment.cgi?id=869969&action=edit
dmesg of the affected system (cmdline cleanup)

I removed modprobe.blacklist=i915 nosimplefb=1 from the cmdline. As expected, this did not solve the problem; I did it only to use the default cmdline.

There are some errors, but I'm not sure whether they are related:

[    1.464073] BERT: [Hardware Error]: Skipped 1 error records
...
[    2.052280] pcieport 0000:00:01.0: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:01:00.0
[    2.052299] pci 0000:01:00.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
[    2.052345] pci 0000:01:00.0: device [10de:25b9] error status/mask=00100000/00000000
...
[    9.027482] sof-audio-pci-intel-tgl 0000:00:1f.3: init of i915 and HDMI codec failed
...
[   12.628660] [drm:nv_drm_load [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
[   12.629139] [drm:nv_drm_probe_devices [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000100] Failed to register device

The Nvidia card is visible:

$ lspci |grep -i nvidia
01:00.0 VGA compatible controller: NVIDIA Corporation GA107GLM [RTX A1000 Laptop GPU] (rev a1)
01:00.1 Audio device: NVIDIA Corporation Device 2291 (rev a1)
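To pull lines like the ones quoted in this comment out of a full dmesg dump, a small filter can help. This is a minimal sketch, not a complete diagnostic: the function name gpu_errors is hypothetical, and the pattern list is an assumption derived only from the messages seen in this report, so it may miss other relevant lines.

```shell
# Print the GPU/PCIe-related error lines from a saved dmesg dump.
# $1 is the path to the dump; the patterns are taken from this report.
gpu_errors() {
    grep -E 'Hardware Error|AER:|PCIe Bus Error|NVRM:|nvidia-drm' "$1"
}
```

For example, after saving the log with "dmesg > dmesg.txt" (root may be required), "gpu_errors dmesg.txt" prints only the matching lines.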
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c6

Petr Vorel <petr.vorel@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
 Attachment #869968|0                           |1
        is obsolete|                            |

--- Comment #6 from Petr Vorel <petr.vorel@suse.com> ---
Created attachment 869970
  --> https://bugzilla.suse.com/attachment.cgi?id=869970&action=edit
hwinfo of the affected system (cmdline cleanup)

The main difference is that modprobe.blacklist=i915 nosimplefb=1 (previous log file) forced efi-framebuffer instead of the default simple-framebuffer and had "Generic Monitor". But the output is the same: none.
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c7

--- Comment #7 from Petr Vorel <petr.vorel@suse.com> ---
Created attachment 869971
  --> https://bugzilla.suse.com/attachment.cgi?id=869971&action=edit
dmesg on Hybrid Graphics mode (where GUI works, just for a reference)
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c8

--- Comment #8 from Petr Vorel <petr.vorel@suse.com> ---
Created attachment 869972
  --> https://bugzilla.suse.com/attachment.cgi?id=869972&action=edit
hwinfo on Hybrid Graphics mode (where GUI works, just for a reference)
https://bugzilla.suse.com/show_bug.cgi?id=1215981#c9

Stefan Dirsch <sndirsch@suse.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Assignee|kernel-bugs@opensuse.org    |gfx-bugs@suse.de
         QA Contact|qa-bugs@suse.de             |sndirsch@suse.com
             Status|NEW                         |IN_PROGRESS
              Flags|                            |needinfo?(petr.vorel@suse.com)
           Priority|P5 - None                   |P3 - Medium
          Component|Kernel                      |X11 3rd Party Driver

--- Comment #9 from Stefan Dirsch <sndirsch@suse.com> ---
[   12.368440] NVRM: Open nvidia.ko is only ready for use on Data Center GPUs.
[   12.368442] NVRM: To force use of Open nvidia.ko on other GPUs, see the
[   12.368442] NVRM: 'OpenRmEnableUnsupportedGpus' kernel module parameter described
[   12.368443] NVRM: in the README.

So have you set this in modprobe.d/50-nvidia-default.conf?
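One way to answer this question is to search the modprobe configuration for the parameter named in the NVRM message. The sketch below makes some assumptions: the function name find_open_rm_option is hypothetical, and the default search directories /etc/modprobe.d and /usr/lib/modprobe.d are the usual locations but may differ per setup. As far as I know, NVIDIA's README spells the option as "options nvidia NVreg_OpenRmEnableUnsupportedGpus=1"; the search matches on the shorter name from the log, so it finds either spelling.

```shell
# Print modprobe config lines that mention the parameter from the NVRM message.
# Arguments: directories to search (assumed defaults: the usual modprobe.d dirs).
find_open_rm_option() {
    [ "$#" -gt 0 ] || set -- /etc/modprobe.d /usr/lib/modprobe.d
    # -r: recurse, -s: stay quiet about missing or unreadable paths
    grep -rs 'OpenRmEnableUnsupportedGpus' "$@"
}
```

If the function prints nothing, the parameter is not set in any of the searched directories.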