[Bug 1063139] New: /usr/bin/X coredump in RRSetChanged()
http://bugzilla.suse.com/show_bug.cgi?id=1063139 Bug ID: 1063139 Summary: /usr/bin/X coredump in RRSetChanged() Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: Other OS: Other Status: NEW Severity: Normal Priority: P5 - None Component: X.Org Assignee: xorg-maintainer-bugs@forge.provo.novell.com Reporter: kkaempf@suse.com QA Contact: xorg-maintainer-bugs@forge.provo.novell.com Found By: --- Blocker: --- #8 <signal handler called> #9 RRSetChanged (pScreen=0x5560e172ad90) at randr.c:558 #10 0x00005560e0e9082f in RRScreenSetSizeRange (pScreen=pScreen@entry=0x5560e172ad90, minWidth=<optimized out>, minHeight=<optimized out>, maxWidth=<optimized out>, maxHeight=<optimized out>) at rrinfo.c:228 #11 0x00005560e0e4e693 in xf86RandR12CreateScreenResources12 (pScreen=0x5560e172ad90) at xf86RandR12.c:1795 #12 xf86RandR12CreateScreenResources (pScreen=pScreen@entry=0x5560e172ad90) at xf86RandR12.c:844 #13 0x00005560e0e413f0 in xf86CrtcCreateScreenResources (screen=<optimized out>) at xf86Crtc.c:719 #14 0x00005560e0dd2031 in dix_main (argc=12, argv=0x7ffdbd25b448, envp=<optimized out>) at main.c:208 #15 0x00007fb0014ecf4a in __libc_start_main () from /lib64/libc.so.6 #16 0x00005560e0dbbeba in _start () at ../sysdeps/x86_64/start.S:120 -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c1
--- Comment #1 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c2
--- Comment #2 from Klaus Kämpf
rpm -qf /usr/bin/X xorg-x11-server-1.19.3-5.1.x86_64
rpm -q xf86-video-intel xf86-video-intel-2.99.917+git781.c8990575-1.1.x86_64
lspci 00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller]) Subsystem: Lenovo Device 21da Flags: bus master, fast devsel, latency 0, IRQ 30 Memory at f0000000 (64-bit, non-prefetchable) [size=4M] Memory at e0000000 (64-bit, prefetchable) [size=256M] I/O ports at 5000 [size=64] [virtual] Expansion ROM at 000c0000 [disabled] [size=128K] Capabilities: <access denied> Kernel driver in use: i915 Kernel modules: i915
-- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c3
--- Comment #3 from Klaus Kämpf
lspci 00:02.0 VGA compatible controller: Intel Corporation Xeon E3-1200 v3/4th Gen Core Processor Integrated Graphics Controller (rev 06) (prog-if 00 [VGA controller]) Subsystem: Dell Device 05a4 Flags: bus master, fast devsel, latency 0, IRQ 33 Memory at f7800000 (64-bit, non-prefetchable) [size=4M] Memory at d0000000 (64-bit, prefetchable) [size=256M] I/O ports at f000 [size=64] Capabilities: <access denied> Kernel driver in use: i915 Kernel modules: i915
-- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c4
--- Comment #4 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c5
Michal Srb
From the stacktrace it looks like it could be some problem with PRIME (multi GPU support). From the X log it looks like you also have some AMD card in your computer: PCI: (0:0:2:0) 8086:0412:1028:05a4 rev 6, Mem @ 0xf7800000/4194304, 0xd0000000/268435456, I/O @ 0x0000f000/64 PCI:*(0:1:0:0) 1002:6611:1028:210b rev 0, Mem @ 0xe0000000/268435456, 0xf7c00000/262144, I/O @ 0x0000e000/256, BIOS @ 0x????????/131072
Is there also AMD GPU in the computer? (In reply to Klaus Kämpf from comment #2)
rpm -qf /usr/bin/X xorg-x11-server-1.19.3-5.1.x86_64
The newest version in Tumbleweed is 1.19.4. I don't see any related fixes in 1.19.4, but it is worth trying.
rpm -q xf86-video-intel xf86-video-intel-2.99.917+git781.c8990575-1.1.x86_64
Strange that it is installed, but X server is failing to find it. There seem to be some other discrepancies between versions: [ 68920.106] (II) Module modesetting: vendor="X.Org Foundation" [ 68920.106] compiled for 1.19.4, module version = 1.19.4 ^^^^^^ Please make sure you have up-to-date X server and all drivers. If the crash still happens, please attach coredump as well. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c6
Klaus Kämpf
Is there also AMD GPU in the computer?
Yes:
lspci
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Oland [Radeon HD 8570 / R7 240/340 OEM] (prog-if 00 [VGA controller]) Subsystem: Dell Radeon R5 240 OEM Flags: fast devsel, IRQ 16 Memory at e0000000 (64-bit, prefetchable) [size=256M] Memory at f7c00000 (64-bit, non-prefetchable) [size=256K] I/O ports at e000 [size=256] Expansion ROM at 000c0000 [disabled] [size=128K] Capabilities: <access denied> Kernel modules: radeon, amdgpu
There seem to be some other discrepancies between versions: [ 68920.106] (II) Module modesetting: vendor="X.Org Foundation" [ 68920.106] compiled for 1.19.4, module version = 1.19.4 ^^^^^^
Huh ? Shouldn't package dependencies prevent this ? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c7
--- Comment #7 from Klaus Kämpf
There seem to be some other discrepancies between versions: [ 68920.106] (II) Module modesetting: vendor="X.Org Foundation" [ 68920.106] compiled for 1.19.4, module version = 1.19.4 ^^^^^^
Huh ? Shouldn't package dependencies prevent this ?
Argh, my bad. Again, I confused remote with local. Remote system is now up to date with
rpm -q xorg-x11-server xorg-x11-server-1.19.4-1.1.x86_64
And the crash is with the AMD GPU
rpm -q xf86-video-amdgpu xf86-video-amdgpu-1.4.0-1.1.x86_64
Background: I used to run with multiple graphics cards and multiple monitors. This setup was running fine with Tumbleweed until Jun/Aug timeframe. Since then, only one card was functioning but not both together. Since last week (Tumbleweed 20171004) X completely refused to start. Just yesterday I detected the core dumps :-/ -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c8
--- Comment #8 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c9
--- Comment #9 from Michal Srb
Huh ? Shouldn't package dependencies prevent this ?
Well in theory the different versions should work together as long as they have the same ABI_VIDEODRV_VERSION (they have in this case). Package dependencies only make sure that ABI_VIDEODRV_VERSION of the drivers match the one of X server. But something is broken, so it is worth checking every option, maybe upstream made some changes and forgot to bump ABI_VIDEODRV_VERSION. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c10
--- Comment #10 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c11
Michal Srb
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c12
--- Comment #12 from Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c13
--- Comment #13 from Klaus Kämpf
Also, we had a regression in 1.19.4, which has been addressed again in 1.19.5. Klaus has installed the -amdgpu driver for unknown reasons. It appears to be the wrong driver. Should be -ati instead.
I had no idea which one to install. I did some experiments of installing only one of -amdgpu, -ati, or -intel in the past. To no avail. Also installing xf86-video-ati (alongside xf86-video-intel) lead to the same crash. To be continued next week ... thanks a lot so far ! -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c14
Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c15
--- Comment #15 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c16
Michal Srb
Can you please verify that their files are OK? (rpm -qV xf86-video-intel xf86-video-ati)
If the files are there, could you record strace of X server so we know why is it failing to find their files?
-- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c17
--- Comment #17 from Klaus Kämpf
That's why I asked you to:
(In reply to Michal Srb from comment #11)
Can you please verify that their files are OK? (rpm -qV xf86-video-intel xf86-video-ati)
If the files are there, could you record strace of X server so we know why is it failing to find their files?
Sorry, I missed this comment. (rpm -qV xf86-video-intel xf86-video-ati) does not return any output. Both packages were re-installed by me after the last TW update. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c18
Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c19
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c20
Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c21
--- Comment #21 from Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c22
--- Comment #22 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c23
Michal Srb
http://bugzilla.suse.com/show_bug.cgi?id=1063139
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c24
--- Comment #24 from Klaus Kämpf
Alright, now the situation looks bit different. Both the strace and the logs in comment 20 and comment 22 show that the xf86-video-ati is found and loaded. However, it fails to initialize and in case of comment 20 it fallbacks to VESA, in case of comment 22 it has no options left and terminates.
Why doesn't it fall back to the intel card ? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c25
Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c26
--- Comment #26 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c27
--- Comment #27 from Stefan Dirsch
(In reply to Michal Srb from comment #23)
Alright, now the situation looks bit different. Both the strace and the logs in comment 20 and comment 22 show that the xf86-video-ati is found and loaded. However, it fails to initialize and in case of comment 20 it fallbacks to VESA, in case of comment 22 it has no options left and terminates.
Why doesn't it fall back to the intel card ?
Good question. It's yet another bug I would say. But please let's concentrate now on why radeon KMS support isn't working, please. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c28
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c29
--- Comment #29 from Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c30
--- Comment #30 from Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c31
--- Comment #31 from Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c32
Klaus Kämpf
Klaus, can you confirm, that this kernel option is also set on your system?
zcat /proc/config.gz | grep DRM_AMDGPU_SI
Confirmed :-( -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c33
--- Comment #33 from Klaus Kämpf
For some reason amdgpu instead of radeon KMS driver is loaded and apparently does not work, not even with generic modesetting X driver it seems. Please try these kernel options.
radeon.si_support=1 amdgpu.si_support=0
No, this does not improve the situation. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c34
--- Comment #34 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c35
--- Comment #35 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c36
--- Comment #36 from Michal Srb
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c37
Stefan Dirsch
It seems to me that amdgpu puts its hands away from the card correctly.
I agree.
What I don't understand is why the radeon driver doesn't get used for it after that.
Exactly. We need to know what happens, if you run modprobe radeon manually and the dmesg output afterwards. Even better the dmesg output *before* and after running dmesg -c > /dev/null modprobe radeon -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c38
--- Comment #38 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c39
Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c40
--- Comment #40 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c41
--- Comment #41 from Michal Srb
With the radeon module loaded, startx succeeds (amd card only). Xorg.0.log attached.
The log shows that both (user-space) radeon and intel drivers are used. Maybe the PRIME failed to associate the cards as output sink/source automatically. Let's add it to the list of issues to solve later. So we found out that manually loaded (kernel) radeon driver works. The question remains why is it not loaded automatically. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c42
--- Comment #42 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c43
--- Comment #43 from Stefan Dirsch
Created attachment 744956 [details] dmesg "after" (no additional kernel parameters)
[ 105.932494] radeon 0000:01:00.0: Invalid PCI ROM data signature: expecting 0x52494350, got 0xe808aa55 [...] [ 106.123771] radeon 0000:01:00.0: failed VCE resume (-110). Hmm. Something sounds wrong here. Maybe kernel-firmware is not installed or outdated or whatever. Or even hardware is broken. Could you verify, that the graphics card's fan is still rotating? ;-) -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c44
--- Comment #44 from Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c45
--- Comment #45 from Klaus Kämpf
Honestly, if it's possible in Firmware/BIOS to disable Intel card, I would like to know whether the radeon cards works during boot.
Yes, actually both cards work during boot and show boot log messages (first the ati card, then kernel logging switches to the intel card surprisingly). -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c46
--- Comment #46 from Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c47
--- Comment #47 from Klaus Kämpf
Honestly, if it's possible in Firmware/BIOS to disable Intel card, I would like to know whether the radeon cards works during boot.
I can't disable a card, but I can configure the primary display in the BIOS. If I choose the ATI card, kernel messages appear on the ATI displays (2 monitors are attached to the ATI card) during boot up to a specific point where the kernel switches message output to the Intel display. The GDM login appears on the ATI displays. But it's unusable due to flickering. If I choose the Intel card, all kernel messages appear on the Intel card. The ATI displays stay completely dark. Running "startx" provides a GNOME desktop on the Intel display. But it's unusable due to flickering. ==> Both cards work separately. I can choose the card via BIOS setting. The actual desktop is broken for both cards (excessive flickering) though. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c48
--- Comment #48 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c49
--- Comment #49 from Klaus Kämpf
http://bugzilla.suse.com/show_bug.cgi?id=1063139
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c53
Stefan Dirsch
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c54
--- Comment #54 from Klaus Kämpf
Klaus, is this still an issue with curent TW?
No, this seems fixed. Thanks ! -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1063139
http://bugzilla.suse.com/show_bug.cgi?id=1063139#c55
Klaus Kämpf
participants (1)
-
bugzilla_noreply@novell.com