[opensuse-factory] Workstation randomly freezing!
Hello All, I have a brand new HP Z240 workstation running Leap and it randomly freezes, becomes fully unresponsive, both locally and over the network, and I'm forced to power cycle the system. This has been happening multiple time a day with no rime or reason, sometimes it's sitting idle after a reboot, reading email, whatever. The most frustrating part about it is there's never anything in the logs to indicate a reason for this. It's the default installation, BTRFS partitioning scheme, XFS for /home, KDE5, nothing crazy at all. I've installed and configured kdump with the hope it will be able to provide some insight after the next occurrence, but given there are no kernel crash messages in the logs it's hard to say whether it will help. I guess time will tell. Does anyone have any thoughts further debugging this type of an issue? -- Later, Darin -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On 15/04/2016 12:45 AM, Darin Perusich wrote:
Does anyone have any thoughts further debugging this type of an issue?
start disabling things, I'd start with the power saving settings - sleep, hibernate, dim screen etc. -- Lindsay Mathieson -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thu, 14 Apr 2016 16:45:31 +0200, Darin Perusich wrote:
Hello All,
I have a brand new HP Z240 workstation running Leap and it randomly freezes, becomes fully unresponsive, both locally and over the network, and I'm forced to power cycle the system. This has been happening multiple time a day with no rime or reason, sometimes it's sitting idle after a reboot, reading email, whatever. The most frustrating part about it is there's never anything in the logs to indicate a reason for this.
It's the default installation, BTRFS partitioning scheme, XFS for /home, KDE5, nothing crazy at all.
I've installed and configured kdump with the hope it will be able to provide some insight after the next occurrence, but given there are no kernel crash messages in the logs it's hard to say whether it will help. I guess time will tell.
Does anyone have any thoughts further debugging this type of an issue?
It's a Skylake system, right? Skylake graphics is weakly supported by Leap kernel. Try the latest kernel in OBS Kernel:openSUSE-42.1 repo, at least. I guess 4.1.20 was released in the update repo recently, too, so this should be OK, too. Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable In anyway, feel free to open the bug report. Don't forget to attach the output of "hwinfo --all" and the kernel messages (dmesg output) for a while after a fresh boot. Takashi -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thu, Apr 14, 2016 at 11:21 AM, Takashi Iwai <tiwai@suse.de> wrote:
On Thu, 14 Apr 2016 16:45:31 +0200, Darin Perusich wrote:
Hello All,
I have a brand new HP Z240 workstation running Leap and it randomly freezes, becomes fully unresponsive, both locally and over the network, and I'm forced to power cycle the system. This has been happening multiple time a day with no rime or reason, sometimes it's sitting idle after a reboot, reading email, whatever. The most frustrating part about it is there's never anything in the logs to indicate a reason for this.
It's the default installation, BTRFS partitioning scheme, XFS for /home, KDE5, nothing crazy at all.
I've installed and configured kdump with the hope it will be able to provide some insight after the next occurrence, but given there are no kernel crash messages in the logs it's hard to say whether it will help. I guess time will tell.
Does anyone have any thoughts further debugging this type of an issue?
It's a Skylake system, right? Skylake graphics is weakly supported by Leap kernel. Try the latest kernel in OBS Kernel:openSUSE-42.1 repo, at least. I guess 4.1.20 was released in the update repo recently, too, so this should be OK, too.
Yes this is a Skylack system and it's running the 4.1.20-11-default, and I see that 4.1.21-2.1 is the version in OBS Kernel:openSUSE-42.1 .
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
In anyway, feel free to open the bug report. Don't forget to attach the output of "hwinfo --all" and the kernel messages (dmesg output) for a while after a fresh boot.
I was planning on this but wanted to wait until I had a few work-arounds.
Takashi -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
-- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On jeudi, 14 avril 2016 12.19:23 h CEST Darin Perusich wrote:
HP Z240 workstation
Don't forget to recheck frequently bios and firmware update. My December Dell Mobile Precision got 6 update in 4 months with numerous upgrade of firmware :-) -- Bruno Friedmann Ioda-Net Sàrl www.ioda-net.ch openSUSE Member, fsfe fellowship GPG KEY : D5C9B751C4653227 irc: tigerfoot -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thu, Apr 14, 2016 at 1:19 PM, Darin Perusich <darin@darins.net> wrote:
Yes this is a Skylack system and it's running the 4.1.20-11-default, and I see that 4.1.21-2.1 is the version in OBS Kernel:openSUSE-42.1 .
Not gonna work well.. try kernel:HEAD or Kernel:Stable.. -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thu, Apr 14, 2016 at 12:19 PM, Darin Perusich <darin@darins.net> wrote:
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
Setting these kernel parameters had no effect, the system just stopped responding and needed a power cycle. Unfortunately kdump didn't capture anything. -- Later, Darin -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thu, 14 Apr 2016 22:15:40 +0200, Darin Perusich wrote:
On Thu, Apr 14, 2016 at 12:19 PM, Darin Perusich <darin@darins.net> wrote:
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
Setting these kernel parameters had no effect, the system just stopped responding and needed a power cycle. Unfortunately kdump didn't capture anything.
You may try also i915-quickfix KMP in OBS home:tiwai:bnc974884 repo. It has a couple of backports. Just install i915-quickfix-kmp-default.rpm from that repo, reboot and retest. Speaking of kdump: you should test kdump manually beforehand, just to check whether it works at all. YaST2 kdump setup is often too tight and fails when KMS is used. Run echo -n c > /proc/sysrq-trigger and see whether you get a proper kdump. Takashi -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thu, Apr 14, 2016 at 4:27 PM, Takashi Iwai <tiwai@suse.de> wrote:
On Thu, 14 Apr 2016 22:15:40 +0200, Darin Perusich wrote:
On Thu, Apr 14, 2016 at 12:19 PM, Darin Perusich <darin@darins.net> wrote:
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
Setting these kernel parameters had no effect, the system just stopped responding and needed a power cycle. Unfortunately kdump didn't capture anything.
You may try also i915-quickfix KMP in OBS home:tiwai:bnc974884 repo. It has a couple of backports. Just install i915-quickfix-kmp-default.rpm from that repo, reboot and retest.
Installed and rebooted. I'll add any comments to bugzilla.
Speaking of kdump: you should test kdump manually beforehand, just to check whether it works at all. YaST2 kdump setup is often too tight and fails when KMS is used. Run echo -n c > /proc/sysrq-trigger and see whether you get a proper kdump.
The kdump failed due to not enough memory so I'll tweak the values.
Takashi -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
-- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thu, Apr 14, 2016 at 4:27 PM, Takashi Iwai <tiwai@suse.de> wrote:
On Thu, 14 Apr 2016 22:15:40 +0200, Darin Perusich wrote:
On Thu, Apr 14, 2016 at 12:19 PM, Darin Perusich <darin@darins.net> wrote:
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
Setting these kernel parameters had no effect, the system just stopped responding and needed a power cycle. Unfortunately kdump didn't capture anything.
You may try also i915-quickfix KMP in OBS home:tiwai:bnc974884 repo. It has a couple of backports. Just install i915-quickfix-kmp-default.rpm from that repo, reboot and retest.
These KMP did not resolve the issue, the system stopped responding sometime last night. Will file a bug report. -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Fri, 15 Apr 2016 15:31:46 +0200, Darin Perusich wrote:
On Thu, Apr 14, 2016 at 4:27 PM, Takashi Iwai <tiwai@suse.de> wrote:
On Thu, 14 Apr 2016 22:15:40 +0200, Darin Perusich wrote:
On Thu, Apr 14, 2016 at 12:19 PM, Darin Perusich <darin@darins.net> wrote:
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
Setting these kernel parameters had no effect, the system just stopped responding and needed a power cycle. Unfortunately kdump didn't capture anything.
You may try also i915-quickfix KMP in OBS home:tiwai:bnc974884 repo. It has a couple of backports. Just install i915-quickfix-kmp-default.rpm from that repo, reboot and retest.
These KMP did not resolve the issue, the system stopped responding sometime last night. Will file a bug report.
OK, I haven't expected it much, as the patches are only relevant with DP MST. In anyway, it'd be helpful if you can get the crash dump, at least, the kernel message at crash. Otherwise it's difficult to know where to start from. Takashi -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Fri, Apr 15, 2016 at 9:36 AM, Takashi Iwai <tiwai@suse.de> wrote:
On Fri, 15 Apr 2016 15:31:46 +0200, Darin Perusich wrote:
On Thu, Apr 14, 2016 at 4:27 PM, Takashi Iwai <tiwai@suse.de> wrote:
On Thu, 14 Apr 2016 22:15:40 +0200, Darin Perusich wrote:
On Thu, Apr 14, 2016 at 12:19 PM, Darin Perusich <darin@darins.net> wrote:
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
Setting these kernel parameters had no effect, the system just stopped responding and needed a power cycle. Unfortunately kdump didn't capture anything.
You may try also i915-quickfix KMP in OBS home:tiwai:bnc974884 repo. It has a couple of backports. Just install i915-quickfix-kmp-default.rpm from that repo, reboot and retest.
These KMP did not resolve the issue, the system stopped responding sometime last night. Will file a bug report.
OK, I haven't expected it much, as the patches are only relevant with DP MST. In anyway, it'd be helpful if you can get the crash dump, at least, the kernel message at crash. Otherwise it's difficult to know where to start from.
https://bugzilla.opensuse.org/show_bug.cgi?id=975780 -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Thursday, April 14, 2016 10:27:28 PM PDT Takashi Iwai wrote:
On Thu, 14 Apr 2016 22:15:40 +0200,
Darin Perusich wrote:
On Thu, Apr 14, 2016 at 12:19 PM, Darin Perusich <darin@darins.net> wrote:
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options
intel_idle.max_cstate=0 intel_pstate=disable
I'll give these options a try before installing the newer kernel.
Setting these kernel parameters had no effect, the system just stopped responding and needed a power cycle. Unfortunately kdump didn't capture anything.
You may try also i915-quickfix KMP in OBS home:tiwai:bnc974884 repo. It has a couple of backports. Just install i915-quickfix-kmp-default.rpm from that repo, reboot and retest.
Installing this generated a package dependency problem nothing provides ksym(default:cpu_tss) = 3356b90b solution 1: don't install solution 2: break i915-quickfix KMP by ignoring dependency What's the best course of action with this? Carl
Speaking of kdump: you should test kdump manually beforehand, just to check whether it works at all. YaST2 kdump setup is often too tight and fails when KMS is used. Run echo -n c > /proc/sysrq-trigger and see whether you get a proper kdump.
Takashi
-- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On 04/15/2016 12:51 AM, Takashi Iwai wrote:
On Thu, 14 Apr 2016 16:45:31 +0200, Darin Perusich wrote:
Hello All,
I have a brand new HP Z240 workstation running Leap and it randomly freezes, becomes fully unresponsive, both locally and over the network, and I'm forced to power cycle the system. This has been happening multiple time a day with no rime or reason, sometimes it's sitting idle after a reboot, reading email, whatever. The most frustrating part about it is there's never anything in the logs to indicate a reason for this.
It's the default installation, BTRFS partitioning scheme, XFS for /home, KDE5, nothing crazy at all.
I've installed and configured kdump with the hope it will be able to provide some insight after the next occurrence, but given there are no kernel crash messages in the logs it's hard to say whether it will help. I guess time will tell.
Does anyone have any thoughts further debugging this type of an issue?
It's a Skylake system, right? Skylake graphics is weakly supported by Leap kernel. Try the latest kernel in OBS Kernel:openSUSE-42.1 repo, at least. I guess 4.1.20 was released in the update repo recently, too, so this should be OK, too.
Also, it might be safer to disable intel_idle and intel_pstate drivers, e.g. pass options intel_idle.max_cstate=0 intel_pstate=disable
In anyway, feel free to open the bug report. Don't forget to attach the output of "hwinfo --all" and the kernel messages (dmesg output) for a while after a fresh boot.
Takashi
I saw at least one of these shipping with a NVIDIA Quadro graphics card of which we never had a smooth experience with under Linux at my previous employer (Most were pulled out and replaced with much cheeper GeForce cards which did the job really well. Having said that if you do have one of these cards the latest NVIDIA proprietary drivers might work better. -- Simon Lees (Simotek) http://simotek.net Emergency Update Team keybase.io/simotek SUSE Linux Adeliade Australia, UTC+9:30 GPG Fingerprint: 5B87 DB9D 88DC F606 E489 CEC5 0922 C246 02F0 014B
On different hardware with a similar problem I found KDE was unable to deal with the new hardware. When I went to Gnome 3.16 from the install DVD, then the freezes stopped. Allen --- It is the thoughts we don't have that get us in life. +++++++ On Thu, 14 Apr 2016, Darin Perusich wrote:
Hello All,
I have a brand new HP Z240 workstation running Leap and it randomly freezes, becomes fully unresponsive, both locally and over the network, and I'm forced to power cycle the system. This has been happening multiple time a day with no rime or reason, sometimes it's sitting idle after a reboot, reading email, whatever. The most frustrating part about it is there's never anything in the logs to indicate a reason for this.
It's the default installation, BTRFS partitioning scheme, XFS for /home, KDE5, nothing crazy at all.
I've installed and configured kdump with the hope it will be able to provide some insight after the next occurrence, but given there are no kernel crash messages in the logs it's hard to say whether it will help. I guess time will tell.
Does anyone have any thoughts further debugging this type of an issue?
-- Later, Darin -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
-- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
On Fri, Apr 15, 2016 at 6:21 AM, Carlos E. R. <robin.listas@telefonica.net> wrote:
On 2016-04-14 16:45, Darin Perusich wrote:
Does anyone have any thoughts further debugging this type of an issue?
The typical: check heat, check RAM health...
FWIW all hardware diagnostics pass -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
participants (9)
-
Allen Wilkinson
-
Bruno Friedmann
-
Carl Symons
-
Carlos E. R.
-
Cristian Rodríguez
-
Darin Perusich
-
Lindsay Mathieson
-
Simon Lees
-
Takashi Iwai