[Bug 461241] New: hard hangs and unkillable processes
https://bugzilla.novell.com/show_bug.cgi?id=461241

           Summary: hard hangs and unkillable processes
           Product: openSUSE 11.1
           Version: Final
          Platform: x86-64
        OS/Version: Other
            Status: NEW
          Severity: Major
          Priority: P5 - None
         Component: Kernel
        AssignedTo: bnc-team-screening@forge.provo.novell.com
        ReportedBy: augustmiles@yahoo.com
         QAContact: qa@suse.de
          Found By: Other

I have just updated from 11.0 to 11.1. Our machine is a dual-processor Xeon, used as a web server and IMAP mail server. It is administered remotely over ssh.

I had an imap process on my server using 100% of CPU. I tried using kill -9 to stop it, without success. I then tried taking down dovecot imap with "rcdovecot stop"; at this point I could no longer get the command prompt back. I am now unable to log in to the machine, although it does answer a ping.

I am filing under kernel: a runaway process should not bring down the connectivity of a machine in any situation.

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=461241

User novell.com@kleinmanns.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c1

florian florian <novell.com@kleinmanns.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |novell.com@kleinmanns.com

--- Comment #1 from florian florian <novell.com@kleinmanns.com> 2008-12-29 11:00:48 MST ---
Might be a duplicate of bug #460634.
https://bugzilla.novell.com/show_bug.cgi?id=461241

User augustmiles@yahoo.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c2

--- Comment #2 from august miles <augustmiles@yahoo.com> 2009-01-10 09:39:29 MST ---
I am the original reporter. I have a second machine that had a similar problem with Okular, which was viewing a DVI file generated by LaTeX. It was running at 100% CPU, again on a 64-bit Xeon. Okular was launched as a subprocess of emacs-gtk, which also hung, without using CPU. When I switched to a virtual terminal, the whole machine hung.

-----------

I have downgraded to an 11.0 kernel on the original machine running imap; since then there have been no problems. (Userland remains 11.1.)
https://bugzilla.novell.com/show_bug.cgi?id=461241

Cyril Hrubis <chrubis@novell.com> changed:

           What    |Removed                                   |Added
----------------------------------------------------------------------------------------------------------
         AssignedTo|bnc-team-screening@forge.provo.novell.com |kernel-maintainers@forge.provo.novell.com
https://bugzilla.novell.com/show_bug.cgi?id=461241

User bw@inside-security.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c3

Boris Wesslowski <bw@inside-security.de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |bw@inside-security.de

--- Comment #3 from Boris Wesslowski <bw@inside-security.de> 2009-02-06 10:42:44 MST ---
We have the same problem on an HP ML110 G5 server with a quad-core Xeon processor and two SATA disks in RAID1. It is also dovecot that has shown unkillable processes, especially when the IMAP client is closed.

The system degrades from the point where they appear until disk accesses seem to hang: xosview shows all cores in 100% wait state, the system still responds to pings and the DHCP server is even able to write to syslog, but everything else seems to be waiting for the disks. At that point the console is usually black and won't react to Ctrl-Alt-Del, but Alt-Printscreen-b reboots the machine.

Disabling the irq_balancer seems to make the hangs happen less often. We have not tried another kernel yet.
https://bugzilla.novell.com/show_bug.cgi?id=461241

User bw@inside-security.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c4

Boris Wesslowski <bw@inside-security.de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Priority|P5 - None                   |P2 - High

--- Comment #4 from Boris Wesslowski <bw@inside-security.de> 2009-02-10 11:26:47 MST ---
Here's an update: we tried the "vanilla" kernel as supplied with openSUSE 11.1 and the server hung again, once more shortly after a certain (Windows/Thunderbird) user closed his dovecot IMAP mailbox at the end of the day...
https://bugzilla.novell.com/show_bug.cgi?id=461241

User gregkh@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c5

Greg Kroah-Hartman <gregkh@novell.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO
      Info Provider|                            |augustmiles@yahoo.com

--- Comment #5 from Greg Kroah-Hartman <gregkh@novell.com> 2009-02-11 09:23:20 MST ---
Any clues as to where the machine is hung? alt-sysrq-t should show you the task list; we are going to need some kind of clue to be able to work on this.
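On a machine reachable only over ssh, as the reporter's server is, the keyboard combination cannot be used. On kernels built with magic SysRq support, root can instead write `t` to /proc/sysrq-trigger to have the kernel log the task list. As a lighter first check, the following is a minimal sketch, assuming a Linux /proc filesystem, that lists processes in uninterruptible sleep (state D), the state in which a process does not act on kill -9. The function names are illustrative and do not come from this report. A task spinning inside the kernel would show as R instead, so an empty result does not rule out a kernel problem.

```python
import os


def parse_stat_line(line):
    """Return (pid, comm, state) from the contents of a /proc/<pid>/stat file."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the first '(' and the last ')'.
    open_paren = line.index("(")
    close_paren = line.rindex(")")
    pid = int(line[:open_paren])
    comm = line[open_paren + 1:close_paren]
    state = line[close_paren + 1:].split()[0]
    return pid, comm, state


def uninterruptible_tasks(proc_root="/proc"):
    """List (pid, comm) for every process currently in state 'D'."""
    found = []
    if not os.path.isdir(proc_root):
        # Not a Linux-style /proc; nothing to report.
        return found
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as handle:
                pid, comm, state = parse_stat_line(handle.read())
        except (OSError, ValueError, IndexError):
            # The process exited during the scan, or the file was unreadable.
            continue
        if state == "D":
            found.append((pid, comm))
    return found


if __name__ == "__main__":
    for pid, comm in uninterruptible_tasks():
        print(pid, comm)
```

A process that stays in this list across repeated runs is a candidate for the kind of stuck task described in this bug; the kernel task list is still needed to see where it is blocked.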
https://bugzilla.novell.com/show_bug.cgi?id=461241

User augustmiles@yahoo.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c6

--- Comment #6 from august miles <augustmiles@yahoo.com> 2009-02-11 09:31:06 MST ---
I would guess it is this known problem of dovecot with 2.6.27 kernels...

http://www.mail-archive.com/dovecot@dovecot.org/msg15054.html

It seems to be a problem with inotify. This would be consistent with the problems I have also had with Okular, which I understand also uses inotify.

I have been running for weeks with the downgraded kernel (2.6.25.18-0.2 from openSUSE 11.0), but 11.1 userland. It is totally stable.
https://bugzilla.novell.com/show_bug.cgi?id=461241

User gregkh@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c7

--- Comment #7 from Greg Kroah-Hartman <gregkh@novell.com> 2009-02-11 10:04:21 MST ---
Ah, yeah, that should be the issue. That is solved in the updated kernel package. If you get that, this should go away. Can you try that?
https://bugzilla.novell.com/show_bug.cgi?id=461241

User bw@inside-security.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c8

--- Comment #8 from Boris Wesslowski <bw@inside-security.de> 2009-02-12 03:13:01 MST ---
Sorry for the dumb question, but is "the updated kernel package" supposed to be the one in http://download.opensuse.org/repositories/Kernel:/SL111_BRANCH/openSUSE_11.1... ?
https://bugzilla.novell.com/show_bug.cgi?id=461241

User gregkh@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c9

--- Comment #9 from Greg Kroah-Hartman <gregkh@novell.com> 2009-02-12 09:52:26 MST ---
(In reply to comment #8)
Yes, you can use that one.
https://bugzilla.novell.com/show_bug.cgi?id=461241

User bw@inside-security.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c10

--- Comment #10 from Boris Wesslowski <bw@inside-security.de> 2009-02-13 06:43:08 MST ---
(In reply to comment #9)
kernel-default-2.6.27.8-11.1.x86_64.rpm from the URL above does not fix the problem for us. An unkillable imap process appeared again and the machine was not able to shut down and reboot by itself (I am not on site to check details)...
https://bugzilla.novell.com/show_bug.cgi?id=461241

User bw@inside-security.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c11

--- Comment #11 from Boris Wesslowski <bw@inside-security.de> 2009-03-03 04:44:27 MST ---
The problem does not happen with the recently released kernel-default-2.6.27.19-3.2.1. I consider this bug closed.
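To check whether a given machine is already running a kernel at least as new as the one reported good here, the upstream part of the release string can be compared numerically. The following is a minimal sketch under that assumption; the function names and the sample release strings are illustrative. It compares version numbers only and cannot confirm that a particular kernel build actually contains the fix.

```python
import platform
import re

# Upstream version of the package reported as working in this thread.
REPORTED_GOOD = (2, 6, 27, 19)


def kernel_version_tuple(release):
    """Turn a release string such as '2.6.27.19-3.2-default' into (2, 6, 27, 19)."""
    # Keep only the leading dotted-number part; the distribution suffix is ignored.
    match = re.match(r"\d+(?:\.\d+)*", release)
    if match is None:
        raise ValueError("unrecognised kernel release: %r" % release)
    return tuple(int(part) for part in match.group(0).split("."))


def at_least_reported_good(release):
    """True if the upstream version is 2.6.27.19 or newer."""
    return kernel_version_tuple(release) >= REPORTED_GOOD


if __name__ == "__main__":
    running = platform.release()
    print(running, at_least_reported_good(running))
```

Comparing tuples of integers avoids the string-comparison trap in which "2.6.27.8" would sort after "2.6.27.19".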
https://bugzilla.novell.com/show_bug.cgi?id=461241

User gregkh@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=461241#c12

Greg Kroah-Hartman <gregkh@novell.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |RESOLVED
      Info Provider|augustmiles@yahoo.com       |
         Resolution|                            |FIXED

--- Comment #12 from Greg Kroah-Hartman <gregkh@novell.com> 2009-03-03 08:24:27 MST ---
Thanks for letting us know.