[Bug 702205] New: RTL8111/8168B hard locking and rebooting machines when under heavy load
https://bugzilla.novell.com/show_bug.cgi?id=702205 https://bugzilla.novell.com/show_bug.cgi?id=702205#c0 Summary: RTL8111/8168B hard locking and rebooting machines when under heavy load Classification: openSUSE Product: openSUSE 11.4 Version: Final Platform: x86-64 OS/Version: openSUSE 11.4 Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: Happy.Cerberus@gmail.com QAContact: qa@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:2.0.1) Gecko/20100101 Firefox/4.0.1 When this network card (should apply to all cards from this family) is under heavy load (>20MB) it causes hard lockups and/or hard machine reboots. This issue is extremely hard to localize. There are either no errors, or the machine reboots to fast. Plus the actual errors are not very informative ("eth0 link up" is one of the "fatal errors"). Upstream reference: https://bugzilla.kernel.org/show_bug.cgi?id=32962 Ubuntu forum reference: http://ubuntuforums.org/showpost.php?p=10774353&postcount=18 Solution is to use the driver provided by Realtek. I spent two days hunting this problem. Reproducible: Always Steps to Reproduce: 1. do some network heavy stuff 2. watch your machine crash Actual Results: Machine reboots or hard locks. Expected Results: No problems should appear. Please even if you won't fix this soon, at least disable the problematic module, so that users will have some clue where to search, because this is a really, really, really annoying problem that is extremely hard to diagnose. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c1
--- Comment #1 from Simon Toth
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c2
Sven Hartrumpf
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c3
Joschi Brauchle
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c
Jeff Mahoney
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c4
Benjamin Poirier
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c5
Benjamin Poirier
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c6
Martin Seidler
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c7
--- Comment #7 from Benjamin Poirier
Compare:
1) In/against openSUSE 12.1: [opensuse] Install help for Network driver (Date: Wed, 14 Mar 2012 15:27:02 -0400) http://lists.opensuse.org/opensuse/2012-03/msg00676.html especially: http://lists.opensuse.org/opensuse/2012-03/msg00765.html
There is some confusion in that thread:
As suspected your software system uses a r816*9* kernel module (...9) but your hardware is a Realtek [...] RTL8111/816*8*B [10ec:816*8*] (...8).
Despite it's name, the r8169 module is meant to drive cards based on the realtek 8168/8111 chips. The difference between the two modules is that: r8168 is a binary-only driver provided by realtek r8169 is a community-developped and supported driver While it is the case that r8168 usually supports newer chips first, the version of r8169 currently in openSUSE 12.1 supports all the chips I've seen in circulation so far. IMO, steering users towards r8168 is ill-advised as it will be extremely difficult to find some developpers willing and able to provide support for it. Secondly, lspci output is insufficient to determine the chip version as many of them share a small set of pci ids. A first step in identifying the chip version is the (masked) XID line found in the kernel logs as pointed out at the end of comment 4. The chip version is identified from the (unmasked) XID in rtl8169_get_mac_version() http://lxr.linux.no/#linux+v3.1.10/drivers/net/r8169.c#L1724 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c8
--- Comment #8 from Benjamin Poirier
The difference between the two modules is that: r8168 is a binary-only driver provided by realtek
correction: the source for r8168 is in fact available. But it is an out of tree driver, it is not supported by the kernel community and it is not supported by SUSE (afaik). Thank you Martin for pointing this out. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=702205
https://bugzilla.novell.com/show_bug.cgi?id=702205#c9
Benjamin Poirier
participants (1)
-
bugzilla_noreply@novell.com