[Bug 798287] New: r8169 driver after some time doesn't receive packets
https://bugzilla.novell.com/show_bug.cgi?id=798287 https://bugzilla.novell.com/show_bug.cgi?id=798287#c0 Summary: r8169 driver after some time doesn't receive packets Classification: openSUSE Product: openSUSE 12.2 Version: Final Platform: x86-64 OS/Version: openSUSE 12.2 Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: Ralf.Friedl@online.de QAContact: qa-bugs@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (Windows; U; Win98; de-AT; rv:1.8.1.24) Gecko/20100228 SeaMonkey/1.1.19 I have a mainboard ASUS M5A97 EVO R2.0. Onboard it has a Realtek 8168 chip. This chip is recognized by the r8169 driver. After some days of normal/low use it happens that the driver doesn't receive packets. I can verify with tcpdump on another computer that it sends packets, but it doesn't receive the answers. This applies to all packets, ARP, ICMP, TCP, UDP. This can be fixed with reloading the driver (rmmod r8169; modprobe r8169). Then it would work again for some days. Yesterday I wanted to transfer a Suse ISO file to another computer also on gigabit Ethernet. Here the same effect happens always after 5-30 seconds. Reloading the driver again helps, but only for another few seconds. After searching the internet, I found recommendations to a r8168 driver from Realtek, see http://www.realtek.com.tw/downloads/downloadsView.aspx?Langid=1&PNid=13&PFid=5&Level=5&Conn=4&DownTypeID=3&GetDown=false I compiled and loaded this driver. With this driver I was able to transfer the ISO file and didn't have any problems since then. So whatever is different with this driver seems to fix the problem. Reproducible: Always Steps to Reproduce: 1. Use ASUS M5A97 EVO R2.0 (might also affect other boards / chip revisions) 2. Transfer large file over network interface Actual Results: After a few seconds, the driver doesn't receive any packets. Expected Results: File is transferred normally. Output from lspci: # lspci -nnQ -s 02:00.0 -v 02:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8111/8168 PCI Express Gigabit Ethernet controller [10ec:8168] (rev 09) Subsystem: ASUSTeK Computer Inc. P8H77-I Motherboard [1043:8505] Flags: bus master, fast devsel, latency 0, IRQ 90 I/O ports at e000 [size=256] Memory at fea04000 (64-bit, prefetchable) [size=4K] Memory at fea00000 (64-bit, prefetchable) [size=16K] Capabilities: [40] Power Management version 3 Capabilities: [50] MSI: Enable+ Count=1/1 Maskable- 64bit+ Capabilities: [70] Express Endpoint, MSI 01 Capabilities: [b0] MSI-X: Enable- Count=4 Masked- Capabilities: [d0] Vital Product Data Capabilities: [100] Advanced Error Reporting Capabilities: [140] Virtual Channel By the way, the option -Q to lspci displays RTL8111/8168 while the local file says RTL8111/8168B. The Realtek r8168 driver gives this output: r8168 Gigabit Ethernet driver 8.035.00-NAPI loaded r8168 0000:02:00.0: irq 90 for MSI/MSI-X eth%d: RTL8168F/8111F at 0xffffc90000030000, 50:46:5d:xx:xx:xx, IRQ 90 eth0: Identified chip type is 'RTL8168F/8111F'. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=798287 https://bugzilla.novell.com/show_bug.cgi?id=798287#c1 Ralf Friedl <Ralf.Friedl@online.de> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |Ralf.Friedl@online.de --- Comment #1 from Ralf Friedl <Ralf.Friedl@online.de> 2013-02-19 18:07:10 UTC --- Additional information: I just had the same problem with an ASUS M5A97 LE R2.0 board. It happened during online update after the initial install. The speed and transfer volume here is much lower then with the ISO image mentioned before. This online update failed several times before it could download all updates. The initial install was a network install from the 12.2 x64 ISO DVD. This probably transferred more then the online update and had no problem. It is not necessary to reload the driver, it is sufficient to rebind it: echo 0000:02:00.0 > /sys/bus/pci/drivers/r8169/unbind sleep 1 echo 0000:02:00.0 > /sys/bus/pci/drivers/r8169/bind The 0000:02:00.0 is the PCI Id of the network card. Or course this is not faster then reloading the driver. The sleep is necessary to get the interface configured, I haven't tried lower values. The r8168 module is much larger: r8168 252604 0 r8169 61212 0 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=798287 https://bugzilla.novell.com/show_bug.cgi?id=798287#c2 --- Comment #2 from Ralf Friedl <Ralf.Friedl@online.de> 2013-02-22 15:00:40 UTC --- Additional information: I got the advice to try a 3.8 kernel, so I installed kernel 3.8.0-2.1 from opensuse and transferred 147GB in both the send and receive direction without problems, so it seems the issue has been fixed with the newer kernel. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=798287 https://bugzilla.novell.com/show_bug.cgi?id=798287#c Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel-maintainers@forge.pr |bpoirier@suse.com |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=798287 https://bugzilla.novell.com/show_bug.cgi?id=798287#c3 --- Comment #3 from Ralf Friedl <Ralf.Friedl@online.de> 2013-07-22 09:05:01 UTC --- Additional information: The new kernel doesn't solve the problem completely, but it is much better. I can no longer provoke the error by transferring large amounts of data, but the error does occur, rarely. Sometimes even when there is almost no activity on the network. At the moment I have a script with a loop that does a ping to another host and if it doesn't receive an answer, it uses the unbind/bind commands from comment 1. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=798287 https://bugzilla.novell.com/show_bug.cgi?id=798287#c4 --- Comment #4 from Ralf Friedl <Ralf.Friedl@online.de> 2014-01-23 17:17:17 UTC --- The problem still exists with kernel 3.8.2-1.1.x86_64 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=798287 https://bugzilla.novell.com/show_bug.cgi?id=798287#c7 Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |WONTFIX --- Comment #7 from Jeff Mahoney <jeffm@suse.com> 2014-08-08 16:29:23 EDT --- This report is against openSUSE 12.2 which is no longer under maintenance. If you are able to reproduce it with openSUSE 13.1 or openSUSE Factory, please re-open and reset the the "Product" field to the appropriate release. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com