[Bug 1161754] New: [usb-c] System loses USB NIC in Docking Station at random Times
http://bugzilla.suse.com/show_bug.cgi?id=1161754 Bug ID: 1161754 Summary: [usb-c] System loses USB NIC in Docking Station at random Times Classification: openSUSE Product: openSUSE Distribution Version: Leap 15.1 Hardware: Other OS: Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel Assignee: oneukum@suse.com Reporter: eich@suse.com QA Contact: qa-bugs@suse.de Found By: Development Blocker: --- Created attachment 828223 --> http://bugzilla.suse.com/attachment.cgi?id=828223&action=edit Kernel log On a laptop (Dell Precision 5520) the 1GB NIC is contained in the docking station which is connected via USB-C. With kernel 4.12.14-lp151.28.36-default this device gets 'lost' at random times. 1. The kernel shows no obvious signs of the device getting lost. The lines like: usb 3-1.4: reset high-speed USB device number 6 using xhci_hcd don't seem to occur when the device gets lost. 2. The network device is no longer disappears from the lsusb output. 3. In the more annoying cases, the network device 'hangs': and 'ip addr' hangs in a 'sendto' to a netlink socket. In other cases, the device disappears from the 'ip addr' output, unplugging and plugging the docking station causes the devices to re-enumerated. NOTE: in the attached log, the re-docking happens after the 202556.480017 timestamp. The issue seems to start once the machine has been suspended and/or redocked after a boot. The issue did not occur on a Leap 42.3 kernel. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c1 --- Comment #1 from Egbert Eich <eich@suse.com> --- Created attachment 828224 --> http://bugzilla.suse.com/attachment.cgi?id=828224&action=edit lsusb output after device got lost -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c2 --- Comment #2 from Egbert Eich <eich@suse.com> --- Created attachment 828225 --> http://bugzilla.suse.com/attachment.cgi?id=828225&action=edit lsusb output for docked system with device still present -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c3 --- Comment #3 from Egbert Eich <eich@suse.com> --- Created attachment 828226 --> http://bugzilla.suse.com/attachment.cgi?id=828226&action=edit strace output from hanging 'ip addr' when system was not recoverable -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c4 Egbert Eich <eich@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |oneukum@suse.com Flags| |needinfo?(oneukum@suse.com) --- Comment #4 from Egbert Eich <eich@suse.com> --- Olly, please let me know what other data you need or what I should try! Any settings for additional logging by the kernel I should use? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c5 Oliver Neukum <oneukum@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |nsaenzjulienne@suse.com Flags|needinfo?(oneukum@suse.com) |needinfo?(nsaenzjulienne@su | |se.com) --- Comment #5 from Oliver Neukum <oneukum@suse.com> --- Nicolas, is this the docking station you fixed issues for? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c6 --- Comment #6 from Oliver Neukum <oneukum@suse.com> --- You can test disabling LPM usbcore.quirks=0bda:8153:gk,0424:5807:gk,1a40:0101:gk to be conservative also add usbcore.autosuspend=-1 -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c7 --- Comment #7 from Egbert Eich <eich@suse.com> --- (In reply to Oliver Neukum from comment #6)
You can test disabling LPM
usbcore.quirks=0bda:8153:gk,0424:5807:gk,1a40:0101:gk
Wow! Thanks! Do I need to 1a40:0101 as well? This is just an innocent USB2 hub that's connected to the docking station.
to be conservative also add
usbcore.autosuspend=-1
-- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c8 --- Comment #8 from Oliver Neukum <oneukum@suse.com> --- (In reply to Egbert Eich from comment #7)
Do I need to 1a40:0101 as well? This is just an innocent USB2 hub that's connected to the docking station.
Yes, sorry. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 Lubos Kocman <lubos.kocman@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Priority|P5 - None |P2 - High CC| |lubos.kocman@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 http://bugzilla.suse.com/show_bug.cgi?id=1161754#c11 --- Comment #11 from Peter Chadwick <pchadwick@suse.com> --- I switched to Tumbleweed and it all seems to be working now. Please let me know if there is info I can provide to isolate the problem. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.suse.com/show_bug.cgi?id=1161754 Shung-Hsi Yu <shung-hsi.yu@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |shung-hsi.yu@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 Vasilis Liaskovitis <vliaskovitis@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |vliaskovitis@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 https://bugzilla.suse.com/show_bug.cgi?id=1161754#c14 Oliver Neukum <oneukum@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |eich@suse.com Flags|needinfo?(nsaenzjulienne@su |needinfo?(eich@suse.com) |se.com) | --- Comment #14 from Oliver Neukum <oneukum@suse.com> --- We backported fixes for docking stations. Could you test KOTD? Does the issue persist? -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 https://bugzilla.suse.com/show_bug.cgi?id=1161754#c15 --- Comment #15 from Egbert Eich <eich@suse.com> --- Meanwhile I've updated to Leap 15.2, however, my kernel still has usbcore.quirks=0bda:8153:gk,0424:5807:gk usbcore.autosuspend=-1 splash=silent set. Since I've set these, the problem did not reoccur. However, my usage pattern has also change since Covid: I'm not traveling much any more. The problem seemed to have been related to docking/undocking and/or suspending the system. So, with this in mind, I'm not even sure the results of my tests will be conclusive - even if I remove usbcore.quirks=0bda:8153:gk,0424:5807:gk usbcore.autosuspend=-1. In any case, today is the last day for me this year, so this will have to wait till January. I'll leave the NEEDINFO set. -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 https://bugzilla.suse.com/show_bug.cgi?id=1161754#c16 Oliver Neukum <oneukum@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |IN_PROGRESS Flags|needinfo?(eich@suse.com) | --- Comment #16 from Oliver Neukum <oneukum@suse.com> --- https://git.kernel.org/pub/scm/linux/kernel/git/netdev/net.git/commit/?id=cb... This fix under testing in upstream is likely to fix teh issue. There is no need for testing until upstream has merged it and I backport it. ETA next week. -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 Vincent Untz <vuntz@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |vuntz@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 Mark Harvey <mark.harvey@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |mark.harvey@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 Roger Whittaker <roger.whittaker@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |roger.whittaker@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 https://bugzilla.suse.com/show_bug.cgi?id=1161754#c17 Oliver Neukum <oneukum@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Flags| |needinfo?(eich@suse.com) --- Comment #17 from Oliver Neukum <oneukum@suse.com> --- Please try the kernel at https://build.opensuse.org/project/monitor/home:oneukum:bsc1171677_20210223a -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 https://bugzilla.suse.com/show_bug.cgi?id=1161754#c18 --- Comment #18 from Oliver Neukum <oneukum@suse.com> --- Any progress? -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 https://bugzilla.suse.com/show_bug.cgi?id=1161754#c19 Egbert Eich <eich@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Flags| |needinfo?(oneukum@suse.com) --- Comment #19 from Egbert Eich <eich@suse.com> --- I will try to look at this over the weekend. I cannot shut down my laptop today as there is just too much stuff open. I still have these settings on the kernel command line: usbcore.quirks=0bda:8153:gk,0424:5807:gk usbcore.autosuspend=-1 I will remove them (ever since I've added these the problem did not occur any more), install and boot the kernel above and do a few docking/undocking cycles (it appears the problem appeared once the laptop had been undocked). Unfortunately, the issue then showed at random times, but never immediately. So I will try to run with this kernel for a week and get back to you if anything happens. Is this ok? -- You are receiving this mail because: You are on the CC list for the bug.
https://bugzilla.suse.com/show_bug.cgi?id=1161754 https://bugzilla.suse.com/show_bug.cgi?id=1161754#c20 Egbert Eich <eich@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Flags|needinfo?(oneukum@suse.com) | --- Comment #20 from Egbert Eich <eich@suse.com> --- It took a while for me to get conclusive results, as I was not certain if my observations were fully correct, however, my recent observation showed that the test kernel did not completely do away with the problem. I've now observed twice in a row that the Realtek USB NIC has suddenly disappeared while it was in use. It seems like the problem gets triggered as soon as the system has been suspended at least once since the last boot. Please let me know if you want me to do further tests. I will reinstate the quirk settings from comment #6. Sorry for the delay. -- You are receiving this mail because: You are on the CC list for the bug.
participants (2)
-
bugzilla_noreply@novell.com
-
bugzilla_noreply@suse.com