[Bug 683671] New: vlans cause softirq overload
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c0 Summary: vlans cause softirq overload Classification: openSUSE Product: openSUSE 11.4 Version: Final Platform: i686 OS/Version: openSUSE 11.4 Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: linux@quartz-net.co.uk QAContact: qa@suse.de Found By: --- Blocker: --- Created an attachment (id=422109) --> (http://bugzilla.novell.com/attachment.cgi?id=422109) collection of config files, system resports, pcap User-Agent: Mozilla/5.0 (Windows NT 5.1; rv:2.0) Gecko/20100101 Firefox/4.0 I have a Gigabyte GA-D525TUD Motherboard (Atom 525) on which I have now installed the final release of opensuse 11.4 (updated using the standard update repositories *not* Tumbleweed). I am building this as a new machine to replace an existing server. After weeks of normal operation using this install (and previous release candidates) where my network connection was configured to be a simple DHCP client (with no vlan), I reconfigured the interface to remove the dhcp and added the 3 vlans I use on my server machine. When I connected this to the tagged port of my managed switch, this machine became unresponsive and I noticed that the softirq count had reached very high levels. Disconnecting the network had no effect, the only recovery was to power cycle (the machine refused to shutdown). This state is now 100% repeatable. I have repeated the steps below several times all with the same result. Also, when I leave the vlans configured, enable dhcp on the interface and connect to a network with no vlan packets, then the machine happily ran all day with no problems (in the same fashion as it did while I was trying out the release candidates before adding the vlan interfaces). Uname info is: Linux ruby 2.6.37.1-1.2-desktop #1 SMP PREEMPT 2011-02-21 10:34:10 +0100 i686 i686 i386 GNU/Linux I've attached a tar/gzip containing: * My network configuration from /etc/sysconfig/network * dmesg report * /var/log/messages * hwinfo --all * lspci -vvvn * PNG files of Munin plots of CPU stats for day & week * contents of /proc/interrupts while system is running ok (ints-good.txt) and when failed (ints-bad.txt) * contents of /proc/softirqs when system failed * pcap capture started just before plugging in network (and system then fails) * rrd dump of Munin capture of softirqs count during failures The PNG plots show periods of normal operation where I left the network disconnected (or was running dhcp on the interface with no vlans on the network). The ramp up of softirqs starts almost immediately after connecting to a network with vlans. Reproducible: Always Steps to Reproduce: 1. Install opensuse 11.4 on GA-D525TUD motherboard 2. Configure interface with 3 vlan interfaces 3. Power up with network cable disconnected 4. Plug in network cable 5. Wait for arp, or do any ping -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c1 --- Comment #1 from Steve Price <linux@quartz-net.co.uk> 2011-03-31 10:52:33 UTC --- I've just built Ubuntu Server LTE 10.04 on a new disk that I swapped into this system. It runs vlans with no softirq problems. Linux kernel for Ubuntu Server LTE is: 2.6.32-28-generic-pae It's also worth mentioning that previously I had tried updating 11.4 using Tumbleweed, and I saw exactly the same problems with 2.6.38 as I'm currently seeing with 2.6.37. I then re-installed 11.4 from scratch (including removing all disk partitions) to get where I am today. I have also tried running with init 3 - this shows the same symptoms but takes slightly longer for the softirq count to ramp up. I also tried reducing the configured vlan interfaces down to just 1 - this still gives the same results. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c2 --- Comment #2 from Steve Price <linux@quartz-net.co.uk> 2011-04-05 08:57:35 UTC --- I've been experimenting with the failsafe boot, trying to remove kernel options until the problem reappears. With the default failsafe options I see no softirq problems. It seems that the softirq problem is prevented by the "nosmp" option, adding/removing the other options appear to have no effect. Also, I have installed Ubuntu Server 11.04 alpha3 (which uses kernel 2.6.38) and that runs fine with no softirq problems. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |jeffm@novell.com AssignedTo|kernel-maintainers@forge.pr |jbohac@novell.com |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c Benjamin Poirier <bpoirier@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |bpoirier@suse.com AssignedTo|jbohac@suse.com |bpoirier@suse.com -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c3 Benjamin Poirier <bpoirier@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |FIXED --- Comment #3 from Benjamin Poirier <bpoirier@suse.com> 2012-02-10 18:29:52 UTC --- Thank you for your detailed bug report and sorry for not getting to it earlier. I believe the issue you were experiencing is fixed. It was caused by a lock/unlock imbalance in the vlan receive path added by the Swap over NFS patches in the SUSE kernel. You can test the most recent kernel-desktop from the OBS project Kernel:openSUSE-11.4 here: http://download.opensuse.org/repositories/Kernel:/openSUSE-11.4/openSUSE_11.... Your feedback is welcome. Please reopen the bug if your issue is not fixed. --- Patch-mainline: no, part of swap over nfs SLES10 SP4 2.6.16.60 unaffected, does not have swap over nfs SLE11 SP1 2.6.32.49 unaffected SLE11 SP2 3.0.12 unaffected openSUSE 11.3 2.6.34.10 unaffected openSUSE 11.4 2.6.37.6 refreshed patches.suse/SoN-22-netvm.patch openSUSE 12.1 3.1.4 unaffected, does not have swap over nfs -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c4 Benjamin Poirier <bpoirier@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |mt@suse.com --- Comment #4 from Benjamin Poirier <bpoirier@suse.com> 2012-02-10 18:38:54 UTC --- *** Bug 679685 has been marked as a duplicate of this bug. *** http://bugzilla.novell.com/show_bug.cgi?id=679685 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c5 Benjamin Poirier <bpoirier@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |alex.wilms@adminguru.org --- Comment #5 from Benjamin Poirier <bpoirier@suse.com> 2012-02-10 18:41:07 UTC --- *** Bug 689261 has been marked as a duplicate of this bug. *** http://bugzilla.novell.com/show_bug.cgi?id=689261 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c Swamp Workflow Management <swamp@suse.de> changed: What |Removed |Added ---------------------------------------------------------------------------- Status Whiteboard| |obs:running:553:low -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c Swamp Workflow Management <swamp@suse.de> changed: What |Removed |Added ---------------------------------------------------------------------------- Status Whiteboard|obs:running:553:low |obs:running:553:moderate -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c6 --- Comment #6 from Swamp Workflow Management <swamp@suse.de> 2012-06-28 08:10:13 UTC --- openSUSE-SU-2012:0799-1: An update that solves 25 vulnerabilities and has 22 fixes is now available. Category: security (moderate) Bug References: 466279,651219,653260,655696,676204,681186,681639,683671,689860,703410,707332,711941,713430,714455,717209,717749,721366,726045,726600,729247,730118,731673,732908,737624,738644,740448,740703,740745,744658,745832,746980,747038,747660,748859,749569,750079,750959,756203,756840,757278,758243,758260,758813,759545,760902,765102,765320 CVE References: CVE-2009-4020,CVE-2010-3873,CVE-2010-4164,CVE-2010-4249,CVE-2011-1083,CVE-2011-1173,CVE-2011-2517,CVE-2011-2700,CVE-2011-2909,CVE-2011-2928,CVE-2011-3619,CVE-2011-3638,CVE-2011-4077,CVE-2011-4086,CVE-2011-4330,CVE-2012-0038,CVE-2012-0044,CVE-2012-0207,CVE-2012-1090,CVE-2012-1097,CVE-2012-1146,CVE-2012-2119,CVE-2012-2123,CVE-2012-2136,CVE-2012-2663 Sources used: openSUSE 11.4 (src): kernel-docs-2.6.37.6-0.20.2, kernel-source-2.6.37.6-0.20.1, kernel-syms-2.6.37.6-0.20.1, preload-1.2-6.17.1 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c Swamp Workflow Management <swamp@suse.de> changed: What |Removed |Added ---------------------------------------------------------------------------- Status Whiteboard|obs:running:553:moderate |obs:running:553:moderate | |obs:running:1049:moderate -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=683671 https://bugzilla.novell.com/show_bug.cgi?id=683671#c7 --- Comment #7 from Swamp Workflow Management <swamp@suse.de> 2012-11-05 09:11:02 UTC --- openSUSE-SU-2012:1439-1: An update that solves 26 vulnerabilities and has 28 fixes is now available. Category: security (moderate) Bug References: 466279,651219,653260,655696,676204,681186,681639,683671,689860,703410,707332,711941,713430,714455,717209,717749,721366,726045,726600,729247,730118,731673,732908,734056,737624,738644,740448,740703,740745,744658,745832,746980,747038,747660,748859,749569,750079,750959,755546,756203,756840,757278,758243,758260,758813,759545,760902,765102,765320,769408,769784,769896,774285,781134 CVE References: CVE-2009-4020,CVE-2010-3873,CVE-2010-4164,CVE-2010-4249,CVE-2011-1083,CVE-2011-1173,CVE-2011-2517,CVE-2011-2700,CVE-2011-2909,CVE-2011-2928,CVE-2011-3619,CVE-2011-3638,CVE-2011-4077,CVE-2011-4086,CVE-2011-4110,CVE-2011-4330,CVE-2012-0038,CVE-2012-0044,CVE-2012-0207,CVE-2012-1090,CVE-2012-1097,CVE-2012-1146,CVE-2012-2119,CVE-2012-2123,CVE-2012-2136,CVE-2012-2663 Sources used: openSUSE 11.4 (src): kernel-docs-2.6.37.6-24.2, kernel-source-2.6.37.6-24.1, kernel-syms-2.6.37.6-24.1, preload-1.2-6.19.1 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
http://bugzilla.novell.com/show_bug.cgi?id=683671 Swamp Workflow Management <swamp@suse.de> changed: What |Removed |Added ---------------------------------------------------------------------------- Whiteboard|obs:running:553:moderate |obs:running:553:moderate |obs:running:1049:moderate | -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.novell.com/show_bug.cgi?id=683671 Swamp Workflow Management <swamp@suse.de> changed: What |Removed |Added ---------------------------------------------------------------------------- Whiteboard|obs:running:553:moderate | -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com