[Bug 1179971] New: iwlwifi - "Microcode SW error detected"
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 Bug ID: 1179971 Summary: iwlwifi - "Microcode SW error detected" Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: Other OS: Other Status: NEW Severity: Major Priority: P5 - None Component: Kernel Assignee: kernel-bugs@opensuse.org Reporter: sb56637@gmail.com QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- Hi, I'm experiencing relatively frequent issues with my Intel WiFI connection dropping, at least once a day. (Lenovo T530 laptop) Just now I had it drop about 5 times in a row during a Zoom meeting. By "drop" I mean that NetworkManager shows no connection for about 10 seconds, and it has to re-associate. Of course multiple things could be going wrong given that it's WiFI, but my router is a high quality Mikrotik device giving a strong signal to my laptop, and I don't have major problems with WiFI spectrum congestion in this area. Also, the router log is showing corresponding events like "disconnected, ok", whereas in the past when I had problems with the actual connection quality it would throw an error indicating there was "major data loss". I also found some nasty looking errors with traces in dmesg that seem to correspond to the timeframes of the episondes, although I can't be sure because I put my machine to sleep every night and therefore the time stamps are wrong. I pulled the most interesting bits out and changed the MAC addresses. I should also mention that I usually have a Bluetooth headphone connected. Thanks for the help! -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c1 --- Comment #1 from S. B. <sb56637@gmail.com> --- Created attachment 844411 --> http://bugzilla.opensuse.org/attachment.cgi?id=844411&action=edit relevant dmesg -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c2 --- Comment #2 from S. B. <sb56637@gmail.com> --- Created attachment 844412 --> http://bugzilla.opensuse.org/attachment.cgi?id=844412&action=edit hwinfo - Thinkpad T530 all Intel hardware -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c3 Takashi Iwai <tiwai@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |tiwai@suse.com --- Comment #3 from Takashi Iwai <tiwai@suse.com> --- I'm afraid that this is a kind of long-standing problem the upstream has little cared. As shown in the dmesg output, this is likely the firmware problem, and Intel won't provide the new firmware for old chips. While we should report this again to the upstream bug tracker (bugzilla.kernel.org), you may try some iwlwifi module options that are related with the power management. Also, downgrading the firmware might be worth to try, too. It's iwlwifi-6000g2a, and currently iwlwifi-6000g2a-6.ucode is used. So try to remove /lib/firmware/iwlwifi-6000g2a-6.ucode (or ucode.xz) file and reboot. (Maybe safer to move to another place or rename the file instead of removing it, so that you can recover later, too). -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c4 --- Comment #4 from S. B. <sb56637@gmail.com> --- Hi Takashi, thanks for your extremely helpful and polite response as always. I think I'll try renaming the iwlwifi-6000g2a-6.ucode.xz for now. Hopefully I'll notice a difference after a few long days of work, this bug is just random enough so as to make troubleshooting difficult, but frequent enough to seriously interrupt important work. As for power management tweaks, I also suspected that could be a factor, as I use TLP which includes pretty aggressive power management tweaks across the board. Would you recommend anything in particular? I've seen recommendations for:
options iwlwifi 11n_disable=1 swcrypto=0 power_save=0 #also: 11n_disable=8 options iwlmvm power_scheme=1 options iwlwifi uapsd_disable=1
As well as:
options iwlwifi swcrypto=0 options iwlwifi power_save=0 options iwlmvm power_scheme=1 options iwlwifi uapsd_disable=1
Or simply:
options iwlwifi power_save=0
I'm not sure about the 11n_disable option codes or if that has anything to do with this? There are indeed quite a few reports of this on bugzilla.kernel.org, should I add on to an existing report or open a new one? - https://bugzilla.kernel.org/show_bug.cgi?id=207409 - https://bugzilla.kernel.org/show_bug.cgi?id=205387 - https://bugzilla.kernel.org/show_bug.cgi?id=208425 -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c5 --- Comment #5 from Takashi Iwai <tiwai@suse.com> --- (In reply to S. B. from comment #4)
As for power management tweaks, I also suspected that could be a factor, as I use TLP which includes pretty aggressive power management tweaks across the board. Would you recommend anything in particular? I've seen recommendations for:
options iwlwifi 11n_disable=1 swcrypto=0 power_save=0 #also: 11n_disable=8 options iwlmvm power_scheme=1 options iwlwifi uapsd_disable=1
As well as:
options iwlwifi swcrypto=0 options iwlwifi power_save=0 options iwlmvm power_scheme=1 options iwlwifi uapsd_disable=1
Or simply:
options iwlwifi power_save=0
I'm not sure about the 11n_disable option codes or if that has anything to do with this?
Honestly speaking, I have no idea what really matters. I would try the bottom line first, i.e. disable all suspicious ones.
There are indeed quite a few reports of this on bugzilla.kernel.org, should I add on to an existing report or open a new one? - https://bugzilla.kernel.org/show_bug.cgi?id=207409 - https://bugzilla.kernel.org/show_bug.cgi?id=205387 - https://bugzilla.kernel.org/show_bug.cgi?id=208425
It's fine to create a new report unless you find the exactly same bug with the same hardware (Lenovo T530). You can join to other bugs meanwhile, and close yours later as a duplicate. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c6 H�ctor Sanju�n <code@hector.link> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |code@hector.link --- Comment #6 from H�ctor Sanju�n <code@hector.link> --- I also have this problem, also with a Lenovo (X1 Carbon gen. 7), and also with a Mikrotik AP. But for me this is new or at least it is happening way more often now (every few minutes), particularly on when network is very busy. Apart from the "Microcode SW error detected. Restarting 0x0." I also see quite a bit of: "kernel: iwlwifi 0000:00:14.3: Unhandled alg: 0xc040071b" I will try the firmware downgrade and report back. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c7 --- Comment #7 from S. B. <sb56637@gmail.com> --- I seem to have fixed that particular error with the following in /etc/modprobe.d/iwlwifi.conf
options iwlwifi bt_coex_active=0 swcrypto=1 11n_disable=8
It appears that the issue was also being complicated by some wireless interference causing "extensive data loss" errors in RouterOS, but I managed to switch to a more free channel. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c8 --- Comment #8 from H�ctor Sanju�n <code@hector.link> ---
I seem to have fixed that particular error with the following in /etc/modprobe.d/iwlwifi.conf
options iwlwifi bt_coex_active=0 swcrypto=1 11n_disable=8
Does not seem to help. I did not enable bt_coex_active, as there is no BT device nearby.
It appears that the issue was also being complicated by some wireless interference causing "extensive data loss" errors in RouterOS, but I managed to switch to a more free channel.
I am on a very clean spectrum. -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c9 --- Comment #9 from H�ctor Sanju�n <code@hector.link> --- With 5.10.16, along with a firmware update (kernel-firmware-iwlwifi-20210208-1.1), I don't see issues anymore. I am connected on 5Ghz (the issue before only appeared on this band), I have removed all iwlwifi options from modprobe and I have not seen any errors in a while, where before I'd suffer them every one or two minutes. The last kernel release has indeed included fixes to iwlwifi: https://lwn.net/Articles/846116/ -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c10 --- Comment #10 from H�ctor Sanju�n <code@hector.link> --- (In reply to H�ctor Sanju�n from comment #9)
With 5.10.16, along with a firmware update (kernel-firmware-iwlwifi-20210208-1.1), I don't see issues anymore.
I am connected on 5Ghz (the issue before only appeared on this band), I have removed all iwlwifi options from modprobe and I have not seen any errors in a while, where before I'd suffer them every one or two minutes.
The last kernel release has indeed included fixes to iwlwifi:
Woa, scratch all that. I saw the issue again. However it appears to have manifested after I disconnected the laptop from the AC, so perhaps it might be related to powersaving behaviour. I'll have to re-test with the powersave-related options. :( -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1179971 http://bugzilla.opensuse.org/show_bug.cgi?id=1179971#c11 --- Comment #11 from S. B. <sb56637@gmail.com> --- @H�ctor Sanju�n -- Interesting... I'm going to be trying the opposite of what you did. I disabled my iwlwifi.conf tweak from comment #7 , and I'm now using a new Mikrotik router with a 5Ghz network, whereas when I opened this bug report I was having major issues with my only 2.4Ghz network. I'll report back if anything gets better or worse, but this bug is frustratingly random as you've noted. -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@suse.com