[Bug 472833] New: ATA errors (soft resetting link)
https://bugzilla.novell.com/show_bug.cgi?id=472833 Summary: ATA errors (soft resetting link) Classification: openSUSE Product: openSUSE 10.3 Version: Final Platform: Other OS/Version: Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: noelamac@gmail.com QAContact: kernel-maintainers@forge.provo.novell.com Found By: --- After the last kernel update, I am getting the following ata errors on /var/log/messages: *** Feb 5 08:24:08 stthpc kernel: ata2.00: failed to IDENTIFY (I/O error, err_mask=0x1) Feb 5 08:24:08 stthpc kernel: ata2.00: revalidation failed (errno=-5) Feb 5 08:24:08 stthpc kernel: ata2: failed to recover some devices, retrying in 5 secs Feb 5 08:24:13 stthpc kernel: ata2: soft resetting link Feb 5 08:24:13 stthpc kernel: ata2.00: configured for UDMA/100 Feb 5 08:24:13 stthpc kernel: ata2: EH pending after completion, repeating EH (cnt=4) Feb 5 08:24:13 stthpc kernel: ata2: EH complete Feb 5 08:24:17 stthpc kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 Feb 5 08:24:17 stthpc kernel: ata2.00: cmd a0/00:00:00:00:20/00:00:00:00:00/a0 tag 0 cdb 0x0 data 0 Feb 5 08:24:17 stthpc kernel: res 51/20:03:00:00:00/00:00:00:00:00/a0 Emask 0x1 (device error) Feb 5 08:24:17 stthpc kernel: ata2.00: failed to IDENTIFY (I/O error, err_mask=0x1) Feb 5 08:24:17 stthpc kernel: ata2.00: revalidation failed (errno=-5) Feb 5 08:24:17 stthpc kernel: ata2: failed to recover some devices, retrying in 5 secs Feb 5 08:24:22 stthpc kernel: ata2: soft resetting link Feb 5 08:24:23 stthpc kernel: ata2.00: configured for UDMA/100 Feb 5 08:24:23 stthpc kernel: ata2: EH pending after completion, repeating EH (cnt=4) Feb 5 08:24:23 stthpc kernel: ata2: EH complete Feb 5 08:25:07 stthpc kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 Feb 5 08:25:07 stthpc kernel: ata2.00: cmd a0/00:00:00:00:20/00:00:00:00:00/a0 tag 0 cdb 0x43 data 12 in Feb 5 08:25:07 stthpc kernel: res 51/20:03:00:00:00/00:00:00:00:00/a0 Emask 0x1 (device error) *** AFAIK, there is no ide / ata port on this computer, only 4 SATA ports (even dvd drive is sata). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=472833
User gregkh@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c1
Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=472833
User noelamac@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c2
Camaleon --
Does this also happen on the 11.1 release?
I guess no. At least I cannot see such messages on a virtualized opensuse 11.1 installed on same hardware. In fact, I have checked several 10.3 boxes, but only one machine is logging these warnings. May I be worried? Can be a real hardware problem? :-? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=472833
Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=472833
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c3
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=472833
User noelamac@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c4
Camaleon --
Can you please test 11.1? Testing live CD should be enough.
Sorry for the delay :-) Yes, opensuse LiveCD 11.1 also shows the same logs... I am attaching /var/log/boot.msg and /var/log/messages for your review. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=472833
User noelamac@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c5
--- Comment #5 from Camaleon --
https://bugzilla.novell.com/show_bug.cgi?id=472833
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c6
Tejun Heo
https://bugzilla.novell.com/show_bug.cgi?id=472833
User noelamac@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c7
Camaleon --
You're constantly getting PHY errors. Can you please try to connect the device to a different using a different cable and put it on a different power connector?
You were rigth: I'm cured :-) I opened the case and checked the power cable of SATA dvd drive. It looked a bit "slack" so I disconnected and reconnected again (same cable). I also checked SATA cable, which looked o.k. And just rebooted. And no more "soft resetting link" :-) I think you can close this as "resolved / invalid" as clearly kernel was making its job in detecting that something wrong happened on that port. Anyway, for the record, I am attaching the new logs on opensuse 11.1 livecd. There are a bit errors on "boot.msg" but I guess they are related to some media bad sector (it's a cd-rw disk). OTOH, "messages" looks clean. On suse 10.3, currently installed on this system, /var/log/messages is back to normal logging again and dmesg shows nothing relevant. Well, sorry for the "false alarm" and thanks for all. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=472833
User noelamac@gmail.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c8
--- Comment #8 from Camaleon --
https://bugzilla.novell.com/show_bug.cgi?id=472833
Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=472833
User teheo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=472833#c9
Tejun Heo
participants (1)
-
bugzilla_noreply@novell.com