[Bug 642857] New: USB3.0 / usb-storage general protection fault after drive went to sleep
https://bugzilla.novell.com/show_bug.cgi?id=642857 https://bugzilla.novell.com/show_bug.cgi?id=642857#c0 Summary: USB3.0 / usb-storage general protection fault after drive went to sleep Classification: openSUSE Product: openSUSE 11.3 Version: Final Platform: x86-64 OS/Version: Other Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: lmb@novell.com QAContact: qa@suse.de Found By: Development Blocker: --- I have a USB 3.0 bridge with a USB 3.0 sata bridge. The drive went to sleep; the next access woke it up, but the kernel crashed with a general protection fault as below. Host system is a x200s ThinkPad. Details of the hardware components: 05:00.0 USB Controller: NEC Corporation uPD720200 USB 3.0 Host Controller (rev 03) (prog-if 30) Subsystem: Melco Inc Device 0242 Flags: bus master, fast devsel, latency 0, IRQ 19 Memory at f1f00000 (64-bit, non-prefetchable) [size=8K] Capabilities: [50] Power Management version 3 Capabilities: [70] MSI: Enable- Count=1/8 Maskable- 64bit+ Capabilities: [90] MSI-X: Enable- Count=8 Masked- Capabilities: [a0] Express Endpoint, MSI 00 Capabilities: [100] Advanced Error Reporting Capabilities: [140] Device Serial Number 00-00-00-00-00-62-50-00 Capabilities: [150] #18 Kernel driver in use: xhci_hcd Bus 009 Device 002: ID 1759:5002 LucidPort Technology, Inc. Device Descriptor: bLength 18 bDescriptorType 1 bcdUSB 3.00 bDeviceClass 0 (Defined at Interface level) bDeviceSubClass 0 bDeviceProtocol 0 bMaxPacketSize0 9 idVendor 0x1759 LucidPort Technology, Inc. idProduct 0x5002 bcdDevice 21.49 iManufacturer 1 USB 3.0 iProduct 2 MassStorage Device iSerial 3 WD-WCAU45227636 bNumConfigurations 1 Configuration Descriptor: bLength 9 bDescriptorType 2 wTotalLength 44 bNumInterfaces 1 bConfigurationValue 1 iConfiguration 0 bmAttributes 0xc0 Self Powered MaxPower 0mA Interface Descriptor: bLength 9 bDescriptorType 4 bInterfaceNumber 0 bAlternateSetting 0 bNumEndpoints 2 bInterfaceClass 8 Mass Storage bInterfaceSubClass 6 SCSI bInterfaceProtocol 80 Bulk (Zip) iInterface 0 Endpoint Descriptor: bLength 7 bDescriptorType 5 bEndpointAddress 0x01 EP 1 OUT bmAttributes 2 Transfer Type Bulk Synch Type None Usage Type Data wMaxPacketSize 0x0400 1x 1024 bytes bInterval 0 ** UNRECOGNIZED: 06 30 07 00 00 00 Endpoint Descriptor: bLength 7 bDescriptorType 5 bEndpointAddress 0x81 EP 1 IN bmAttributes 2 Transfer Type Bulk Synch Type None Usage Type Data wMaxPacketSize 0x0400 1x 1024 bytes bInterval 0 ** UNRECOGNIZED: 06 30 0f 00 00 00 Device Qualifier (for other device speed): bLength 10 bDescriptorType 6 bcdUSB 2.10 bDeviceClass 0 (Defined at Interface level) bDeviceSubClass 0 bDeviceProtocol 0 bMaxPacketSize0 64 bNumConfigurations 1 Device Status: 0x0001 Self Powered Sep 30 16:44:10 hermes kernel: [221295.135385] xhci_hcd 0000:05:00.0: WARN: Stalled endpoint Sep 30 16:44:10 hermes kernel: [221295.138443] xhci_hcd 0000:05:00.0: WARN: Stalled endpoint Sep 30 16:44:10 hermes kernel: [221295.140061] xhci_hcd 0000:05:00.0: WARN: Stalled endpoint Sep 30 16:44:10 hermes kernel: [221295.141688] xhci_hcd 0000:05:00.0: WARN: Stalled endpoint Sep 30 16:44:10 hermes kernel: [221295.143311] xhci_hcd 0000:05:00.0: WARN: Stalled endpoint Sep 30 16:44:10 hermes kernel: [221295.144944] xhci_hcd 0000:05:00.0: WARN: Stalled endpoint Sep 30 16:44:41 hermes kernel: [221325.807024] usb 9-1: reset SuperSpeed USB device using xhci_hcd and address 2 Sep 30 16:44:41 hermes kernel: [221325.818703] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.819305] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.819931] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.820551] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.821050] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.821181] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.821930] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.822046] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.822680] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.822803] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.823558] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.823680] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.824804] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.825304] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.825682] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.826045] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.826426] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.826795] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.827174] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.827549] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.827927] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.828307] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.828677] xhci_hcd 0000:05:00.0: ERROR Transfer event TRB DMA ptr not part of current TD Sep 30 16:44:41 hermes kernel: [221325.830598] general protection fault: 0000 [#1] PREEMPT SMP Sep 30 16:44:41 hermes kernel: [221325.830603] last sysfs file: /sys/devices/system/cpu/cpu1/cache/index2/shared_cpu_map Sep 30 16:44:41 hermes kernel: [221325.830606] CPU 0 Sep 30 16:44:41 hermes kernel: [221325.830608] Modules linked in: xfs exportfs raid1 xhci_hcd pciehp pci_hotplug snd_usb_audio snd_usb_lib snd_rawmidi usblp nls_iso8859_1 nls_cp437 vfat fat iptable_filter ip_tables x_tables joydev autofs4 af_packet rndis_host cdc_ether u sbnet tun usb_storage rfcomm sco bridge stp llc bnep l2cap snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device edd cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq mperf fuse loop arc4 snd_hda_codec_conexant ecb snd_hda_intel snd_hda_codec snd_hwdep iwl agn tpm_tis snd_pcm iwlcore tpm mac80211 thinkpad_acpi snd_timer cfg80211 tpm_bios btusb battery ac bluetooth sg pcspkr snd rfkill wmi soundcore iTCO_wdt i2c_i801 snd_page_alloc iTCO_vendor_support e1000e ext4 jbd2 crc16 linear i915 drm_kms_helper drm i2c_algo_bit sd_mod intel_agp button video dm_snapshot dm_mod fan processor ata_generic ahci libata scsi_mod thermal thermal_sys [last unloaded: preloadtrace] Sep 30 16:44:41 hermes kernel: [221325.830687] Sep 30 16:44:41 hermes kernel: [221325.830691] Pid: 9094, comm: usb-storage Not tainted 2.6.34.7-0.3-desktop #1 74705GG/74705GG Sep 30 16:44:41 hermes kernel: [221325.830694] RIP: 0010:[<ffffffff8131d95b>] [<ffffffff8131d95b>] descriptors_changed+0xcb/0x320 Sep 30 16:44:41 hermes kernel: [221325.830703] RSP: 0018:ffff880031823ca0 EFLAGS: 00010246 Sep 30 16:44:41 hermes kernel: [221325.830706] RAX: ffff880029c97a20 RBX: 0000000000000000 RCX: 000000000000002c Sep 30 16:44:41 hermes kernel: [221325.830709] RDX: 000000000000002c RSI: ffff880044bd2440 RDI: 0900000003000112 Sep 30 16:44:41 hermes kernel: [221325.830712] RBP: ffff88006ccfb800 R08: ffff880031822000 R09: dead000000200200 Sep 30 16:44:41 hermes kernel: [221325.830714] R10: dead000000100100 R11: 0000000000000001 R12: 0000000000000000 Sep 30 16:44:41 hermes kernel: [221325.830717] R13: 000000000000002c R14: ffff880044bd2440 R15: 0000000000000015 Sep 30 16:44:41 hermes kernel: [221325.830721] FS: 0000000000000000(0000) GS:ffff880001e00000(0000) knlGS:0000000000000000 Sep 30 16:44:41 hermes kernel: [221325.830724] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b Sep 30 16:44:41 hermes kernel: [221325.830726] CR2: 00007f45fe9ad000 CR3: 0000000020a65000 CR4: 00000000000006f0 Sep 30 16:44:41 hermes kernel: [221325.830729] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Sep 30 16:44:41 hermes kernel: [221325.830732] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Sep 30 16:44:41 hermes kernel: [221325.830735] Process usb-storage (pid: 9094, threadinfo ffff880031822000, task ffff880031820200) Sep 30 16:44:41 hermes kernel: [221325.830743] Stack: Sep 30 16:44:41 hermes kernel: [221325.830745] ffff880031823fd8 0000000281326a75 ffff880019bf1000 ffff88006ccfb800 Sep 30 16:44:41 hermes kernel: [221325.830748] <0> ffff88013af0a200 0000000000000001 0000000000000000 ffff880031823d20 Sep 30 16:44:41 hermes kernel: [221325.830751] <0> ffff880019bf1000 ffffffff8131dc98 000000000000000d ffff880031823d9c Sep 30 16:44:41 hermes kernel: [221325.830755] Call Trace: Sep 30 16:44:41 hermes kernel: [221325.830764] [<ffffffff8131dc98>] usb_reset_and_verify_device+0xe8/0x430 Sep 30 16:44:41 hermes kernel: [221325.830768] [<ffffffff8131e093>] usb_reset_device+0xb3/0x1b0 Sep 30 16:44:41 hermes kernel: [221325.830776] [<ffffffffa0540e39>] usb_stor_port_reset+0x69/0x70 [usb_storage] Sep 30 16:44:41 hermes kernel: [221325.830792] [<ffffffffa0540ed0>] usb_stor_invoke_transport+0x90/0x3d0 [usb_storage] Sep 30 16:44:41 hermes kernel: [221325.830799] [<ffffffffa05423a7>] usb_stor_control_thread+0x147/0x230 [usb_storage] Sep 30 16:44:41 hermes kernel: [221325.830808] [<ffffffff8107149e>] kthread+0x8e/0xa0 Sep 30 16:44:41 hermes kernel: [221325.830813] [<ffffffff81003df4>] kernel_thread_helper+0x4/0x10 Sep 30 16:44:41 hermes kernel: [221325.830816] Code: 84 eb 01 00 00 31 db 80 bd c1 02 00 00 00 75 31 e9 8b 00 00 00 0f 1f 00 48 8b 85 d8 03 00 00 89 c9 4c 89 f6 48 39 c9 4a 8b 3c e0 <f3> a6 0f 85 ed 00 00 00 0f b6 85 c1 02 00 00 83 c3 01 39 d8 76 Sep 30 16:44:41 hermes kernel: [221325.830844] RIP [<ffffffff8131d95b>] descriptors_changed+0xcb/0x320 Sep 30 16:44:41 hermes kernel: [221325.830847] RSP <ffff880031823ca0> Sep 30 16:44:41 hermes kernel: [221325.830850] ---[ end trace 5404ec4722246249 ]--- -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=642857
https://bugzilla.novell.com/show_bug.cgi?id=642857#c1
Lars Marowsky-Bree
https://bugzilla.novell.com/show_bug.cgi?id=642857
https://bugzilla.novell.com/show_bug.cgi?id=642857#c2
Greg Kroah-Hartman
https://bugzilla.novell.com/show_bug.cgi?id=642857
https://bugzilla.novell.com/show_bug.cgi?id=642857#c3
Lars Marowsky-Bree
How did the drive go to sleep, was the machine suspended?
No. The SATA-USB enclosure seems to auto-suspend, or more likely it is the drive powersaving's mode activating. I can't get hdparm to pass commands through, so I can't figure that out.
error handling has much improved in usb 3.0 in later kernels, can you try factory's kernel?
Yes, see my previous comment - it no longer crashes, but it still appears to take an inordinate amount of time for the system to recover (in addition to the one or two seconds for the drive to spin up, it pauses for 5-30s easily). Let me know if that is expected or if you want more info. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=642857
https://bugzilla.novell.com/show_bug.cgi?id=642857#c4
Greg Kroah-Hartman
participants (1)
-
bugzilla_noreply@novell.com