[Bug 737020] New: [AMD IOMMU PCI passthrough] assigning a PCIe device locks up the host directly on starting up the VM
https://bugzilla.novell.com/show_bug.cgi?id=737020 https://bugzilla.novell.com/show_bug.cgi?id=737020#c0 Summary: [AMD IOMMU PCI passthrough] assigning a PCIe device locks up the host directly on starting up the VM Classification: openSUSE Product: openSUSE 11.3 Version: Final Platform: x86-64 OS/Version: openSUSE 11.3 Status: NEW Severity: Major Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: andihartmann@freenet.de QAContact: qa@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:8.0.1) Gecko/20111121 Firefox/8.0.1 SeaMonkey/2.5 Given are two PCIe network devices (see details above): bus ID: 03:00.0 and 04:00.0. Both devices use the same module (r8169) and have the same pciid (10ec:8168). One of both should be assigned to a VM (-> 03:00.0) <hostdev mode='subsystem' type='pci' managed='yes'> <source> <address domain='0x0000' bus='0x03' slot='0x00' function='0x0'/> </source> --> </hostdev> If the VM is started, the host is locked up. Reproducible: Always Steps to Reproduce: 1. Define a VM on a host with mostly equal PCIe devices and assign one of these PCIe devices. Before unload the module needed to run on the host (here: r8169). 2. virsh start VM 3. Actual Results: - The host is locked up. - If you manually unbind the device with echo "10ec 8168" > /sys/bus/pci/drivers/pci-stub/new_id echo "0000:03:00.0" > /sys/bus/pci/devices/0000\:03\:00.0/driver/unbind echo "0000:03:00.0" > /sys/bus/pci/drivers/pci-stub/bind both devices (03:00.0 and 04:00.0) are unbound. - removing the module r8169 removes the module from both devices. Expected Results: 1. Removing of the module (here: r8169) should be bus specific - not pciid specific (I wasn't able to distinguish between both devices). 2. Unbinding of one of both devices shouldn't unbind the other one. 3. Assigning one of both PCIe devices to a VM shouldn't lock up the machine at all. Softwarestack: kernel-desktop-3.0.6-1.1.x86_64 or higher kvm-0.15.1-127.1.x86_64 libvirt-0.9.7-183.1.x86_64 Hardware Motherboard: GA-990XA-UD3/GA-990XA-UD3, BIOS F9 10/13/2011, AMD 990X chipset 03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 01) Subsystem: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller Flags: bus master, fast devsel, latency 0, IRQ 43 I/O ports at de00 [size=256] Memory at fdbff000 (64-bit, non-prefetchable) [size=4K] [virtual] Expansion ROM at fda00000 [disabled] [size=128K] Capabilities: [40] Power Management version 2 Capabilities: [48] Vital Product Data Capabilities: [50] MSI: Enable+ Count=1/2 Maskable- 64bit+ Capabilities: [60] Express Endpoint, MSI 00 Capabilities: [84] Vendor Specific Information: Len=4c <?> Capabilities: [100] Advanced Error Reporting Capabilities: [12c] Virtual Channel Capabilities: [148] Device Serial Number f1-11-00-00-68-4c-e0-00 Capabilities: [154] Power Budgeting <?> Kernel driver in use: r8169 04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 06) Subsystem: Giga-byte Technology GA-EP45-DS5 Motherboard Flags: bus master, fast devsel, latency 0, IRQ 44 I/O ports at ce00 [size=256] Memory at fd6ff000 (64-bit, prefetchable) [size=4K] Memory at fd6f8000 (64-bit, prefetchable) [size=16K] Capabilities: [40] Power Management version 3 Capabilities: [50] MSI: Enable+ Count=1/1 Maskable- 64bit+ Capabilities: [70] Express Endpoint, MSI 01 Capabilities: [b0] MSI-X: Enable- Count=4 Masked- Capabilities: [d0] Vital Product Data Capabilities: [100] Advanced Error Reporting Capabilities: [140] Virtual Channel Capabilities: [160] Device Serial Number 12-34-56-78-12-34-56-78 Kernel driver in use: r8169 After removing the driver r8169: 03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 01) Subsystem: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller Flags: fast devsel, IRQ 17 I/O ports at de00 [size=256] Memory at fdbff000 (64-bit, non-prefetchable) [size=4K] [virtual] Expansion ROM at fda00000 [disabled] [size=128K] Capabilities: [40] Power Management version 2 Capabilities: [48] Vital Product Data Capabilities: [50] MSI: Enable- Count=1/2 Maskable- 64bit+ Capabilities: [60] Express Endpoint, MSI 00 Capabilities: [84] Vendor Specific Information: Len=4c <?> Capabilities: [100] Advanced Error Reporting Capabilities: [12c] Virtual Channel Capabilities: [148] Device Serial Number f1-11-00-00-68-4c-e0-00 Capabilities: [154] Power Budgeting <?> 04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 06) Subsystem: Giga-byte Technology GA-EP45-DS5 Motherboard Flags: fast devsel, IRQ 17 I/O ports at ce00 [size=256] Memory at fd6ff000 (64-bit, prefetchable) [size=4K] Memory at fd6f8000 (64-bit, prefetchable) [size=16K] Capabilities: [40] Power Management version 3 Capabilities: [50] MSI: Enable- Count=1/1 Maskable- 64bit+ Capabilities: [70] Express Endpoint, MSI 01 Capabilities: [b0] MSI-X: Enable- Count=4 Masked- Capabilities: [d0] Vital Product Data Capabilities: [100] Advanced Error Reporting Capabilities: [140] Virtual Channel Capabilities: [160] Device Serial Number 12-34-56-78-12-34-56-78 After unbinding of 03:00.0: 03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 01) Subsystem: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller Flags: fast devsel, IRQ 17 I/O ports at de00 [size=256] Memory at fdbff000 (64-bit, non-prefetchable) [size=4K] [virtual] Expansion ROM at fda00000 [disabled] [size=128K] Capabilities: [40] Power Management version 2 Capabilities: [48] Vital Product Data Capabilities: [50] MSI: Enable- Count=1/2 Maskable- 64bit+ Capabilities: [60] Express Endpoint, MSI 00 Capabilities: [84] Vendor Specific Information: Len=4c <?> Capabilities: [100] Advanced Error Reporting Capabilities: [12c] Virtual Channel Capabilities: [148] Device Serial Number f1-11-00-00-68-4c-e0-00 Capabilities: [154] Power Budgeting <?> Kernel driver in use: pci-stub 04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 06) Subsystem: Giga-byte Technology GA-EP45-DS5 Motherboard Flags: fast devsel, IRQ 17 I/O ports at ce00 [size=256] Memory at fd6ff000 (64-bit, prefetchable) [size=4K] Memory at fd6f8000 (64-bit, prefetchable) [size=16K] Capabilities: [40] Power Management version 3 Capabilities: [50] MSI: Enable- Count=1/1 Maskable- 64bit+ Capabilities: [70] Express Endpoint, MSI 01 Capabilities: [b0] MSI-X: Enable- Count=4 Masked- Capabilities: [d0] Vital Product Data Capabilities: [100] Advanced Error Reporting Capabilities: [140] Virtual Channel Capabilities: [160] Device Serial Number 12-34-56-78-12-34-56-78 Kernel driver in use: pci-stub -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=737020 https://bugzilla.novell.com/show_bug.cgi?id=737020#c1 Greg Kroah-Hartman <gregkh@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |INVALID --- Comment #1 from Greg Kroah-Hartman <gregkh@suse.com> 2011-12-15 16:35:19 UTC --- Sorry, we don't support this on 11.3, please try 12.1 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=737020 https://bugzilla.novell.com/show_bug.cgi?id=737020#c2 Andreas Hartmann <andihartmann@freenet.de> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |REOPENED Component|Kernel |Kernel Resolution|INVALID | Product|openSUSE 11.3 |openSUSE 12.1 Target Milestone|--- |Final OS/Version|openSUSE 11.3 |openSUSE 11.4 --- Comment #2 from Andreas Hartmann <andihartmann@freenet.de> 2011-12-16 07:17:43 UTC --- I tested with 12.1 - but it behaves exactly the same way as described above. The key question: - How can a PCI(e) device be addressed bijective to remove a module or to unbind just this device and not any other device having the same pciid it? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=737020 https://bugzilla.novell.com/show_bug.cgi?id=737020#c3 --- Comment #3 from Andreas Hartmann <andihartmann@freenet.de> 2011-12-16 07:23:40 UTC --- I forgot to mention, that it just works fine, if the additional PCIe ethernet card (03:00.0) is removed from the system. But this works even fine with openSUSE 11.3 and kernel 3.0.6 / kvm 0.15.1 and libvirt 0.9.7 (softwarestack as described above). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=737020 https://bugzilla.novell.com/show_bug.cgi?id=737020#c Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel-maintainers@forge.pr |agraf@suse.com |ovo.novell.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=737020 https://bugzilla.novell.com/show_bug.cgi?id=737020#c4 Jeff Mahoney <jeffm@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|REOPENED |RESOLVED Resolution| |WONTFIX --- Comment #4 from Jeff Mahoney <jeffm@suse.com> 2014-08-08 16:26:48 EDT --- This report is against openSUSE 12.1 which is no longer under maintenance. If you are able to reproduce it with openSUSE 13.1 or openSUSE Factory, please re-open and reset the the "Product" field to the appropriate release. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com