[Bug 254208] New: Kernel panic with 2.6.18.8-0.1-xen
https://bugzilla.novell.com/show_bug.cgi?id=254208 Summary: Kernel panic with 2.6.18.8-0.1-xen Product: openSUSE 10.2 Version: Final Platform: x86-64 OS/Version: Other Status: NEW Severity: Normal Priority: P5 - None Component: Xen AssignedTo: cgriffin@novell.com ReportedBy: greg.riedesel@wwu.edu QAContact: qa@suse.de After upgrading to the 2.6.18.8-0.1-xen kernel from 2.6.18.2-23-xen the machine would throw a kernel panic. I've attached a serial-console capture of the panic. Adding the line "agp=off" to the Additional Options line, not boot options, allowed boot to bypass the panic. "agpgart" is in the /etc/modprobe.d/blacklist file. The hardware is an Asus P5B Deluxe, with a Core2 CPU, running x86-64. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #1 from greg.riedesel@wwu.edu 2007-03-13 15:21 MST ------- Created an attachment (id=124157) --> (https://bugzilla.novell.com/attachment.cgi?id=124157&action=view) serial console capture of the panic -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #2 from greg.riedesel@wwu.edu 2007-03-13 15:29 MST ------- Created an attachment (id=124159) --> (https://bugzilla.novell.com/attachment.cgi?id=124159&action=view) Another serial-console capture of the panic -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #3 from greg.riedesel@wwu.edu 2007-03-13 15:39 MST ------- This bug does not affect the 'default' kernel. Only the 'xen' kernel. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 fkohler@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |ASSIGNED QAContact|qa@suse.de |jdouglas@novell.com -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #4 from greg.riedesel@wwu.edu 2007-03-15 17:16 MST ------- Doing a diff of defconfig.default and defconfig.xen gives this tidbit: 1914c1845 < CONFIG_AGP_INTEL=m ---
CONFIG_AGP_INTEL=y
I don't know enough about Xen kernel configuration to know why Intel AGP is being hard-loaded into the kernel, but this would explain why the default kernel doesn't show the problem. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 lbendixs@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|cgriffin@novell.com |jbeulich@novell.com Status|ASSIGNED |NEW ------- Comment #5 from lbendixs@novell.com 2007-03-29 17:01 MST ------- Jan, this is an openSuse 10.2 bug entry. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #6 from jbeulich@novell.com 2007-03-30 05:05 MST ------- While this config selection wasn't intended to be that way, it also wasn't changed after the original release, so you had the driver built in there, too. I'm surprised this worked for you. (Any chance you have a boot.msg obtained with the old kernel?) Jason/Lynn, any chance we have a machine (Intel chipset driven by intel-agp and 4Gb+ of memory) in the lab this can be reproduced on? Regardless of that I think I found two issues with the code: - the use of GFP_DMA32, assuming the machine address will result in memory below 4G (which isn't true under Xen) - arithmetic extending across page boundaries on values returned from virt_to_gart() (the physical<->machine relationship isn't contiguous under Xen) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 jbeulich@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |ASSIGNED -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 jbeulich@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |greg.riedesel@wwu.edu ------- Comment #7 from jbeulich@novell.com 2007-03-30 06:13 MST ------- Please also provide output of lspci and lspci -n (obtained from the native kernel). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #8 from lbendixs@novell.com 2007-03-30 09:59 MST ------- (In reply to comment #6)
Jason/Lynn, any chance we have a machine (Intel chipset driven by intel-agp and 4Gb+ of memory) in the lab this can be reproduced on?
We probably have one but as this is for opensuse it is a low priority for us right now. We may have a chance to get to it inthe middle of next week. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #10 from greg.riedesel@wwu.edu 2007-03-30 10:12 MST ------- Created an attachment (id=127892) --> (https://bugzilla.novell.com/attachment.cgi?id=127892&action=view) Output of 'lspci' under the non-Xen kernel (2.6.18.8-0.1-default) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 greg.riedesel@wwu.edu changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|greg.riedesel@wwu.edu | ------- Comment #11 from greg.riedesel@wwu.edu 2007-03-30 10:13 MST ------- Created an attachment (id=127893) --> (https://bugzilla.novell.com/attachment.cgi?id=127893&action=view) Output of "lscpi -n" in the non-Xen kernel (2.6.18.8-0.1-default) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #12 from greg.riedesel@wwu.edu 2007-03-30 10:21 MST -------
While this config selection wasn't intended to be that way, it also wasn't changed after the original release, so you had the driver built in there, too. I'm surprised this worked for you. (Any chance you have a boot.msg obtained with the old kernel?)
Bug #227324 describes some of the problem I had with the Final kernel (2.6.18.2-34) series. In that case "agp=off" also seemed to bypass the problems, though I did have luck using the modprobe blacklist. It was the agpgart problems that had me keep the 2.6.18.2-23-Xen kernel after 10.2 released, as that kernel didn't seem to have the same problem. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 jbeulich@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |FIXED ------- Comment #15 from jbeulich@novell.com 2007-04-02 02:01 MST ------- So with native not working (without agp=off or blacklisting intel-agp, as the referenced bug #227324 described), this is not really a Xen bug but a generic issue; it just happens that under the Xen kernel, due to intel-agp inadvertently being built in, you can't use the blacklisting method but have to use agp=off. Nevertheless, I believe looking closely at this code has revealed a number of weaknesses on the Xen side. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 greg.riedesel@wwu.edu changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |REOPENED Resolution|FIXED | ------- Comment #16 from greg.riedesel@wwu.edu 2007-06-15 10:56 MST ------- Bug 271573 is a duplicate of this one, with newer code. Kernel 2.6.18.8-0.3 still has this issue. The standard kernel does not show the problem, but the xen kernel does. As with the earlier one, adding agp=off to the kernel options bypasses this bug. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208 ------- Comment #17 from greg.riedesel@wwu.edu 2007-06-15 10:59 MST ------- Created an attachment (id=146580) --> (https://bugzilla.novell.com/attachment.cgi?id=146580&action=view) boot.msg file for the standard kernel boot. THis is the boot.msg file for a standard kernel boot of 2.6.18.8-0.3-default. THis had no issues. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=254208#c18
--- Comment #18 from Jan Beulich
https://bugzilla.novell.com/show_bug.cgi?id=254208#c19
Jan Beulich
https://bugzilla.novell.com/show_bug.cgi?id=254208#c20
--- Comment #20 from Greg Riedesel
https://bugzilla.novell.com/show_bug.cgi?id=254208#c21
Greg Riedesel
https://bugzilla.novell.com/show_bug.cgi?id=254208#c22
--- Comment #22 from Greg Riedesel
CONFIG_AGP_INTEL=y
Which tells me that the default kernel has intel_agp as a module, and the Xen kernel has intel_agp static in the kernel. The lsmod output for the default kernel does not show "intel_agp" loaded. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=254208#c23
Jan Beulich
participants (1)
-
bugzilla_noreply@novell.com