[Bug 393537] New: P550-Power5 not booting after installing Opensuse-11-Beta3-ppc64 image.
https://bugzilla.novell.com/show_bug.cgi?id=393537 User bugproxy@us.ibm.com added comment https://bugzilla.novell.com/show_bug.cgi?id=393537#c1 Summary: P550-Power5 not booting after installing Opensuse-11- Beta3-ppc64 image. Product: openSUSE 11.0 Version: Beta 2 Platform: PowerPC-64 OS/Version: All Status: NEW Severity: Blocker Priority: P5 - None Component: Installation AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: bugproxy@us.ibm.com QAContact: jsrain@novell.com Found By: Third Party Developer/Partner Partner ID: LTC 44935 =Comment: #1================================================= Cijurajan Kollanoor <cijurajan@in.ibm.com> - 2008-05-22 07:48 EDT Problem description: ==================== After installation with Opensuse-11-Beta3 on P550 lpars, lpars are not coming up. It is hitting OOPs at scheduling part of the kernel code and generating call traces. Following are the boot messages with OOP and call traces: ------ boot messages that has the OOPs and call traces ---------- IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM IBM / Elapsed time since release of system processors: 165799 mins 52 secs yaboot starting: loaded at 00040000 000633f0 (0/0/00c39a50; sp: 019fffd0) Welcome to yaboot version 10.1.22-r1034.SuSE booted from '/vdevice/v-scsi@3000000d/disk@8100000000000000' Using configfile 'built-in' Enter "help" to get some basic usage information boot: * Ppc64 linux boot: Please wait, loading kernel... Allocated 00a00000 bytes for kernel @ 00200000 Elf64 kernel loaded... Loading ramdisk... ramdisk loaded 00485000 @ 02b00000 OF stdout device is: /vdevice/vty@30000000 Hypertas detected, assuming LPAR ! command line: root=/dev/sda5 quiet sysrq=1 memory layout at init: alloc_bottom : 0000000002f85000 alloc_top : 0000000008000000 alloc_top_hi : 0000000072000000 rmo_top : 0000000008000000 ram_top : 0000000072000000 Looking for displays instantiating rtas at 0x00000000077ca000 ... done 0000000000000000 : boot cpu 0000000000000000 0000000000000002 : starting cpu hw idx 0000000000000002... done 0000000000000004 : starting cpu hw idx 0000000000000004... done 0000000000000006 : starting cpu hw idx 0000000000000006... done 0000000000000008 : starting cpu hw idx 0000000000000008... done 000000000000000a : starting cpu hw idx 000000000000000a... done 000000000000000c : starting cpu hw idx 000000000000000c... done 000000000000000e : starting cpu hw idx 000000000000000e... done 0000000000000010 : starting cpu hw idx 0000000000000010... done copying OF device tree ... Building dt strings... Building dt structure... Device tree strings 0x0000000003386000 -> 0x000000000338723f Device tree struct 0x0000000003388000 -> 0x0000000003393000 Calling quiesce ... returning from prom_init doing fast boot Creating device nodes with udev sd 2:0:1:0: [sda] Assuming drive cache: write through sd 2:0:1:0: [sda] Assuming drive cache: write through blogd: can not open pty/tty pair: No such file or directory Waiting for device /dev/sda5 to appear: ok fsck 1.40.8 (13-Mar-2008) [/sbin/fsck.ext3 (1) -- /] fsck.ext3 -a /dev/sda5 /dev/sda5: recovering journal /dev/sda5: clean, 126798/1089536 files, 858162/4351599 blocks fsck succeeded. Mounting root device read-write. Mounting root /dev/sda5 INIT: version 2.86 booting System Boot Control: Running /etc/init.d/boot Mounting procfs at /proc done Mounting sysfs at /sys done Mounting debugfs at /sys/kernel/debug done Initializing /dev done Mounting devpts at /dev/pts done Boot logging started on /dev/hvc0(/dev/console (deleted)) at Mon May 19 17:07:29 2008 done Starting udevd: ------------[ cut here ]------------ Badness at kernel/sched_rt.c:294 ------------[ cut here ]------------ Badness at kernel/sched_rt.c:300 ------------[ cut here ]------------ kernel BUG at kernel/sched_rt.c:502! Oops: Exception in kernel mode, sig: 5 [#1] SMP NR_CPUS=128 NUMA pSeries Modules linked in: e1000(+) sg sd_mod ibmvscsic scsi_transport_srp scsi_tgt pata_pdc2027x libata scsi_mod NIP: c00000000006b34c LR: c0000000004371b8 CTR: c00000000006b2dc REGS: c00000006e823780 TRAP: 0700 Tainted: G N (2.6.25.3-2-ppc64) MSR: 8000000000029032 <EE,ME,IR,DR> CR: 24000044 XER: 20000001 TASK = c000000070593580[79] 'rtasd' THREAD: c00000006e820000 CPU: 8 GPR00: 0000000000000001 c00000006e823a00 c00000000077b6a0 c000000000923080 GPR04: c000000000923108 00000000000319cc 0000000000000001 00000000000319cc GPR08: c0000000009230e0 0000000000000064 c000000000923080 c000000000923158 GPR12: 0000000000000000 c0000000006d1000 c000000000651608 0000000000000000 GPR16: c00000006e823a88 c00000006e820000 c000000000660920 c000000000661080 GPR20: c0000000705938f0 0000000000000008 c000000000659890 c00000006e823e90 GPR24: c000000000923080 c000000070593580 c00000006e823de0 0000000000000002 GPR28: 7fffffffffffffff c00000006e820080 c00000000070b290 c000000000479b10 NIP [c00000000006b34c] .pick_next_task_rt+0x70/0xb0 LR [c0000000004371b8] .schedule+0x5c4/0x8c4 Call Trace: [c00000006e823a00] [c000000000437164] .schedule+0x570/0x8c4 (unreliable) [c00000006e823b50] [c0000000000768a8] .__cond_resched+0x2c/0x5c [c00000006e823be0] [c000000000437664] ._cond_resched+0x44/0x64 [c00000006e823c60] [c0000000004376d0] .wait_for_common+0x4c/0x250 [c00000006e823d40] [c000000000076ab0] .set_cpus_allowed+0x128/0x16c [c00000006e823e20] [c00000000004880c] .do_event_scan_all_cpus+0xac/0x1c0 [c00000006e823ef0] [c0000000000489b8] .rtasd+0x98/0xf8 [c00000006e823f90] [c000000000027894] .kernel_thread+0x4c/0x68 Instruction dump: 7d200074 2000003f 48000018 e92b0008 7c0900d0 7c004838 7c000074 2000007f 7c0907b4 2f890063 7c000026 5400f7fe <0b000000> 38090001 780026e4 7c6b002a ---[ end trace 400babcfe43f968b ]--- smp_call_function on cpu 10: other cpus not responding (14) smp_call_function on cpu 10: other cpus not responding (14) If this is a customer issue, please indicate the impact to the customer: ------------------------------------------------------------------------------- Machine information: ==================== Found this issue in P550 lpar1 and lpar3. Could reproduce this several times. Machine type (p650, x235, SF2, etc.): p510 Cpu type (Power4, Power5, IA-64, etc.):Power5 Reproducible: ============= Yes, it is reproducible. Describe the steps: -> Start the installation of Opensuse-11-beta3 using nfs method of installation. -> After Installation completes and yast-installer boots, watch the messages thrown on the console. (You can see machine hanging at 'Starting udevd' and generating this messages) Hung: ===== Is the system (not just the application) hung? No, its in a kind of wait state, but not dead. If so, describe how you determined this: I could see the status of the lpar in HMC as running. OOPs: ===== Did the system produce an OOPS message on the console? Yes If so, copy it here: Oops: Exception in kernel mode, sig: 5 [#1] SMP NR_CPUS=128 NUMA pSeries =Comment: #2================================================= Cijurajan Kollanoor <cijurajan@in.ibm.com> - 2008-05-22 07:50 EDT Problem is happening only in the machine model p550. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 Stephan Kulow <coolo@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|bnc-team-screening@forge.provo.novell.com |kernel-maintainers@forge.provo.novell.com Flag| |SHIP_STOPPER- -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 Greg Kroah-Hartman <gregkh@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel-maintainers@forge.provo.novell.com |olh@novell.com -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 User olh@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=393537#c1 Olaf Hering <olh@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO Info Provider| |bugproxy@us.ibm.com --- Comment #1 from Olaf Hering <olh@novell.com> 2008-05-26 02:24:58 MDT --- can you try with vanilla kernel, or 2.6.26? It would be nice if you can provide a patch. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 User bugproxy@us.ibm.com added comment https://bugzilla.novell.com/show_bug.cgi?id=393537#c2 LTC BugProxy <bugproxy@us.ibm.com> changed: What |Removed |Added ---------------------------------------------------------------------------- URL| |http:// --- Comment #2 from LTC BugProxy <bugproxy@us.ibm.com> 2008-05-26 04:32:40 MDT --- ------- Comment From cijurajan@in.ibm.com 2008-05-26 06:26 EDT------- Novell, We will try to install the latest working openSUSE-11.0 and upgrade the kernel to 2.6.26 and see if it helps. Thanks Ciju -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 User olh@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=393537#c3 --- Comment #3 from Olaf Hering <olh@novell.com> 2008-05-26 08:06:58 MDT --- I have tested it on our p550, and booting works, on a server partition. I have not tested on a partition with vscsi. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 User bugproxy@us.ibm.com added comment https://bugzilla.novell.com/show_bug.cgi?id=393537#c4 --- Comment #4 from LTC BugProxy <bugproxy@us.ibm.com> 2008-06-04 02:48:41 MDT --- ------- Comment From IndhuDurai@in.ibm.com 2008-06-04 04:46 EDT------- hi, I could install Opensuse-11-RC1-ppc64 on p550 successfully and could successfully log into the machine. Regards, Indhu D -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 User bugproxy@us.ibm.com added comment https://bugzilla.novell.com/show_bug.cgi?id=393537#c5 --- Comment #5 from LTC BugProxy <bugproxy@us.ibm.com> 2008-06-04 02:56:34 MDT --- ------- Comment From cijurajan@in.ibm.com 2008-06-04 04:51 EDT------- Novell, We are closing this bug now. Will re-open if we are seeing it in the upcoming releases. Thanks Ciju -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=393537 User olh@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=393537#c6 Olaf Hering <olh@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |RESOLVED Info Provider|bugproxy@us.ibm.com | Resolution| |FIXED --- Comment #6 from Olaf Hering <olh@novell.com> 2008-06-04 03:01:53 MDT --- closing -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com