[Bug 780736] New: Sles10SP4 32 bit system crash on LSI MegaRAID card
https://bugzilla.novell.com/show_bug.cgi?id=780736

https://bugzilla.novell.com/show_bug.cgi?id=780736#c0

           Summary: Sles10SP4 32 bit system crash on LSI MegaRAID card
    Classification: openSUSE
           Product: openSUSE 11.4
           Version: Final
          Platform: All
        OS/Version: SuSE Linux 10.1
            Status: NEW
          Severity: Critical
          Priority: P5 - None
         Component: Kernel
        AssignedTo: kernel-maintainers@forge.provo.novell.com
        ReportedBy: sumit.saxena@lsi.com
         QAContact: qa-bugs@suse.de
          Found By: ---
           Blocker: ---

User-Agent: Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; WOW64; Trident/4.0; GTB7.4; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; .NET4.0C; InfoPath.2)

The issue is observed with SLES10 SP4, but I couldn't find a place to file a bug against the SLES10 SP4 product, so please redirect it to SLES10 SP4.

Here are the details:

We have seen a kernel crash on SLES10 SP4. Below are a few details about the crash. I also did a basic analysis, and it looks like this may be a potential issue in SLES10 SP4. I am not sure whether this is a known issue for SUSE.

Kernel crash detail:
CPU:    1
EIP:    0060:[<c011d6e4>]    Tainted: G     X VLI
EFLAGS: 00010092   (2.6.16.60-0.85.1-bigsmp #1)
EIP is at try_to_wake_up+0x1e/0x3fd
eax: f6b7dee0   ebx: c03ce320   ecx: 00000000   edx: 00000001
esi: 00000002   edi: f7b8ec04   ebp: f6b7def0   esp: f6b7dea8
ds: 007b   es: 007b   ss: 0068
Process java (pid: 4579, threadinfo=f6b7c000 task=dfd20840)
Stack: <0>00000000 00000001 00000000 00000000 00000001 c031fe40 00000046 f6b7df38
       00000046 00000046 f6b298f8 f6b29924 f6b7df38 c012e125 00000092 f650100c
       00000001 f7b8ec04 f6b7df14 c011c2de 00000000 00000001 f7b8ec00 00000002
Call Trace:
 [<c012e125>] __dequeue_signal+0x16c/0x177
 [<c011c2de>] __wake_up_common+0x2f/0x53
 [<c011d15c>] __wake_up+0x2d/0x41
 [<c01744ea>] pipe_writev+0x348/0x39b
 [<c017453d>] pipe_write+0x0/0x21
 [<c0174559>] pipe_write+0x1c/0x21
 [<c01696bb>] vfs_write+0xaa/0x152
 [<c0169cdd>] sys_write+0x3c/0x64
 [<c0103dcb>] sysenter_past_esp+0x54/0x79

Some changes in OS components may be the root cause of this crash. Earlier, the pending signal for a task was not calculated correctly. This was fixed by the patch below:

http://help.lockergnome.com/linux/PATCH-__dequeue_signal-cleanup--ftopict368...

The same code was first checked in to Linux 2.6.19:

http://lxr.linux.no/#linux+v2.6.19/kernel/signal.c#L452

The above-mentioned fix is not part of SLES10 SP4.

Reproducible: Always

Steps to Reproduce:
1. Install the SLES10 SP4 32-bit operating system.
2. Insert the MegaRAID card and update to the latest driver and firmware.
3. Create a 4-span R10 VD with 2 drives in each span.
4. Mount the volume and start I/O.
5. Assign 4 Global Hot Spares.
6. Pull out a drive from each span.
7. Observe that the system hangs and crashes.
8. The firmware was still intact, and the rebuild kicked in there.

Actual Results:
While I/O operations are in progress on the VD, the system hangs/crashes when some drives of the RAID volume are removed.

Expected Results:
System should not hang/crash on removal of drives of RAID volume.

-- 
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.