New subject: [suse-amd64] Suse 9 Pro on dual opteron + 6GB mem crashes/panics

20 Dec 2003

Hi list,

We're running SuSE 9 Pro (latest SuSE SMP kernel) on a dual Opteron 242
with 6 GB of memory and an LSI MegaRAID 1600 controller (2 disks in
RAID 0 and 4 in RAID 5), on a Tyan Thunder K8S board (with the latest
BIOS).

uname -a:
Linux apollo 2.4.21-149-smp #1 SMP Thu Nov 13 23:24:40 UTC 2003 x86_64 x86_64 x86_64 GNU/Linux

The machine is running a heavily loaded MySQL database.

It stays up for about half an hour before it crashes, leaving this in
the warn log:

Dec 20 09:59:14 apollo kernel: Unable to handle kernel paging request at 
virtual address 00000103c003a644
Dec 20 09:06:43 apollo last message repeated 3 times
Dec 20 09:59:14 apollo kernel:  printing rip:
Dec 20 09:59:14 apollo kernel: ffffffff80148b29
Dec 20 09:59:14 apollo kernel: PML4 8063 PGD 0

Once it crashed with a more complete (and different) oops/panic:

Dec 20 20:28:02 apollo kernel: Unable to handle kernel paging request at 
virtual address 0000007f804537e0
Dec 20 20:28:02 apollo kernel:  printing rip:
Dec 20 20:28:02 apollo kernel: ffffffff801494f7
Dec 20 20:28:02 apollo kernel: PML4 1048b1067 PGD 0
Dec 20 20:28:02 apollo kernel: Oops: 0000
Dec 20 20:28:02 apollo kernel: CPU 1
Dec 20 20:28:02 apollo kernel: Pid: 7, comm: kswapd Not tainted
Dec 20 20:28:02 apollo kernel: RIP: 
0010:[kmem_cache_reap+343/880]{kmem_cache_reap+343}
Dec 20 20:28:02 apollo kernel: RIP: 
0010:[<ffffffff801494f7>]{kmem_cache_reap+343}
Dec 20 20:28:02 apollo kernel: RSP: 0000:0000010100009df8  EFLAGS: 00010016
Dec 20 20:28:02 apollo kernel: RAX: 000ffffff0000000 RBX: 
0000000000000003 RCX: 0000000000000019
Dec 20 20:28:02 apollo kernel: RDX: 0000007fffff8000 RSI: 
0000000000000000 RDI: 00000100e78f3b10
Dec 20 20:28:02 apollo kernel: RBP: 00000100e78f4080 R08: 
0000000000000033 R09: 00000100e78f3b30
Dec 20 20:28:02 apollo kernel: R10: 0000010102c44c30 R11: 
0000010102c44c00 R12: 0000000000000058
Dec 20 20:28:02 apollo kernel: R13: 0000000000000002 R14: 
ffffffff7fffffff R15: 0000000080000000
Dec 20 20:28:02 apollo kernel: FS:  0000000000560b00(0000) 
GS:ffffffff804bbb00(0000) knlGS:0000000000000000
Dec 20 20:28:02 apollo kernel: CS:  0010 DS: 0018 ES: 0018 CR0: 
000000008005003b
Dec 20 20:28:02 apollo kernel: CR2: 0000007f804537e0 CR3: 
00000000e7902000 CR4: 00000000000006e0
Dec 20 20:28:02 apollo kernel: Process kswapd (pid: 7, 
stackpage=10100009000)
Dec 20 20:28:02 apollo kernel: Stack: 0000010100009df8 0000000000000000 
ffffffff8014ae20 00000100e78f3b20
Dec 20 20:28:02 apollo kernel:        0000000200000000 0000010001000048 
0000000000000020 00000000000001d0
Dec 20 20:28:02 apollo kernel:        00000101000003c0 0000010100009e84 
0000000000000000 0000000000000000
Dec 20 20:28:02 apollo kernel: Call Trace: 
[shrink_cache+1104/1184]{shrink_cache+1104}
Dec 20 20:28:02 apollo kernel: Call Trace: 
[<ffffffff8014ae20>]{shrink_cache+1104}
Dec 20 20:28:02 apollo kernel: 
[shrink_caches+41/128]{shrink_caches+41} 
[try_to_free_pages_zone+98/272]{try_to_free_pages_zone+98}
Dec 20 20:28:02 apollo kernel: 
[<ffffffff8014b0f9>]{shrink_caches+41} 
[<ffffffff8014b1b2>]{try_to_free_pages_zone+98}
Dec 20 20:28:02 apollo kernel: 
[kswapd_balance_pgdat+113/224]{kswapd_balance_pgdat+113} 
[kswapd_balance+28/64]{kswapd_balance+28}
Dec 20 20:28:02 apollo kernel: 
[<ffffffff8014b3d1>]{kswapd_balance_pgdat+113} 
[<ffffffff8014b45c>]{kswapd_balance+28}
Dec 20 20:28:02 apollo kernel:        [kswapd+168/195]{kswapd+168} 
[child_rip+8/16]{child_rip+8}
Dec 20 20:28:02 apollo kernel:        [<ffffffff8014b5b8>]{kswapd+168} 
[<ffffffff80110ae4>]{child_rip+8}
Dec 20 20:28:02 apollo kernel:        [kswapd+0/195]{kswapd+0} 
[child_rip+0/16]{child_rip+0}
Dec 20 20:28:02 apollo kernel:        [<ffffffff8014b510>]{kswapd+0} 
[<ffffffff80110adc>]{child_rip+0}
Dec 20 20:28:02 apollo kernel:
Dec 20 20:28:02 apollo kernel:
Dec 20 20:28:02 apollo kernel: Code: 48 0f b6 92 e0 b7 45 80 48 8b 14 d5 
00 b6 45 80 48 8b 8a c8
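
As a rough cross-check of the oops above, the faulting address can be
compared against the register dump. The sketch below assumes the Code:
bytes start at the faulting RIP and that the first instruction
(48 0f b6 92 + 32-bit displacement) decodes as movzbq disp32(%rdx),%rdx;
that decoding is my own reading and is not stated in the log. The
variable names are only illustrative.

```python
# Values copied from the oops; the instruction decoding is an assumption.
MASK64 = (1 << 64) - 1

rdx = 0x0000007fffff8000  # RDX from the register dump
cr2 = 0x0000007f804537e0  # CR2, the faulting virtual address
code = bytes.fromhex("480fb692e0b74580")  # first 8 bytes of the Code: line

# Bytes 4..7 are the little-endian, sign-extended 32-bit displacement.
disp32 = int.from_bytes(code[4:8], "little", signed=True)

# Effective address of the assumed load: RDX + disp32, wrapped to 64 bits.
effective = (rdx + disp32) & MASK64

print(hex(disp32 & MASK64))
print(hex(effective))
print(effective == cr2)
```

If that reading is right, the computed address matches CR2, which would
mean the displacement is an ordinary kernel address and the odd value
is the index held in RDX. This is only a consistency check, not a
diagnosis.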

I hope you guys are able to help,

Best regards,

Arjen van der Meijden
Sysadmin Tweakers.net
