[opensuse-kernel] crash with drbd
Hi, I am running 3.0.34-0.7-default #1 SMP Tue Jun 19 09:56:30 UTC 2012 (fbfc70c) x86_64 x86_64 x86_64 GNU/Linux on a HP DL380 G7 SLES11SP2 server. When trying to deal with a DRDB split brain situation I am getting sometimes: [10868.575914] block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0 [10868.621929] BUG: unable to handle kernel NULL pointer dereference at 0000000000000030 [10868.655975] IP: [<ffffffff81370258>] sock_ioctl+0x28/0x280 [10868.655981] PGD 5c8afd067 PUD 5eea4b067 PMD 0 [10868.655984] Oops: 0000 [#6] SMP [10868.655986] CPU 4 [10868.655987] Modules linked in: md5 sctp dlm configfs drbd crc32c libcrc32c binfmt_misc edd af_packet bridge stp llc cpufreq_conservative cpufreq_userspace cpufreq_powersave pcc_cpufreq mperf microcode fuse loop vhost_net macvtap macvlan tun kvm_intel ipv6 kvm ipv6_lib be2net(X) joydev i7core_edac pcspkr rtc_cmos bnx2 edac_core iTCO_wdt hpwdt acpi_power_meter hpilo iTCO_vendor_support button container ext3 jbd mbcache usbhid hid dm_mirror dm_region_hash dm_log linear uhci_hcd ehci_hcd usbcore usb_common thermal processor thermal_sys hwmon scsi_dh_emc scsi_dh_alua scsi_dh_hp_sw scsi_dh_rdac scsi_dh dm_snapshot dm_mod cciss(X) hpahcisr(PX) scsi_mod [last unloaded: ocfs2_stackglue] [10868.656013] Supported: Yes, External [10868.656014] [10868.656016] Pid: 9482, comm: drbdadm Tainted: P D X 3.0.34-0.7-default #1 HP ProLiant DL380 G7 [10868.656018] RIP: 0010:[<ffffffff81370258>] [<ffffffff81370258>] sock_ioctl+0x28/0x280 [10868.656021] RSP: 0018:ffff880bb7bdbee8 EFLAGS: 00010296 [10868.656023] RAX: 0000000000000000 RBX: 0000000000005401 RCX: 00007fff8f7810b0 [10868.656024] RDX: 00007fff8f7810b0 RSI: 0000000000005401 RDI: ffff8805d3976d40 [10868.656025] RBP: 00007fff8f7810b0 R08: 0000000000000000 R09: 00007fdc3e232640 [10868.656027] R10: 00007fff8f780ee0 R11: ffffffff811e0a90 R12: 00007fff8f7810b0 [10868.656028] R13: 0000000000000000 R14: 0000000000005401 R15: 0000000000000000 [10868.656030] FS: 00007fdc3e404700(0000) GS:ffff88061fc40000(0000) knlGS:0000000000000000 [10868.656031] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [10868.656032] CR2: 0000000000000030 CR3: 00000005f2d85000 CR4: 00000000000006e0 [10868.656033] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [10868.656035] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [10868.656036] Process drbdadm (pid: 9482, threadinfo ffff880bb7bda000, task ffff880bc4f66040) [10868.656037] Stack: [10868.656038] 0000000000000202 ffff8805f5883680 00007fff8f7810b0 00007fff8f7810b0 [10868.656040] 0000000000000000 ffffffff81160f5b 0000000000000006 0000000000000000 [10868.656043] 0000000000000000 ffff8805f5883680 00007fff8f7810b0 ffffffff81161321 [10868.656045] Call Trace: [10868.656054] [<ffffffff81160f5b>] do_vfs_ioctl+0x8b/0x3b0 [10868.656057] [<ffffffff81161321>] sys_ioctl+0xa1/0xb0 [10868.656061] [<ffffffff81449392>] system_call_fastpath+0x16/0x1b [10868.656988] DWARF2 unwinder stuck at system_call_fastpath+0x16/0x1b [10868.656989] [10868.656990] Leftover inexact backtrace: [10868.656990] [10868.656992] Code: 00 00 00 48 83 ec 28 48 89 5c 24 08 48 89 6c 24 10 89 f3 4c 89 64 24 18 4c 89 6c 24 20 48 89 d5 48 8b bf a0 00 00 00 48 8b 47 20 <4c> 8b 60 30 8d 83 10 76 ff ff 83 f8 0f 0f 86 a5 00 00 00 8d 83 [10868.657003] RIP [<ffffffff81370258>] sock_ioctl+0x28/0x280 [10868.657005] RSP <ffff880bb7bdbee8> [10868.657006] CR2: 0000000000000030 [10868.657082] ---[ end trace 5ae4fc189d552994 ]--- Any else information I should provide? Best regards Martin Konold Robert Bosch GmbH Automotive Electronics Postfach 13 42 72703 Reutlingen GERMANY www.bosch.com Tel. +49 7121 35 3322 Sitz: Stuttgart, Registergericht: Amtsgericht Stuttgart, HRB 14000; Aufsichtsratsvorsitzender: Hermann Scholl; Geschäftsführung: Franz Fehrenbach, Siegfried Dais; Stefan Asenkerschbaumer, Bernd Bohr, Rudolf Colm, Volkmar Denner, Christoph Kübel, Uwe Raschke, Wolf-Henning Scheider, Werner Struth, Peter Tyroller -- To unsubscribe, e-mail: opensuse-kernel+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-kernel+owner@opensuse.org
On Friday 29 of June 2012 16:41EN, EXTERNAL Konold Martin wrote:
I am running 3.0.34-0.7-default #1 SMP Tue Jun 19 09:56:30 UTC 2012 (fbfc70c) x86_64 x86_64 x86_64 GNU/Linux on a HP DL380 G7 SLES11SP2 server.
This doesn't look like an OpenSuSE kernel, more like the latest SLES 11 SP2 kernel update. So this is probably not the right mailing list.
[10868.656016] Pid: 9482, comm: drbdadm Tainted: P D X 3.0.34-0.7-default #1 HP ProLiant DL380 G7
The "D" taint flag indicates there was a serious problem before this. You should try to find the first oops/bug message. Michal Kubeček -- To unsubscribe, e-mail: opensuse-kernel+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-kernel+owner@opensuse.org
On 06/29/2012 11:06 AM, Michal Kubeček wrote:
On Friday 29 of June 2012 16:41EN, EXTERNAL Konold Martin wrote:
I am running 3.0.34-0.7-default #1 SMP Tue Jun 19 09:56:30 UTC 2012 (fbfc70c) x86_64 x86_64 x86_64 GNU/Linux on a HP DL380 G7 SLES11SP2 server.
This doesn't look like an OpenSuSE kernel, more like the latest SLES 11 SP2 kernel update. So this is probably not the right mailing list.
[10868.656016] Pid: 9482, comm: drbdadm Tainted: P D X 3.0.34-0.7-default #1 HP ProLiant DL380 G7
The "D" taint flag indicates there was a serious problem before this. You should try to find the first oops/bug message.
You will also get more attention if you can repeat the problem without that "P" flag caused by loading module hpahcisr. Larry -- To unsubscribe, e-mail: opensuse-kernel+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-kernel+owner@opensuse.org
Hi Larry,
[10868.656016] Pid: 9482, comm: drbdadm Tainted: P D X 3.0.34-0.7-default #1 HP ProLiant DL380 G7
The "D" taint flag indicates there was a serious problem before this. You should try to find the first oops/bug message.
You will also get more attention if you can repeat the problem without that "P" flag caused by loading module hpahcisr.
Thanks for this hint. I deinstalled hpahcisr-kmp-default no because SATA is not used on this system anyway. Best regards Martin Konold Robert Bosch GmbH Automotive Electronics Postfach 13 42 72703 Reutlingen GERMANY www.bosch.com Tel. +49 7121 35 3322 Sitz: Stuttgart, Registergericht: Amtsgericht Stuttgart, HRB 14000; Aufsichtsratsvorsitzender: Franz Fehrenbach; Geschäftsführung: Volkmar Denner, Siegfried Dais; Stefan Asenkerschbaumer, Bernd Bohr, Rudolf Colm, Dirk Hoheisel, Christoph Kübel, Uwe Raschke, Wolf-Henning Scheider, Werner Struth, Peter Tyroller -- To unsubscribe, e-mail: opensuse-kernel+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-kernel+owner@opensuse.org
Hi,
This doesn't look like an OpenSuSE kernel, more like the latest SLES 11 SP2 kernel update. So this is probably not the right mailing list.
Can you please provide me a hint which ML would be more appropriate.
[10868.656016] Pid: 9482, comm: drbdadm Tainted: P D X 3.0.34-0.7-default #1 HP ProLiant DL380 G7
The "D" taint flag indicates there was a serious problem before this. You should try to find the first oops/bug message.
The first oops/bug message was identical. Best regards Martin Konold Robert Bosch GmbH Automotive Electronics Postfach 13 42 72703 Reutlingen GERMANY www.bosch.com Tel. +49 7121 35 3322 Sitz: Stuttgart, Registergericht: Amtsgericht Stuttgart, HRB 14000; Aufsichtsratsvorsitzender: Franz Fehrenbach; Geschäftsführung: Volkmar Denner, Siegfried Dais; Stefan Asenkerschbaumer, Bernd Bohr, Rudolf Colm, Dirk Hoheisel, Christoph Kübel, Uwe Raschke, Wolf-Henning Scheider, Werner Struth, Peter Tyroller -- To unsubscribe, e-mail: opensuse-kernel+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-kernel+owner@opensuse.org
participants (3)
-
EXTERNAL Konold Martin (erfrakon, RtP2/TEF72)
-
Larry Finger
-
Michal Kubeček