https://bugzilla.novell.com/show_bug.cgi?id=399966
User arnd@gronenberg.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=399966#c44
Arnd Gronenberg changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |arnd@gronenberg.com
--- Comment #44 from Arnd Gronenberg 2008-12-27 18:46:57 MST ---
I upgraded a server a week ago from 10.2 to 11.0 (HW: IBM x345, Dual Xeon, 3GB,
6x 15k 36GB SCSI320 RAID5) and I encountered the above mentioned problem twice.
Setup is MD RAID 5 on 6 15k SCSI320 disks with LVM and reiserfs on top. Kernel
is 2.6.25.18-0.2-pae and system is current.
The problem only occured during nightly scheduled backups (IBM TSM, client and
server process on same system) and caused the TSM backup server process to
hang. After killing the process and restarting, the following backup attempts
(manual and scheduled) did not cause problems. Backups are performed as
follows: LVM create snapshot, snapshot mounted read-only, backup performed from
snapshot, umount of snapshot, LVM removal of snapshot. Based on timing
comparison (failing backup to successful one), it seems the problem occured
shortly before, around of after the time of ending the backup (ie. umount /
lvremove)...
Is there any possibility to find out which file was being processed at the time
the error occurred?
Please find attached the excerpt from the log:
==============================================
Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): found reiserfs format
"3.6" with standard journal
Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): using ordered data
mode
Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): journal params: device
dm-11, size 8192, journal first block 18, max trans len 1024, max batch 900,
max commit age 30, max trans age 30
Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): checking transaction
log (dm-11)
Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): Using r5 hash to sort
names
Dec 28 01:39:00 arndsrv kernel: ------------[ cut here ]------------
Dec 28 01:39:00 arndsrv kernel: kernel BUG at fs/reiserfs/journal.c:1036!
Dec 28 01:39:00 arndsrv kernel: invalid opcode: 0000 [#1] SMP
Dec 28 01:39:00 arndsrv kernel: last sysfs file:
/sys/devices/system/cpu/cpu3/topology/core_siblings
Dec 28 01:39:00 arndsrv kernel: Modules linked in: udf crc_itu_t ip6t_LOG
ipt_MASQUERADE ipt_REDIRECT xt_mark xt_pkttype xt_TCPMSS xt_tcpudp ipt_LOG
xt_limit xt_MARK tun af_packet 8021q cls_u32 sch_sfq sch_htb capidrv isdn slhc
b1pci b1dma b1 ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle
iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_ipv4
nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 capi capifs
kernelcapi snd_pcm_oss snd_mixer_oss snd_seq fuse dm_crypt crypto_blkcipher
ext2 loop snd_usb_audio snd_pcm snd_timer snd_page_alloc snd_usb_lib st
i2c_piix4 rtc_cmos i2c_core snd_rawmidi snd_seq_device joydev rtc_core
snd_hwdep rtc_lib snd osst sg soundcore usb_storage usblp e1000 sr_mod button
sworks_agp ibmasm agpgart cdrom usbhid hid ff_memless linear raid0 ehci_hcd
ohci_hcd sd_mod usbcore dm_snapshot raid456 async_xor async_memcpy async_tx xor
raid1 ext3 jbd mbcache aic7xxx mptsas scsi_transport_sas mptfc
scsi_transport_fc scsi_tgt piix ide_core edd dm_mod reiserfs
Dec 28 01:39:00 arndsrv kernel: fan pata_serverworks libata dock mptspi
mptscsih mptbase scsi_transport_spi scsi_mod thermal processor [last unloaded:
speedstep_lib]
Dec 28 01:39:00 arndsrv kernel:
Dec 28 01:39:00 arndsrv kernel: Pid: 429, comm: dsmserv Tainted: G N
(2.6.25.18-0.2-pae #1)
Dec 28 01:39:00 arndsrv kernel: EIP: 0060:[<f8fb6ee8>] EFLAGS: 00210246 CPU: 3
Dec 28 01:39:00 arndsrv kernel: EIP is at flush_commit_list+0x5e/0x58d
[reiserfs]
Dec 28 01:39:00 arndsrv kernel: EAX: f78640a0 EBX: f992b000 ECX: f6d21a00 EDX:
f8fc2ed2
Dec 28 01:39:00 arndsrv kernel: ESI: f6d45e00 EDI: 007cc033 EBP: d6b7bf1c ESP:
d6b7bedc
Dec 28 01:39:00 arndsrv kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
Dec 28 01:39:00 arndsrv kernel: Process dsmserv (pid: 429, ti=d6b7a000
task=f78640a0 task.ti=d6b7a000)
Dec 28 01:39:00 arndsrv kernel: Stack: 00000001 f6d21a00 00000000 f992b000
00000000 00000000 08d456d8 04000001
Dec 28 01:39:00 arndsrv kernel: f4c632b4 c2832bb4 f7c7f120 00000000
00000000 f992b000 f6d21a00 007cc033
Dec 28 01:39:00 arndsrv kernel: d6b7bf64 f8fb9972 f2dcdc8c f6d45e00
00000000 00000000 00000001 00000000
Dec 28 01:39:00 arndsrv kernel: Call Trace:
Dec 28 01:39:00 arndsrv kernel: [<f8fb9972>]
reiserfs_commit_for_inode+0x14f/0x17d [reiserfs]
Dec 28 01:39:00 arndsrv kernel: [<f8fa752d>] reiserfs_sync_file+0x36/0x74
[reiserfs]
Dec 28 01:39:00 arndsrv kernel: [<c01958b2>] do_fsync+0x48/0x75
Dec 28 01:39:00 arndsrv kernel: [<c01958fe>] __do_fsync+0x1f/0x2f
Dec 28 01:39:00 arndsrv kernel: [<c019592d>] sys_fsync+0xd/0xf
Dec 28 01:39:01 arndsrv kernel: [<c01059e4>] sysenter_past_esp+0x6d/0xa9
Dec 28 01:39:01 arndsrv kernel: [<ffffe430>] 0xffffe430
Dec 28 01:39:01 arndsrv kernel: =======================
Dec 28 01:39:01 arndsrv kernel: Code: 45 c4 e8 c7 fd ff ff 83 7e 14 00 c7 45 d4
00 00 00 00 0f 85 14 05 00 00 64 a1 00 00 4d c0 f0 ff 80 b4 06 00 00 83 7e 08
00 75 04 <0f> 0b eb fe 8b 4d cc 8b 45 c8 3b 41 18 75 04 0f 0b eb fe ff 46
Dec 28 01:39:01 arndsrv kernel: EIP: [<f8fb6ee8>] flush_commit_list+0x5e/0x58d
[reiserfs] SS:ESP 0068:d6b7bedc
Dec 28 01:39:01 arndsrv kernel: ---[ end trace a8ee4669643ba7e6 ]---
Dec 28 01:39:01 arndsrv kernel: ------------[ cut here ]------------
Dec 28 01:39:01 arndsrv kernel: WARNING: at kernel/exit.c:892
do_exit+0x31/0x5c6()
Dec 28 01:39:01 arndsrv kernel: Modules linked in: udf crc_itu_t ip6t_LOG
ipt_MASQUERADE ipt_REDIRECT xt_mark xt_pkttype xt_TCPMSS xt_tcpudp ipt_LOG
xt_limit xt_MARK tun af_packet 8021q cls_u32 sch_sfq sch_htb capidrv isdn slhc
b1pci b1dma b1 ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle
iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_ipv4
nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 capi capifs
kernelcapi snd_pcm_oss snd_mixer_oss snd_seq fuse dm_crypt crypto_blkcipher
ext2 loop snd_usb_audio snd_pcm snd_timer snd_page_alloc snd_usb_lib st
i2c_piix4 rtc_cmos i2c_core snd_rawmidi snd_seq_device joydev rtc_core
snd_hwdep rtc_lib snd osst sg soundcore usb_storage usblp e1000 sr_mod button
sworks_agp ibmasm agpgart cdrom usbhid hid ff_memless linear raid0 ehci_hcd
ohci_hcd sd_mod usbcore dm_snapshot raid456 async_xor async_memcpy async_tx xor
raid1 ext3 jbd mbcache aic7xxx mptsas scsi_transport_sas mptfc
scsi_transport_fc scsi_tgt piix ide_core edd dm_mod reiserfs
Dec 28 01:39:01 arndsrv kernel: fan pata_serverworks libata dock mptspi
mptscsih mptbase scsi_transport_spi scsi_mod thermal processor [last unloaded:
speedstep_lib]
Dec 28 01:39:01 arndsrv kernel: Pid: 429, comm: dsmserv Tainted: G D N
2.6.25.18-0.2-pae #1
Dec 28 01:39:01 arndsrv kernel: [<c01071d9>] dump_trace+0x63/0x227
Dec 28 01:39:01 arndsrv kernel: [<c0107c8a>] show_trace+0x15/0x29
Dec 28 01:39:01 arndsrv kernel: [<c02e2e65>] dump_stack+0x5b/0x65
Dec 28 01:39:01 arndsrv kernel: [<c01257b9>] warn_on_slowpath+0x41/0x67
Dec 28 01:39:01 arndsrv kernel: [<c0128856>] do_exit+0x31/0x5c6
Dec 28 01:39:01 arndsrv kernel: [<c0107702>] die+0x15e/0x166
Dec 28 01:39:01 arndsrv kernel: [<c02e5909>] do_trap+0x8a/0xa3
Dec 28 01:39:01 arndsrv kernel: [<c0107b25>] do_invalid_op+0x6c/0x76
Dec 28 01:39:01 arndsrv kernel: [<c02e5252>] error_code+0x72/0x80
Dec 28 01:39:01 arndsrv kernel: [<f8fb6ee8>] flush_commit_list+0x5e/0x58d
[reiserfs]
Dec 28 01:39:01 arndsrv kernel: [<f8fb9972>]
reiserfs_commit_for_inode+0x14f/0x17d [reiserfs]
Dec 28 01:39:01 arndsrv kernel: [<f8fa752d>] reiserfs_sync_file+0x36/0x74
[reiserfs]
Dec 28 01:39:01 arndsrv kernel: [<c01958b2>] do_fsync+0x48/0x75
Dec 28 01:39:01 arndsrv kernel: [<c01958fe>] __do_fsync+0x1f/0x2f
Dec 28 01:39:01 arndsrv kernel: [<c019592d>] sys_fsync+0xd/0xf
Dec 28 01:39:01 arndsrv kernel: [<c01059e4>] sysenter_past_esp+0x6d/0xa9
Dec 28 01:39:01 arndsrv kernel: [<ffffe430>] 0xffffe430
Dec 28 01:39:01 arndsrv kernel: =======================
Dec 28 01:39:01 arndsrv kernel: ---[ end trace a8ee4669643ba7e6 ]---
==============================================
Thanks, Arnd
--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.