[Bug 399966] New: kernel BUG at fs/reiserfs/journal.c:1036!
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c1 Summary: kernel BUG at fs/reiserfs/journal.c:1036! Product: openSUSE 11.0 Version: Final Platform: Other OS/Version: openSUSE 11.0 Status: NEW Severity: Critical Priority: P5 - None Component: Kernel AssignedTo: jeffm@novell.com ReportedBy: lmb@novell.com QAContact: qa@suse.de CC: jack@novell.com Found By: --- My home directory is on reiserfs, on a separate logical volume. When trying to access a particular directory (my SUSE e-mail folder, of all lucky guesses), I get a kernel oops: ------------[ cut here ]------------ kernel BUG at fs/reiserfs/journal.c:1036! invalid opcode: 0000 [#1] SMP last sysfs file: /sys/devices/LNXSYSTM:00/device:00/PNP0A08:00/device:01/PNP0C09 Modules linked in: authenc xfrm4_mode_tunnel deflate zlib_deflate ctr twofish_i5 Pid: 4801, comm: mutt Tainted: G N (2.6.25.5-1.1-pae #1) EIP: 0060:[<f965cedc>] EFLAGS: 00010246 CPU: 0 EIP is at flush_commit_list+0x5e/0x58d [reiserfs] EAX: f2c6b0a0 EBX: f9cf7000 ECX: f7851000 EDX: f9668ec6 ESI: f2d2f0c0 EDI: 0089e837 EBP: f2cb1f1c ESP: f2cb1edc DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Process mutt (pid: 4801, ti=f2cb0000 task=f2c6b0a0 task.ti=f2cb0000) Stack: 00000001 f7851000 0089e837 f9cf7000 f347d8c0 00000000 f700eb84 00000000 f2cb1f58 f2cb1f10 c015f623 f700eb84 00000000 f9cf7000 f7851000 0089e837 f2cb1f64 f965f966 f700ead8 f2d2f0c0 00000000 00000000 00000001 00000000 Call Trace: [<f965f966>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] [<f964d521>] reiserfs_sync_file+0x36/0x74 [reiserfs] [<c01954e2>] do_fsync+0x48/0x75 [<c019552e>] __do_fsync+0x1f/0x2f [<c019555d>] sys_fsync+0xd/0xf [<c01059e4>] sysenter_past_esp+0x6d/0xa9 [<ffffe430>] 0xffffe430 ======================= Code: 45 c4 e8 c7 fd ff ff 83 7e 14 00 c7 45 d4 00 00 00 00 0f 85 14 05 00 00 64 EIP: [<f965cedc>] flush_commit_list+0x5e/0x58d [reiserfs] SS:ESP 0068:f2cb1edc ---[ end trace 42b3c6f590148f7c ]--- ------------[ cut here ]------------ WARNING: at kernel/exit.c:892 do_exit+0x31/0x5c6() Modules linked in: authenc xfrm4_mode_tunnel deflate zlib_deflate ctr twofish_i5 Pid: 4801, comm: mutt Tainted: G D N 2.6.25.5-1.1-pae #1 [<c01071d9>] dump_trace+0x63/0x227 [<c0107c8a>] show_trace+0x15/0x29 [<c02e84b5>] _etext+0x5b/0x65 [<c012573d>] warn_on_slowpath+0x41/0x67 [<c01287da>] do_exit+0x31/0x5c6 [<c0107702>] die+0x15e/0x166 [<c02e5279>] do_trap+0x8a/0xa3 [<c0107b25>] do_invalid_op+0x6c/0x76 [<c02e4bc2>] error_code+0x72/0x80 [<f965cedc>] flush_commit_list+0x5e/0x58d [reiserfs] [<f965f966>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] [<f964d521>] reiserfs_sync_file+0x36/0x74 [reiserfs] [<c01954e2>] do_fsync+0x48/0x75 [<c019552e>] __do_fsync+0x1f/0x2f [<c019555d>] sys_fsync+0xd/0xf [<c01059e4>] sysenter_past_esp+0x6d/0xa9 [<ffffe430>] 0xffffe430 ======================= ---[ end trace 42b3c6f590148f7c ]--- This is reproducible. openSUSE 10.3 worked fine. It might be related to bug #389656, but I'm not seeing a hard hang. Please advise. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c1 --- Comment #1 from Lars Marowsky-Bree <lmb@novell.com> 2008-06-13 03:27:50 MDT --- kotd 2.6.25.6-SL110_BRANCH_20080612174803-default fixes this for me. I apologize, I don't have the bandwidth to bisect today, but maybe this is helpful. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c2 --- Comment #2 from Lars Marowsky-Bree <lmb@novell.com> 2008-06-13 03:28:32 MDT --- Oh, and one thing I forgot to mention - fsck did not find any issues. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c3 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |ASSIGNED --- Comment #3 from Jeff Mahoney <jeffm@novell.com> 2008-06-13 08:19:24 MDT --- Huh, the last change against reiserfs was on 2 June. The 11.0 kernel was generated from source timestamped 7 June. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c4 --- Comment #4 from Lars Marowsky-Bree <lmb@novell.com> 2008-06-15 17:15:45 MDT --- Yes, and it appears that comment #1 was indeed a redherring; it does not fix the problem, it just somehow moved it a little. Now another directory appears affected, but still within my mail spool: ------------[ cut here ]------------ kernel BUG at fs/reiserfs/journal.c:1036! invalid opcode: 0000 [#1] SMP last sysfs file: /sys/devices/LNXSYSTM:00/device:00/PNP0A08:00/device:01/PNP0C09:00/PNP0C0A:00/power_supply/BAT0/energy_full Modules linked in: nfs lockd nfs_acl sunrpc xfrm_user xfrm4_tunnel af_key cpufreq_stats ppp_deflate bsd_comp ppp_async ppp_generic slhc authenc xfrm4_mode_tunnel deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null tunnel4 ipcomp esp4 aead ah4 aes_i586 aes_generic des_generic md5 sha1_generic sha256_generic iptable_filter ip_tables ip6table_filter ip6_tables x_tables af_packet arc4 ecb ieee80211_crypt_wep ipv6 radeon drm bridge bnep cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq speedstep_lib fuse dm_crypt crypto_blkcipher loop rfcomm l2cap pcmcia nsc_ircc snd_intel8x0m snd_intel8x0 ipw2200 snd_ac97_codec irda ppdev thinkpad_acpi rtc_cmos yenta_socket ac97_bus parport_pc ieee80211 hci_usb ieee80211_crypt rtc_core parport nvram rsrc_nonstatic crc_ccitt rtc_lib video snd_pcm intel_agp bluetooth battery ac output snd_timer firmware_class pcmcia_core snd button usb_storage sr_mod soundcore iTCO_wdt agpgart i2c_i801 cdrom tg3 i2c_core iTCO_vendor_support snd_page_alloc joydev sg uinput linear sd_mod ata_piix ehci_hcd uhci_hcd usbcore dm_snapshot edd dm_mod reiserfs fan ahci libata scsi_mod dock thermal processor [last unloaded: xfrm_user] Pid: 24525, comm: mutt Tainted: G N (2.6.25.6-SL110_BRANCH_20080612174803-default #1) EIP: 0060:[<f92c0edc>] EFLAGS: 00010246 CPU: 0 EIP is at flush_commit_list+0x5e/0x58d [reiserfs] EAX: f670d120 EBX: f9c2b000 ECX: f74f6200 EDX: f92ccec6 ESI: eff252c0 EDI: 008a5368 EBP: f6763f1c ESP: f6763edc DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Process mutt (pid: 24525, ti=f6762000 task=f670d120 task.ti=f6762000) Stack: 00000001 f74f6200 008a5368 f9c2b000 efd74780 00000000 cbc9d9d0 00000000 f6763f58 f6763f10 c015dbeb cbc9d9d0 00000000 f9c2b000 f74f6200 008a5368 f6763f64 f92c3966 cbc9d924 eff252c0 00000000 00000000 00000001 00000000 Call Trace: [<f92c3966>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] [<f92b1521>] reiserfs_sync_file+0x36/0x74 [reiserfs] [<c01924a2>] do_fsync+0x48/0x75 [<c01924ee>] __do_fsync+0x1f/0x2f [<c019251d>] sys_fsync+0xd/0xf [<c01059e4>] sysenter_past_esp+0x6d/0xa9 [<ffffe430>] 0xffffe430 ======================= Code: 45 c4 e8 c7 fd ff ff 83 7e 14 00 c7 45 d4 00 00 00 00 0f 85 14 05 00 00 64 a1 00 f0 4b c0 90 ff 80 a4 06 00 00 83 7e 08 00 75 04 <0f> 0b eb fe 8b 4d cc 8b 45 c8 3b 41 18 75 04 0f 0b eb fe ff 46 EIP: [<f92c0edc>] flush_commit_list+0x5e/0x58d [reiserfs] SS:ESP 0068:f6763edc ---[ end trace 5ddcd558f5c61f6f ]--- ------------[ cut here ]------------ WARNING: at kernel/exit.c:892 do_exit+0x31/0x5c6() Modules linked in: nfs lockd nfs_acl sunrpc xfrm_user xfrm4_tunnel af_key cpufreq_stats ppp_deflate bsd_comp ppp_async ppp_generic slhc authenc xfrm4_mode_tunnel deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null tunnel4 ipcomp esp4 aead ah4 aes_i586 aes_generic des_generic md5 sha1_generic sha256_generic iptable_filter ip_tables ip6table_filter ip6_tables x_tables af_packet arc4 ecb ieee80211_crypt_wep ipv6 radeon drm bridge bnep cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq speedstep_lib fuse dm_crypt crypto_blkcipher loop rfcomm l2cap pcmcia nsc_ircc snd_intel8x0m snd_intel8x0 ipw2200 snd_ac97_codec irda ppdev thinkpad_acpi rtc_cmos yenta_socket ac97_bus parport_pc ieee80211 hci_usb ieee80211_crypt rtc_core parport nvram rsrc_nonstatic crc_ccitt rtc_lib video snd_pcm intel_agp bluetooth battery ac output snd_timer firmware_class pcmcia_core snd button usb_storage sr_mod soundcore iTCO_wdt agpgart i2c_i801 cdrom tg3 i2c_core iTCO_vendor_support snd_page_alloc joydev sg uinput linear sd_mod ata_piix ehci_hcd uhci_hcd usbcore dm_snapshot edd dm_mod reiserfs fan ahci libata scsi_mod dock thermal processor [last unloaded: xfrm_user] Pid: 24525, comm: mutt Tainted: G D N 2.6.25.6-SL110_BRANCH_20080612174803-default #1 [<c01071d9>] dump_trace+0x63/0x227 [<c0107c8a>] show_trace+0x15/0x29 [<c02e6588>] _etext+0x5b/0x65 [<c0124401>] warn_on_slowpath+0x41/0x67 [<c0127496>] do_exit+0x31/0x5c6 [<c0107702>] die+0x15e/0x166 [<c02e3529>] do_trap+0x8a/0xa3 [<c0107b25>] do_invalid_op+0x6c/0x76 [<c02e2e72>] error_code+0x72/0x80 [<f92c0edc>] flush_commit_list+0x5e/0x58d [reiserfs] [<f92c3966>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] [<f92b1521>] reiserfs_sync_file+0x36/0x74 [reiserfs] [<c01924a2>] do_fsync+0x48/0x75 [<c01924ee>] __do_fsync+0x1f/0x2f [<c019251d>] sys_fsync+0xd/0xf [<c01059e4>] sysenter_past_esp+0x6d/0xa9 [<ffffe430>] 0xffffe430 ======================= ---[ end trace 5ddcd558f5c61f6f ]--- The trace is very similar, except that this time I could capture without truncated lines. I will be travelling to India soon, so please let me know if you need me to provide anything. I reran reiserfsck and it didn't find anything, and memtest passed for a couple of hours too. (Just checking whether the hardware happened to have gone bad on me just as I was updating to 11.0.) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c5 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |lmb@novell.com --- Comment #5 from Jeff Mahoney <jeffm@novell.com> 2008-06-18 10:44:52 MDT --- Does the latest KOTD fix this? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c6 Lars Marowsky-Bree <lmb@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|lmb@novell.com | --- Comment #6 from Lars Marowsky-Bree <lmb@novell.com> 2008-06-19 00:49:21 MDT --- No. Exactly the same trace: Pid: 3724, comm: mutt Tainted: G N (2.6.25.7-SL110_BRANCH_20080618144016-default #1) EIP: 0060:[<f92c0ee8>] EFLAGS: 00010246 CPU: 0 EIP is at flush_commit_list+0x5e/0x58d [reiserfs] This kernel includes * Tue Jun 17 2008 jeffm@suse.de - patches.fixes/reiserfs-discard-xattr-prealloc: reiserfs: discard prealloc in reiserfs_delete_inode (bnc#389656). I'll try with noacl as a mount option next. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c7 --- Comment #7 from Lars Marowsky-Bree <lmb@novell.com> 2008-06-21 10:48:02 MDT --- noacl, as well as removing user_xattr, did not help - I just hit the very same oops. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c8 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |lmb@novell.com --- Comment #8 from Jeff Mahoney <jeffm@novell.com> 2008-06-23 10:14:57 MDT --- Between this bug, 396200 ("journal-1409 journal_mark_dirty: returning because j_wcount was 0"), and 389656, I'm suspecting that something is not using the BKL quite correctly, and not necessarily inside reiserfs. j_wcount == 0 means that someone is doing a journal_mark_dirty() without a transaction open. It's not hitting the BUG_ON(!th->t_trans_id) because reiserfs transactions aren't cleared on allocation (stack variable) or reset on completion. The former would be a pain to do everywhere, but the latter is easy. kernel BUG at fs/reiserfs/journal.c:1036! is issued when a transaction doesn't contain any blocks to flush. Lars, do you see the j_wcount warning on your machine too? Can you try to reproduce with kernel-lockdep (new flavor I added today)? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c9 --- Comment #9 from Lars Marowsky-Bree <lmb@novell.com> 2008-06-23 12:21:07 MDT --- I did not see the j_wcount warning; everything I saw was posted. I will see about retesting with the lockdep kernel, but since this impacted my ability to work while travelling, I moved my active home directory to ext3; however that means that the reiserfs version is available for reproduction, hopefully it will still be possible to. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c10 --- Comment #10 from Lars Marowsky-Bree <lmb@novell.com> 2008-07-02 04:32:31 MDT --- I can't seem to trigger this on-demand unless I'm actively using the home directory, which I'm kind of weary to do as the only place where it hits is my mailspool. I'm not sure I can be of much assistance debugging this further :-( -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c12 --- Comment #12 from Jeff Mahoney <jeffm@novell.com> 2008-07-08 10:08:22 MDT --- I ran into this as well, but since updating to the latest KOTD, I haven't been able to reproduce it. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User R.Vickers@cs.rhul.ac.uk added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c13 --- Comment #13 from Bob Vickers <R.Vickers@cs.rhul.ac.uk> 2008-07-09 03:59:30 MDT --- I upgraded our file server on Monday from 10.2 to 11.0 and yesterday there were a number of hangs where the system became unusable and had to be rebooted. On one of these occasions there was the kernel BUG at fs/reiserfs/journal.c:1036! mentioned above. On another occasion there was a slightly different one: Jul 8 15:04:43 csnewton kernel: ------------[ cut here ]------------ Jul 8 15:04:43 csnewton kernel: kernel BUG at fs/reiserfs/journal.c:3017! Jul 8 15:04:43 csnewton kernel: invalid opcode: 0000 [#1] SMP Jul 8 15:04:43 csnewton kernel: last sysfs file: /sys/devices/system/cpu/cpu1/topology/core_siblings Jul 8 15:04:43 csnewton kernel: Modules linked in: nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs w83627hf lm85 w83781d hwmon_vid adm1021 iptable_filter ip_tables ip6table_filter ip6_tables x_tables ipv6 quota _v2 fuse dm_crypt crypto_blkcipher reiserfs loop dm_mod ppdev parport_pc parport osst e1000 st container sg rtc_cmos intel_rng rtc_core i2c_i801 rtc_lib sr_mod shpchp e7xxx_edac button iTCO_wdt edac_core i2c_co re pci_hotplug iTCO_vendor_support cdrom uhci_hcd usbcore sd_mod piix ide_core edd ext3 mbcache jbd fan ata_piix libata dock aic79xx scsi_transport_spi scsi_mod thermal processor [last unloaded: speedstep_lib] Jul 8 15:04:43 csnewton kernel: Jul 8 15:04:43 csnewton kernel: Pid: 9963, comm: smbd Tainted: G N (2.6.25.5-1.1-pae #1) Jul 8 15:04:43 csnewton kernel: EIP: 0060:[<e0ce0430>] EFLAGS: 00210206 CPU: 0 Jul 8 15:04:43 csnewton kernel: EIP is at do_journal_begin_r+0x37/0x25b [reiserfs] Jul 8 15:04:43 csnewton kernel: EAX: dec5d120 EBX: cb8ade68 ECX: df807a00 EDX: e0cea39d Jul 8 15:04:43 csnewton kernel: ESI: e0c8e000 EDI: 0000044e EBP: cb8ade30 ESP: cb8addf8 Jul 8 15:04:43 csnewton kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Jul 8 15:04:43 csnewton kernel: Process smbd (pid: 9963, ti=cb8ac000 task=dec5d120 task.ti=cb8ac000) Jul 8 15:04:43 csnewton kernel: Stack: df807a00 cb8ade68 d61802c8 ffffffff cb8ade40 cb8ade20 c01c8dad cb8ade90 Jul 8 15:04:43 csnewton kernel: cb8ade94 00000053 cb8ade4c cb8ade68 00000000 df807a00 cb8ade4c e0ce07de Jul 8 15:04:43 csnewton kernel: 00000000 0000044e 0000044e 00000380 c0da0408 cb8adea4 e0cc8fc4 00008180 Jul 8 15:04:43 csnewton kernel: Call Trace: Jul 8 15:04:43 csnewton kernel: [<e0ce07de>] journal_begin+0xba/0xf3 [reiserfs] Jul 8 15:04:43 csnewton kernel: [<e0cc8fc4>] reiserfs_create+0xae/0x19e [reiserfs] Jul 8 15:04:43 csnewton kernel: [<c0181705>] vfs_create+0x12e/0x19d Jul 8 15:04:43 csnewton kernel: [<c01835d3>] open_namei+0x159/0x596 Jul 8 15:04:43 csnewton kernel: [<c01785af>] do_filp_open+0x20/0x36 Jul 8 15:04:43 csnewton kernel: [<c0178605>] do_sys_open+0x40/0xbb Jul 8 15:04:43 csnewton kernel: [<c01786c2>] sys_open+0x1e/0x26 Jul 8 15:04:43 csnewton kernel: [<c01059e4>] sysenter_past_esp+0x6d/0xa9 Jul 8 15:04:43 csnewton kernel: [<ffffe430>] 0xffffe430 Jul 8 15:04:43 csnewton kernel: ======================= Jul 8 15:04:43 csnewton kernel: Code: 55 c8 89 45 cc e8 5e a3 45 df 8b 55 c8 8b 82 6c 01 00 00 ba 9d a3 ce e0 8b 70 0c 8b 45 c8 e8 57 d8 ff ff 3b be 94 00 00 00 76 04 <0f> 0b eb fe 8b 4d cc c7 41 04 01 00 00 0 0 8b 45 c8 89 01 8b 55 Jul 8 15:04:43 csnewton kernel: EIP: [<e0ce0430>] do_journal_begin_r+0x37/0x25b [reiserfs] SS:ESP 0068:cb8addf8 Jul 8 15:04:43 csnewton kernel: ---[ end trace e156ae35d344c061 ]--- I have reverted the server to 10.2 for the time being. Unfortunately I can't do any experiments on this machine because any downtime is very disruptive, but I am happy to supply any further information from the logs that might be helpful. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c14 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |gerberb@zenez.com --- Comment #14 from Jeff Mahoney <jeffm@novell.com> 2008-07-15 09:18:57 MDT --- *** Bug 409054 has been marked as a duplicate of this bug. *** https://bugzilla.novell.com/show_bug.cgi?id=409054 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c15 --- Comment #15 from Boyd Gerber <gerberb@zenez.com> 2008-07-15 09:36:10 MDT --- Hang of access to random reiserfs filesystem files. Description: I have been having random hangs accessing files on my Rieserfs system. I upgraded this system from OpenSUSE 10.2 to 11.0. While using system suddenly a file becomes in accessible. I am not sure what is causing it. I can access other files and use the system, but this file will become unaccessable till I reboot the system. Once the system is reboot I can access the file. I never know which file/s will become unaccessible. I have had files in /tmp/ /home/user/ and randomly all over the place the current file is /home/gerberb/.spamassassin/bayes_tok. Anything accessing this file hangs. I have had to reboot the machine 8-26 time per day to keep using the system. Nothing works. I am unable to often kill the process that is using the file. When I do the next access to the file also hangs. Once I reboot the system I am able to access the file with out problems. What information do you need. Thanks, Comments ------- Comment #1 From Boyd Gerber 2008-07-14 21:53:17 MDT ------- i finally received this back trace Jul 14 21:45:37 xenau kernel: ------------[ cut here ]------------ Jul 14 21:45:37 xenau kernel: kernel BUG at fs/reiserfs/journal.c:1036! Jul 14 21:45:37 xenau kernel: invalid opcode: 0000 [#1] SMP Jul 14 21:45:37 xenau kernel: last sysfs file: /sys/firmware/edd/int13_dev80/extensions Jul 14 21:45:37 xenau kernel: Modules linked in: ip6t_LOG xt_pkttype xt_TCPMSS xt_limit ipt_LOG ipt_recent xt_tcpudp nfsd lockd nfs_acl auth_rpcgss exportfs raw deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null xfrm_user xfrm4_tunnel tunnel4 ipcomp esp4 aead ah4 aes_i586 crypto_blkcipher aes_generic des_generic md5 sha1_generic sha256_generic af_key af_packet sunrpc ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse loop dm_mod parport_pc ppdev parport rtc_cmos rtc_core rtc_lib st osst sg button sworks_agp e100 i2c_piix4 shpchp ide_cd_mod cdrom i2c_core mii pci_hotplug agpgart ide_disk ohci_hcd usbcore sd_mod piix edd reiserfs fan aic7xxx scsi_transport_spi scsi_mod serverworks ide_core thermal processor [last unloaded: speedstep_lib] Jul 14 21:45:37 xenau kernel: Jul 14 21:45:37 xenau kernel: Pid: 5163, comm: alpine Tainted: G N (2.6.25.9-0.2-default #1) Jul 14 21:45:37 xenau kernel: EIP: 0060:[<e0efeeec>] EFLAGS: 00210246 CPU: 0 Jul 14 21:45:37 xenau kernel: EIP is at flush_commit_list+0x5e/0x58d [reiserfs] Jul 14 21:45:37 xenau kernel: EAX: dc4af040 EBX: e1200000 ECX: def8d600 EDX: e0f0aed6 Jul 14 21:45:37 xenau kernel: ESI: deb73be0 EDI: 013c01bf EBP: dd7d9f1c ESP: dd7d9edc Jul 14 21:45:37 xenau kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Jul 14 21:45:37 xenau kernel: Process alpine (pid: 5163, ti=dd7d8000 task=dc4af040 task.ti=dd7d8000) Jul 14 21:45:37 xenau kernel: Stack: 00000001 def8d600 005a5492 e1200000 df014b00 00000000 dc9c818c 00000000 Jul 14 21:45:37 xenau kernel: dd7d9f58 dd7d9f10 c015dc1b dc9c818c 00000000 e1200000 def8d600 013c01bf Jul 14 21:45:37 xenau kernel: dd7d9f64 e0f01976 dc9c80e0 deb73be0 00000000 00000000 00000001 00000000 Jul 14 21:45:37 xenau kernel: Call Trace: Jul 14 21:45:37 xenau kernel: [<e0f01976>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Jul 14 21:45:37 xenau kernel: [<e0eef531>] reiserfs_sync_file+0x36/0x74 [reiserfs] Jul 14 21:45:37 xenau kernel: [<c0192516>] do_fsync+0x48/0x75 Jul 14 21:45:37 xenau kernel: [<c0192562>] __do_fsync+0x1f/0x2f Jul 14 21:45:37 xenau kernel: [<c0192591>] sys_fsync+0xd/0xf Jul 14 21:45:37 xenau kernel: [<c0105a62>] syscall_call+0x7/0xb Jul 14 21:45:37 xenau kernel: [<b7d6bd15>] 0xb7d6bd15 Jul 14 21:45:37 xenau kernel: ======================= Jul 14 21:45:37 xenau kernel: Code: 45 c4 e8 c7 fd ff ff 83 7e 14 00 c7 45 d4 00 00 00 00 0f 85 14 05 00 00 64 a1 00 f0 4b c0 90 ff 80 a4 06 00 00 83 7e 08 00 75 04 <0f> 0b eb fe 8b 4d cc 8b 45 c8 3b 41 18 75 04 0f 0b eb fe ff 46 Jul 14 21:45:37 xenau kernel: EIP: [<e0efeeec>] flush_commit_list+0x5e/0x58d [reiserfs] SS:ESP 0068:dd7d9edc Jul 14 21:45:37 xenau kernel: ---[ end trace 17cc811cb40d2740 ]--- Jul 14 21:45:37 xenau kernel: ------------[ cut here ]------------ Jul 14 21:45:37 xenau kernel: WARNING: at kernel/exit.c:892 do_exit+0x31/0x5c6() Jul 14 21:45:37 xenau kernel: Modules linked in: ip6t_LOG xt_pkttype xt_TCPMSS xt_limit ipt_LOG ipt_recent xt_tcpudp nfsd lockd nfs_acl auth_rpcgss exportfs raw deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null xfrm_user xfrm4_tunnel tunnel4 ipcomp esp4 aead ah4 aes_i586 crypto_blkcipher aes_generic des_generic md5 sha1_generic sha256_generic af_key af_packet sunrpc ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse loop dm_mod parport_pc ppdev parport rtc_cmos rtc_core rtc_lib st osst sg button sworks_agp e100 i2c_piix4 shpchp ide_cd_mod cdrom i2c_core mii pci_hotplug agpgart ide_disk ohci_hcd usbcore sd_mod piix edd reiserfs fan aic7xxx scsi_transport_spi scsi_mod serverworks ide_core thermal processor [last unloaded: speedstep_lib] Jul 14 21:45:37 xenau kernel: Pid: 5163, comm: alpine Tainted: G D N 2.6.25.9-0.2-default #1 Jul 14 21:45:37 xenau kernel: [<c01071d9>] dump_trace+0x63/0x227 Jul 14 21:45:37 xenau kernel: [<c0107c8a>] show_trace+0x15/0x29 Jul 14 21:45:37 xenau kernel: [<c02e6608>] _etext+0x5b/0x65 Jul 14 21:45:37 xenau kernel: [<c0124401>] warn_on_slowpath+0x41/0x67 Jul 14 21:45:37 xenau kernel: [<c0127496>] do_exit+0x31/0x5c6 Jul 14 21:45:37 xenau kernel: [<c0107702>] die+0x15e/0x166 Jul 14 21:45:37 xenau kernel: [<c02e35a9>] do_trap+0x8a/0xa3 Jul 14 21:45:37 xenau kernel: [<c0107b25>] do_invalid_op+0x6c/0x76 Jul 14 21:45:37 xenau kernel: [<c02e2ef2>] error_code+0x72/0x80 Jul 14 21:45:37 xenau kernel: [<e0efeeec>] flush_commit_list+0x5e/0x58d [reiserfs] Jul 14 21:45:37 xenau kernel: [<e0f01976>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Jul 14 21:45:37 xenau kernel: [<e0eef531>] reiserfs_sync_file+0x36/0x74 [reiserfs] Jul 14 21:45:37 xenau kernel: [<c0192516>] do_fsync+0x48/0x75 Jul 14 21:45:37 xenau kernel: [<c0192562>] __do_fsync+0x1f/0x2f Jul 14 21:45:37 xenau kernel: [<c0192591>] sys_fsync+0xd/0xf Jul 14 21:45:37 xenau kernel: [<c0105a62>] syscall_call+0x7/0xb Jul 14 21:45:37 xenau kernel: [<b7d6bd15>] 0xb7d6bd15 Jul 14 21:45:37 xenau kernel: ======================= Jul 14 21:45:37 xenau kernel: ---[ end trace 17cc811cb40d2740 ]--- ------- Comment #2 From Marcus Meissner 2008-07-15 01:23:28 MDT ------- are you using the latest updated kernel? (2.6.25.9-something?() ------- Comment #3 From Boyd Gerber 2008-07-15 07:46:48 MDT ------- Yes, as seen in the trace. I am using 2.6.25.9-0.2-default. ------- Comment #4 From Boyd Gerber 2008-07-15 07:49:49 MDT ------- This time I was using alpine to read my email. ------- Comment #5 From Jeff Mahoney 2008-07-15 09:18:57 MDT ------- Thanks for the report. This is a duplicate of 399966. Lars didn't have his file system to test with any longer, and I had hoped this had been eliminated with the latest update. Apparently it hasn't. *** This bug has been marked as a duplicate of bug 399966 *** I am seeing random file hangs. Not the system hang. Once I reboot the system I am able to access the file. They have been all over the reiserfs system. Not just in the Mail systems. The above was from using alpine. But I have had them all over not just in the mail system https://bugzilla.novell.com/show_bug.cgi?id=399966 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c16 --- Comment #16 from Jeff Mahoney <jeffm@novell.com> 2008-07-15 09:42:15 MDT --- There's no need to mirror the bug's content. Bugzilla places a link to the bug in the comments of the "master" bug. Even though you're not seeing system hangs, the bug is the same: "kernel BUG at fs/reiserfs/journal.c:1036!" The fact that you're seeing "file hangs" instead of a full system hang is only a matter of timing. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c17 --- Comment #17 from Boyd Gerber <gerberb@zenez.com> 2008-07-22 23:21:05 MDT --- Latest kernel 2.6.25.11-0.1-default Update still fails. I have had to reboot the machine 3 times after the kernel update because a file hangs and I am unable to access it. Jul 22 23:11:56 xenau kernel: ------------[ cut here ]------------ Jul 22 23:11:56 xenau kernel: kernel BUG at fs/reiserfs/journal.c:1036! Jul 22 23:11:56 xenau kernel: invalid opcode: 0000 [#1] SMP Jul 22 23:11:56 xenau kernel: last sysfs file: /sys/firmware/edd/int13_dev80/extensions Jul 22 23:11:56 xenau kernel: Modules linked in: ip6t_LOG xt_pkttype xt_TCPMSS xt_limit ipt_LOG ipt_recent xt_tcpudp nfsd lockd nfs_acl auth_rpcgss exportfs raw deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null xfrm_user xfrm4_tunnel tunnel4 ipcomp esp4 aead ah4 aes_i586 crypto_blkcipher aes_generic des_generic md5 sha1_generic sha256_generic af_key af_packet sunrpc ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse loop dm_mod parport_pc ppdev parport rtc_cmos rtc_core rtc_lib st osst sg shpchp e100 pci_hotplug i2c_piix4 ide_cd_mod sworks_agp button cdrom ide_disk mii i2c_core agpgart ohci_hcd usbcore sd_mod piix edd reiserfs fan aic7xxx scsi_transport_spi scsi_mod serverworks ide_core thermal processor [last unloaded: speedstep_lib] Jul 22 23:11:56 xenau kernel: Jul 22 23:11:56 xenau kernel: Pid: 5056, comm: alpine Tainted: G N (2.6.25.11-0.1-default #1) Jul 22 23:11:56 xenau kernel: EIP: 0060:[<e0efeee8>] EFLAGS: 00210246 CPU: 0 Jul 22 23:11:56 xenau kernel: EIP is at flush_commit_list+0x5e/0x58d [reiserfs] Jul 22 23:11:56 xenau kernel: EAX: df6db040 EBX: e1220000 ECX: de78b800 EDX: e0f0aed2 Jul 22 23:11:56 xenau kernel: ESI: dd6db120 EDI: 013fe3e6 EBP: dcca9f1c ESP: dcca9edc Jul 22 23:11:56 xenau kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Jul 22 23:11:56 xenau kernel: Process alpine (pid: 5056, ti=dcca8000 task=df6db040 task.ti=dcca8000) Jul 22 23:11:56 xenau kernel: Stack: 00000001 de78b800 00000000 e1220000 c12512e0 00000000 df347d58 00000000 Jul 22 23:11:56 xenau kernel: dcca9f58 dcca9f10 c015dc83 df347d58 00000000 e1220000 de78b800 013fe3e6 Jul 22 23:11:56 xenau kernel: dcca9f64 e0f01972 df347cac dd6db120 00000000 00000000 00000001 00000000 Jul 22 23:11:56 xenau kernel: Call Trace: Jul 22 23:11:56 xenau kernel: [<e0f01972>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Jul 22 23:11:56 xenau named[2609]: client 86.59.118.117#25503: query (cache) 'www.microsoft.com/A/IN' denied Jul 22 23:11:56 xenau kernel: [<e0eef52d>] reiserfs_sync_file+0x36/0x74 [reiserfs] Jul 22 23:11:56 xenau kernel: [<c019258e>] do_fsync+0x48/0x75 Jul 22 23:11:56 xenau kernel: [<c01925da>] __do_fsync+0x1f/0x2f Jul 22 23:11:56 xenau kernel: [<c0192609>] sys_fsync+0xd/0xf Jul 22 23:11:56 xenau kernel: [<c0105a62>] syscall_call+0x7/0xb Jul 22 23:11:56 xenau kernel: [<b7ca9d15>] 0xb7ca9d15 Jul 22 23:11:56 xenau kernel: ======================= Jul 22 23:11:56 xenau kernel: Code: 45 c4 e8 c7 fd ff ff 83 7e 14 00 c7 45 d4 00 00 00 00 0f 85 14 05 00 00 64 a1 00 f0 4b c0 90 ff 80 a4 06 00 00 83 7e 08 00 75 04 <0f> 0b eb fe 8b 4d cc 8b 45 c8 3b 41 18 75 04 0f 0b eb fe ff 46 Jul 22 23:11:56 xenau kernel: EIP: [<e0efeee8>] flush_commit_list+0x5e/0x58d [reiserfs] SS:ESP 0068:dcca9edc Jul 22 23:11:56 xenau kernel: ---[ end trace 6506536fd257c625 ]--- Jul 22 23:11:56 xenau kernel: ------------[ cut here ]------------ Jul 22 23:11:56 xenau kernel: WARNING: at kernel/exit.c:892 do_exit+0x31/0x5c6() Jul 22 23:11:56 xenau kernel: Modules linked in: ip6t_LOG xt_pkttype xt_TCPMSS xt_limit ipt_LOG ipt_recent xt_tcpudp nfsd lockd nfs_acl auth_rpcgss exportfs raw deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null xfrm_user xfrm4_tunnel tunnel4 ipcomp esp4 aead ah4 aes_i586 crypto_blkcipher aes_generic des_generic md5 sha1_generic sha256_generic af_key af_packet sunrpc ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse loop dm_mod parport_pc ppdev parport rtc_cmos rtc_core rtc_lib st osst sg shpchp e100 pci_hotplug i2c_piix4 ide_cd_mod sworks_agp button cdrom ide_disk mii i2c_core agpgart ohci_hcd usbcore sd_mod piix edd reiserfs fan aic7xxx scsi_transport_spi scsi_mod serverworks ide_core thermal processor [last unloaded: speedstep_lib] Jul 22 23:11:56 xenau kernel: Pid: 5056, comm: alpine Tainted: G D N 2.6.25.11-0.1-default #1 Jul 22 23:11:56 xenau kernel: [<c01071d9>] dump_trace+0x63/0x227 Jul 22 23:11:56 xenau kernel: [<c0107c8a>] show_trace+0x15/0x29 Jul 22 23:11:56 xenau kernel: [<c02e66d8>] _etext+0x5b/0x65 Jul 22 23:11:56 xenau kernel: [<c01243fd>] warn_on_slowpath+0x41/0x67 Jul 22 23:11:56 xenau kernel: [<c0127492>] do_exit+0x31/0x5c6 Jul 22 23:11:56 xenau kernel: [<c0107702>] die+0x15e/0x166 Jul 22 23:11:56 xenau kernel: [<c02e3679>] do_trap+0x8a/0xa3 Jul 22 23:11:56 xenau kernel: [<c0107b25>] do_invalid_op+0x6c/0x76 Jul 22 23:11:56 xenau kernel: [<c02e2fc2>] error_code+0x72/0x80 Jul 22 23:11:56 xenau kernel: [<e0efeee8>] flush_commit_list+0x5e/0x58d [reiserfs] Jul 22 23:11:56 xenau kernel: [<e0f01972>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Jul 22 23:11:56 xenau kernel: [<e0eef52d>] reiserfs_sync_file+0x36/0x74 [reiserfs] Jul 22 23:11:56 xenau kernel: [<c019258e>] do_fsync+0x48/0x75 Jul 22 23:11:56 xenau kernel: [<c01925da>] __do_fsync+0x1f/0x2f Jul 22 23:11:56 xenau kernel: [<c0192609>] sys_fsync+0xd/0xf Jul 22 23:11:56 xenau kernel: [<c0105a62>] syscall_call+0x7/0xb Jul 22 23:11:56 xenau kernel: [<b7ca9d15>] 0xb7ca9d15 Jul 22 23:11:56 xenau kernel: ======================= Jul 22 23:11:56 xenau kernel: ---[ end trace 6506536fd257c625 ]--- -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c18 --- Comment #18 from Boyd Gerber <gerberb@zenez.com> 2008-07-23 06:19:42 MDT --- Jul 23 06:12:31 xenau kernel: ------------[ cut here ]------------ Jul 23 06:12:31 xenau kernel: kernel BUG at fs/reiserfs/journal.c:1036! Jul 23 06:12:31 xenau kernel: invalid opcode: 0000 [#2] SMP Jul 23 06:12:31 xenau kernel: last sysfs file: /sys/firmware/edd/int13_dev80/extensions Jul 23 06:12:31 xenau kernel: Modules linked in: ip6t_LOG xt_pkttype xt_TCPMSS xt_limit ipt_LOG ipt_recent xt_tcpudp nfsd lockd nfs_acl auth_rpcgss exportfs raw deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null xfrm_user xfrm4_tunnel tunnel4 ipcomp esp4 aead ah4 aes_i586 crypto_blkcipher aes_generic des_generic md5 sha1_generic sha256_generic af_key af_packet sunrpc ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse loop dm_mod parport_pc ppdev parport rtc_cmos rtc_core rtc_lib st osst sg sworks_agp shpchp button e100 pci_hotplug ide_cd_mod i2c_piix4 cdrom mii agpgart ide_disk i2c_core ohci_hcd usbcore sd_mod piix edd reiserfs fan aic7xxx scsi_transport_spi scsi_mod serverworks ide_core thermal processor [last unloaded: speedstep_lib] Jul 23 06:12:31 xenau kernel: Jul 23 06:12:31 xenau kernel: Pid: 22518, comm: alpine Tainted: G D N (2.6.25.11-0.1-default #1) Jul 23 06:12:31 xenau kernel: EIP: 0060:[<e0efeee8>] EFLAGS: 00210246 CPU: 0 Jul 23 06:12:31 xenau kernel: EIP is at flush_commit_list+0x5e/0x58d [reiserfs] Jul 23 06:12:31 xenau kernel: EAX: dff02020 EBX: e1227000 ECX: dd659800 EDX: e0f0aed2 Jul 23 06:12:31 xenau kernel: ESI: dd0a58c0 EDI: 0140073c EBP: de8cdf1c ESP: de8cdedc Jul 23 06:12:31 xenau kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Jul 23 06:12:31 xenau kernel: Process alpine (pid: 22518, ti=de8cc000 task=dff02020 task.ti=de8cc000) Jul 23 06:12:31 xenau kernel: Stack: 00000001 dd659800 00000000 e1227000 c10d66c0 00000000 dca30340 00000000 Jul 23 06:12:31 xenau kernel: de8cdf58 de8cdf10 c015dc83 dca30340 00000000 e1227000 dd659800 0140073c Jul 23 06:12:31 xenau kernel: de8cdf64 e0f01972 dca30294 dd0a58c0 00000000 00000000 00000001 00000000 Jul 23 06:12:31 xenau kernel: Call Trace: Jul 23 06:12:31 xenau kernel: [<e0f01972>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Jul 23 06:12:31 xenau kernel: [<e0eef52d>] reiserfs_sync_file+0x36/0x74 [reiserfs] Jul 23 06:12:31 xenau kernel: [<c019258e>] do_fsync+0x48/0x75 Jul 23 06:12:31 xenau kernel: [<c01925da>] __do_fsync+0x1f/0x2f Jul 23 06:12:31 xenau kernel: [<c0192609>] sys_fsync+0xd/0xf Jul 23 06:12:31 xenau kernel: [<c0105a62>] syscall_call+0x7/0xb Jul 23 06:12:31 xenau kernel: [<b7bccd15>] 0xb7bccd15 Jul 23 06:12:31 xenau kernel: ======================= Jul 23 06:12:31 xenau kernel: Code: 45 c4 e8 c7 fd ff ff 83 7e 14 00 c7 45 d4 00 00 00 00 0f 85 14 05 00 00 64 a1 00 f0 4b c0 90 ff 80 a4 06 00 00 83 7e 08 00 75 04 <0f> 0b eb fe 8b 4d cc 8b 45 c8 3b 41 18 75 04 0f 0b eb fe ff 46 Jul 23 06:12:31 xenau kernel: EIP: [<e0efeee8>] flush_commit_list+0x5e/0x58d [reiserfs] SS:ESP 0068:de8cdedc Jul 23 06:12:31 xenau kernel: ---[ end trace 7d87952e5043ae41 ]--- Jul 23 06:12:31 xenau kernel: ------------[ cut here ]------------ Jul 23 06:12:31 xenau kernel: WARNING: at kernel/exit.c:892 do_exit+0x31/0x5c6() Jul 23 06:12:31 xenau kernel: Modules linked in: ip6t_LOG xt_pkttype xt_TCPMSS xt_limit ipt_LOG ipt_recent xt_tcpudp nfsd lockd nfs_acl auth_rpcgss exportfs raw deflate zlib_deflate ctr twofish_i586 twofish_common camellia serpent blowfish cbc xcbc crypto_null xfrm_user xfrm4_tunnel tunnel4 ipcomp esp4 aead ah4 aes_i586 crypto_blkcipher aes_generic des_generic md5 sha1_generic sha256_generic af_key af_packet sunrpc ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse loop dm_mod parport_pc ppdev parport rtc_cmos rtc_core rtc_lib st osst sg sworks_agp shpchp button e100 pci_hotplug ide_cd_mod i2c_piix4 cdrom mii agpgart ide_disk i2c_core ohci_hcd usbcore sd_mod piix edd reiserfs fan aic7xxx scsi_transport_spi scsi_mod serverworks ide_core thermal processor [last unloaded: speedstep_lib] Jul 23 06:12:31 xenau kernel: Pid: 22518, comm: alpine Tainted: G D N 2.6.25.11-0.1-default #1 Jul 23 06:12:31 xenau kernel: [<c01071d9>] dump_trace+0x63/0x227 Jul 23 06:12:31 xenau kernel: [<c0107c8a>] show_trace+0x15/0x29 Jul 23 06:12:31 xenau kernel: [<c02e66d8>] _etext+0x5b/0x65 Jul 23 06:12:31 xenau kernel: [<c01243fd>] warn_on_slowpath+0x41/0x67 Jul 23 06:12:31 xenau kernel: [<c0127492>] do_exit+0x31/0x5c6 Jul 23 06:12:31 xenau kernel: [<c0107702>] die+0x15e/0x166 Jul 23 06:12:31 xenau kernel: [<c02e3679>] do_trap+0x8a/0xa3 Jul 23 06:12:31 xenau kernel: [<c0107b25>] do_invalid_op+0x6c/0x76 Jul 23 06:12:31 xenau kernel: [<c02e2fc2>] error_code+0x72/0x80 Jul 23 06:12:31 xenau kernel: [<e0efeee8>] flush_commit_list+0x5e/0x58d [reiserfs] Jul 23 06:12:31 xenau kernel: [<e0f01972>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Jul 23 06:12:31 xenau kernel: [<e0eef52d>] reiserfs_sync_file+0x36/0x74 [reiserfs] Jul 23 06:12:31 xenau kernel: [<c019258e>] do_fsync+0x48/0x75 Jul 23 06:12:31 xenau kernel: [<c01925da>] __do_fsync+0x1f/0x2f Jul 23 06:12:31 xenau kernel: [<c0192609>] sys_fsync+0xd/0xf Jul 23 06:12:31 xenau kernel: [<c0105a62>] syscall_call+0x7/0xb Jul 23 06:12:31 xenau kernel: [<b7bccd15>] 0xb7bccd15 Jul 23 06:12:31 xenau kernel: ======================= Jul 23 06:12:31 xenau kernel: ---[ end trace 7d87952e5043ae41 ]--- -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gbv@oxixares.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c19 Guillermo Ballester Valor <gbv@oxixares.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |gbv@oxixares.com --- Comment #19 from Guillermo Ballester Valor <gbv@oxixares.com> 2008-08-02 02:09:07 MDT --- Hello, I've filled the bug report 413378. And searching the bugzilla database I found this one!. The third attachment in 413378 also is refered to fs/reiserfs/journal.c:1036 This is my syslog: Aug 2 07:03:44 gauss kernel: ------------[ cut here ]------------ Aug 2 07:03:44 gauss kernel: kernel BUG at fs/reiserfs/journal.c:1036! Aug 2 07:03:44 gauss kernel: invalid opcode: 0000 [1] SMP Aug 2 07:03:44 gauss kernel: last sysfs file: /sys/devices/platform/pcspkr/modalias Aug 2 07:03:44 gauss kernel: CPU 0 Aug 2 07:03:44 gauss kernel: Modules linked in: ip6t_LOG xt_tcpudp xt_pkttype ipt_LOG xt_limit it87 hwmon_vid snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse dm_crypt crypto_blkcipher ext3 jbd mbcache loop dm_mod snd_mpu401 ppdev snd_mpu401_uart snd_intel8x0 parport_pc ohci1394 snd_rawmidi snd_ac97_codec snd_seq_device ac97_bus nvidia(P) button ieee1394 parport sr_mod snd_pcm sky2 snd_timer rtc_cmos snd snd_page_alloc cdrom i2c_nforce2 i2c_core ns558 gameport rtc_core rtc_lib soundcore forcedeth k8temp floppy sg sd_mod ehci_hcd ohci_hcd usbcore amd74xx ide_core edd reiserfs fan pata_amd sata_nv sata_sil24 libata scsi_mod dock thermal processor Aug 2 07:03:44 gauss kernel: Pid: 32729, comm: amavisd Tainted: P N 2.6.25.11-0.1-default #1 Aug 2 07:03:44 gauss kernel: RIP: 0010:[<ffffffff880ceec6>] [<ffffffff880ceec6>] :reiserfs:flush_commit_list+0x6c/0x689 Aug 2 07:03:44 gauss kernel: RSP: 0018:ffff81007e15ddb8 EFLAGS: 00010246 Aug 2 07:03:44 gauss kernel: RAX: ffff810028438700 RBX: ffffc20001301000 RCX: ffff81007e15df08 Aug 2 07:03:44 gauss kernel: RDX: 0000000000000001 RSI: ffffffff880dc91c RDI: ffff81007da67800 Aug 2 07:03:44 gauss kernel: RBP: ffff81007e15de48 R08: 0000000000000000 R09: ffff81007e15dd48 Aug 2 07:03:44 gauss kernel: R10: ffffe200006dfca0 R11: ffff81007e15dd38 R12: ffff81005c6030c0 Aug 2 07:03:44 gauss kernel: R13: 0000000000000000 R14: 0000000000000000 R15: ffff81007da67800 Aug 2 07:03:44 gauss kernel: FS: 00007fc10874c6f0(0000) GS:ffffffff80630000(0000) knlGS:00000000f7cffaf0 Aug 2 07:03:44 gauss kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Aug 2 07:03:44 gauss kernel: CR2: 000000000353df28 CR3: 0000000040469000 CR4: 00000000000006e0 Aug 2 07:03:44 gauss kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Aug 2 07:03:44 gauss kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Aug 2 07:03:44 gauss kernel: Process amavisd (pid: 32729, threadinfo ffff81007e15c000, task ffff810028438700) Aug 2 07:03:44 gauss kernel: Stack: 0000000000000000 0000000000000000 ffffe2000023c240 00000001006dfca0 Aug 2 07:03:44 gauss kernel: 0000000000000000 ffffffff00000001 ffff81004b40b740 0000000000e3fa51 Aug 2 07:03:44 gauss kernel: ffffc20001301000 0000000000000000 0000000000000000 ffff810028438700 Aug 2 07:03:44 gauss kernel: Call Trace: Aug 2 07:03:44 gauss kernel: [<ffffffff880d2011>] :reiserfs:reiserfs_commit_for_inode+0x17e/0x1bc Aug 2 07:03:44 gauss kernel: [<ffffffff880bd9a3>] :reiserfs:reiserfs_sync_file+0x47/0x8d Aug 2 07:03:44 gauss kernel: [<ffffffff802c040c>] do_fsync+0x55/0x8a Aug 2 07:03:44 gauss kernel: [<ffffffff802c046f>] __do_fsync+0x2e/0x44 Aug 2 07:03:44 gauss kernel: [<ffffffff802c0493>] sys_fdatasync+0xe/0x10 Aug 2 07:03:44 gauss kernel: [<ffffffff8020bffa>] system_call_after_swapgs+0x8a/0x8f Aug 2 07:03:44 gauss kernel: DWARF2 unwinder stuck at system_call_after_swapgs+0x8a/0x8f Aug 2 07:03:44 gauss kernel: Aug 2 07:03:44 gauss kernel: Leftover inexact backtrace: Aug 2 07:03:44 gauss kernel: Aug 2 07:03:44 gauss syslog-ng[1887]: last message repeated 2 times Aug 2 07:03:44 gauss kernel: Code: 45 b0 e8 ce fd ff ff 41 83 7c 24 20 00 0f 85 00 06 00 00 65 48 8b 04 25 00 00 00 00 f0 ff 80 d0 08 00 00 49 83 7c 24 10 00 75 04 <0f> 0b eb fe 48 8b 55 b0 48 8b 5d a8 48 3b 5a 30 75 04 0f 0b eb Aug 2 07:03:44 gauss kernel: RIP [<ffffffff880ceec6>] :reiserfs:flush_commit_list+0x6c/0x689 Aug 2 07:03:44 gauss kernel: RSP <ffff81007e15ddb8> Aug 2 07:03:44 gauss kernel: ---[ end trace 19047d3744914647 ]--- Aug 2 07:03:44 gauss kernel: ------------[ cut here ]------------ Aug 2 07:03:44 gauss kernel: WARNING: at kernel/exit.c:892 do_exit+0x41/0x6ec() Aug 2 07:03:44 gauss kernel: Modules linked in: ip6t_LOG xt_tcpudp xt_pkttype ipt_LOG xt_limit it87 hwmon_vid snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 fuse dm_crypt crypto_blkcipher ext3 jbd mbcache loop dm_mod snd_mpu401 ppdev snd_mpu401_uart snd_intel8x0 parport_pc ohci1394 snd_rawmidi snd_ac97_codec snd_seq_device ac97_bus nvidia(P) button ieee1394 parport sr_mod snd_pcm sky2 snd_timer rtc_cmos snd snd_page_alloc cdrom i2c_nforce2 i2c_core ns558 gameport rtc_core rtc_lib soundcore forcedeth k8temp floppy sg sd_mod ehci_hcd ohci_hcd usbcore amd74xx ide_core edd reiserfs fan pata_amd sata_nv sata_sil24 libata scsi_mod dock thermal processor Aug 2 07:03:44 gauss kernel: Pid: 32729, comm: amavisd Tainted: P D N 2.6.25.11-0.1-default #1 Aug 2 07:03:44 gauss kernel: Aug 2 07:03:44 gauss kernel: Call Trace: Aug 2 07:03:44 gauss kernel: [<ffffffff8020d696>] dump_trace+0xc4/0x576 Aug 2 07:03:44 gauss kernel: [<ffffffff8020db88>] show_trace+0x40/0x57 Aug 2 07:03:44 gauss kernel: [<ffffffff8044f8d5>] _etext+0x72/0x7b Aug 2 07:03:44 gauss kernel: [<ffffffff80237507>] warn_on_slowpath+0x58/0x80 Aug 2 07:03:44 gauss kernel: [<ffffffff8023adc6>] do_exit+0x41/0x6ec Aug 2 07:03:44 gauss kernel: [<ffffffff80449d8f>] oops_begin+0x0/0xa0 Aug 2 07:03:44 gauss kernel: [<ffffffff8020dfd9>] die+0x5d/0x66 Aug 2 07:03:44 gauss kernel: [<ffffffff8044a2ba>] do_trap+0x110/0x11f Aug 2 07:03:44 gauss kernel: [<ffffffff8020e70c>] do_invalid_op+0xa0/0xa9 Aug 2 07:03:44 gauss kernel: [<ffffffff804496b9>] error_exit+0x0/0x60 Aug 2 07:03:44 gauss kernel: DWARF2 unwinder stuck at error_exit+0x0/0x60 Aug 2 07:03:44 gauss kernel: Aug 2 07:03:44 gauss kernel: Leftover inexact backtrace: Aug 2 07:03:44 gauss kernel: Aug 2 07:03:44 gauss kernel: [<ffffffff880ceec6>] ? :reiserfs:flush_commit_list+0x6c/0x689 Aug 2 07:03:44 gauss kernel: [<ffffffff880ceea2>] ? :reiserfs:flush_commit_list+0x48/0x689 Aug 2 07:03:44 gauss kernel: [<ffffffff880d2011>] ? :reiserfs:reiserfs_commit_for_inode+0x17e/0x1bc Aug 2 07:03:44 gauss kernel: [<ffffffff8027b17b>] ? generic_writepages+0x1f/0x25 Aug 2 07:03:44 gauss kernel: [<ffffffff8027b1b0>] ? do_writepages+0x2f/0x38 Aug 2 07:03:44 gauss kernel: [<ffffffff880bd9a3>] ? :reiserfs:reiserfs_sync_file+0x47/0x8d Aug 2 07:03:44 gauss kernel: [<ffffffff802c040c>] ? do_fsync+0x55/0x8a Aug 2 07:03:44 gauss kernel: [<ffffffff802c046f>] ? __do_fsync+0x2e/0x44 Aug 2 07:03:44 gauss kernel: [<ffffffff802c0493>] ? sys_fdatasync+0xe/0x10 Aug 2 07:03:44 gauss kernel: [<ffffffff8020bffa>] ? system_call_after_swapgs+0x8a/0x8f Aug 2 07:03:44 gauss kernel: Aug 2 07:03:44 gauss kernel: ---[ end trace 19047d3744914647 ]--- -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Priority|P5 - None |P2 - High -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c20 --- Comment #20 from Boyd Gerber <gerberb@zenez.com> 2008-08-21 03:33:14 MDT --- With the latest alpine that came out I am getting a lot of errors even on other file systems. Here is one Problem detected: "header size inconsistent". Alpine Exiting. Backtrace (11 stack frames): alpine(panic+0x15f) [0x80a7dd3] alpine(fatal+0x10) [0x8205881] alpine(unix_rewrite+0x31e) [0x823efd9] alpine(unix_check+0x56) [0x823f52d] alpine(check_point+0x3fe) [0x81ba8fe] alpine(new_mail+0x498) [0x81bb804] alpine(scrolltool+0x846) [0x8104047] alpine(mail_view_screen+0x5b6) [0x8106734] alpine(main+0x1f20) [0x80ad4fb] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7b2b5f5] alpine [0x808f2f1] Aborted Maybe this should be a new bug. for the above. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c21 --- Comment #21 from Jeff Mahoney <jeffm@novell.com> 2008-09-11 09:07:21 MDT --- The alpine problem is entirely unrelated. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c22 --- Comment #22 from Boyd Gerber <gerberb@zenez.com> 2008-09-11 10:39:23 MDT --- This is definitly not alpine related. I am seeing if often with no use of alpine. Alpine was just one tool to see the exact same error. This problem is a big pain. Since trying to create both a ext3 and reseier boot partition, to work around the problem. I am only able to boot from reiserfs. I get an ext3 error about unknown type. An other bug with this is already filed. Only current solution is not to have any reseirfs / partition. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gbv@oxixares.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c23 --- Comment #23 from Guillermo Ballester Valor <gbv@oxixares.com> 2008-09-11 12:53:43 MDT --- (In reply to comment #22 from Boyd Gerber)
This problem is a big pain. Since trying to create both a ext3 and reseier boot partition, to work around the problem. I am only able to boot from reiserfs. I get an ext3 error about unknown type. An other bug with this is already filed. Only current solution is not to have any reseirfs / partition.
I've solved my problem with this workaround: 1) Changed my /home partition from reiserfs to ext3. Yast2 has helped a lot. Just change the home directory and Yast2 sync it to new path. 2) After syncing and linking (soft) to a ext3 partition the following directories: /tmp /var/tmp /var/spool It seems it has avoided problems with the main '/' resiserfs partition. But this is still a tricky workaround. :( -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User P.Suetterlin@royac.iac.es added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c24 Peter Sütterlin <P.Suetterlin@royac.iac.es> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |P.Suetterlin@royac.iac.es --- Comment #24 from Peter Sütterlin <P.Suetterlin@royac.iac.es> 2008-09-23 09:29:45 MDT --- (In reply to comment #22 from Boyd Gerber)
Only current solution is not to have any reseirfs / partition.
Too bad - I had just set up my laptop with 11.0 from scratch, and switched to reiserfs as I wasn't too happy with ext3. I'm experiencing exactly the same bug, most of the time when mutt is accessing my Maildir type mail folder. The mutt process will fall dead and unkillable, though the rest of the system is running fine. But I also had one hard crash so far - maybe related to the same bug? I didn't investgate further at that time. I'm running 2.6.25.16-0.1-default on a Pentium-M 1.5GHz machine. Syslog output available on request (guess it wouldn't give much additional insight...?) Pit -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c25 --- Comment #25 from Boyd Gerber <gerberb@zenez.com> 2008-09-23 09:36:36 MDT --- I am sure it is this bug. I have the same issue with mutt, pine, alpine, spamassassin, ... The list goes on. The more you access the files the sooner you get hung and can not access the file. The program you use most of the time is un killable. I hate ext3 but with 11.0 it is the only boot partition that works sadly. Good Luck. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c26 --- Comment #26 from Jeff Mahoney <jeffm@novell.com> 2008-09-23 10:17:57 MDT --- Ugh, I hadn't heard anything about this one in a while and hoped it had been fixed by an update. I guess I'm digging back in. Hopefully I'll be able to create a decent simulation of the fsync workload. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c27 --- Comment #27 from Boyd Gerber <gerberb@zenez.com> 2008-09-23 10:36:49 MDT --- I only have to reboot 4-30 times a day with the latest kernel just released. This bug is a real pain. I have been force to reformat 50 systems, just to keep them working. I have left 3 systems for testing. 2 production machines. Every system is on the latest kernel. They all have the problem. So a fix would greatly be appreciated. Thanks, -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User kairo@kairo.at added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c28 --- Comment #28 from Robert Kaiser <kairo@kairo.at> 2008-09-23 10:51:56 MDT --- Jeff: For testing, I'd recommend using Firefox 3 urlbar autocomplete a lot with your Firefox profile (in ~/.mozilla) being on reiserfs. The URLs of history and bookmarks that are accessed for this are saved in a SQLite database (places.sqlite), which is quite an fsync hog. I got the browser process get stuck and being reported as being in uninterruptable sleep (i.e. status D in top or ps) quite often before I switched my home partition to ext3 because of this issue. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c29 --- Comment #29 from Jeff Mahoney <jeffm@novell.com> 2008-09-23 14:32:59 MDT --- Can you try reproducing with kernel-vanilla as well? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c30 --- Comment #30 from Jeff Mahoney <jeffm@novell.com> 2008-09-25 08:22:38 MDT --- As a data point, when I started tracking this bug down, I converted my / and /home back to reiserfs for testing. I am still unable to trigger this bug, with heavy use of firefox and thunderbird. /dev/mapper/system-reiserfs--root on / type reiserfs (rw,acl,user_xattr) /dev/mapper/cr_home on /home type reiserfs (rw,acl,user_xattr) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User P.Suetterlin@royac.iac.es added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c31 --- Comment #31 from Peter Sütterlin <P.Suetterlin@royac.iac.es> 2008-09-25 08:45:48 MDT --- Hi Jeff, I installed vanilla yesterday evening, and was booting from it today. I did not get the error so far. As a side note, so far I almost exclusively had gotten the problems when switching mailboxes and/or mail folders in mutt, and all of them are IMAP folders, i.e., not localy on my disk. What I do have, though, is header caching. I'll continue using vanilla and see if that really helps. If there's something else I can do, let me know (but will be away for vacation next week). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c32 --- Comment #32 from Boyd Gerber <gerberb@zenez.com> 2008-09-26 18:31:32 MDT --- With the vanilla kernel it has only happened 2 times in 48 hours, but it still fails. The way I did it was to create a 500 message queue to report spam. rptspam formail -s spamassassin -r < /tmp/spam formail -s spamassassin -r < /tmp/spam
/tmp/spam
I had it hang once using this with 500 messages. I have been able to cause the error with a cron job running every minute sending and email to a user@localhost. After a while no more messages are put in /var/spool/mail/user This is the 2 ways I have been able to cause the error. Sometime using mutt or alpine to read mail will cause an error and you are either dropped from the program or gone into an infinite loop waiting for the mail box to open. I left alpine open I did the following... alpine -f /var/spool/mail/user I left the window open while the email was being generated by cron. Once alpine aborted with an error message I found in the log the standard message. I hope this helps with finding and fixing it. I now have had to format the / partition to something other than rieserfs in order to work. I do now have one machine for testing should this be fixed. I have 10 friends that can not have their / formated so I need a fix for them. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c33 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |sla_lom@yahoo.com --- Comment #33 from Jeff Mahoney <jeffm@novell.com> 2008-11-10 09:43:02 MST --- *** Bug 443202 has been marked as a duplicate of this bug. *** https://bugzilla.novell.com/show_bug.cgi?id=443202 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c34 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |lmb@novell.com --- Comment #34 from Jeff Mahoney <jeffm@novell.com> 2008-11-11 17:52:15 MST --- I've posted a test kernel at http://ftp.suse.com/pub/people/jeffm/suse/testpkgs/399966/ It won't avoid the crash, but it will document more of the state of the file system when the crash occurs. Please report back with any results. I'm particularly interested in the information immediately preceding the start of the BUG header. I'm still unable to reproduce this myself. I've been running with a reiserfs / and /home for several months now and it hasn't triggered once. There's a race somewhere. I expect it's a journal list still associated with an inode after it's been released, but I'm not sure how that's happening. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User lmb@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c35 Lars Marowsky-Bree <lmb@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|lmb@novell.com | --- Comment #35 from Lars Marowsky-Bree <lmb@novell.com> 2008-12-12 03:16:57 MST --- Jeff, I can no longer trace this. My system is ext3/xfs now because I needed more stability. Eventually, I'll upgrade to 11.1 too and there the default no longer is reiser either :-( Sorry. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c36 --- Comment #36 from Boyd Gerber <gerberb@zenez.com> 2008-12-12 08:04:52 MST --- It is still there. I have one system setup where I can boot from a reisefs main partition. I rebooted my machine and ran with the latest kernel. It lasted only 2 hours before the problem was there. I then had to boot to the ext3 partition to keep the machine up and working. I can not be down for long. So the bug is still there. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c37 --- Comment #37 from Jeff Mahoney <jeffm@novell.com> 2008-12-12 08:07:29 MST --- I know the bug is still there. From comment #34: "It won't avoid the crash, but it will document more of the state of the file system when the crash occurs." I really need the log when it crashes. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c38 --- Comment #38 from Boyd Gerber <gerberb@zenez.com> 2008-12-12 08:14:20 MST --- I will see if I can get one over the weekend. I am trying to make sure everything is covered in 11.1 at the moment. Also I have a couple customer issues I have to resolve. I will try and then upload the log. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c39 --- Comment #39 from Jeff Mahoney <jeffm@novell.com> 2008-12-12 08:15:31 MST --- I have a similar report against 11.1, so I'm really looking to get this fixed for the first update. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c41 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |gerberb@zenez.com --- Comment #41 from Jeff Mahoney <jeffm@novell.com> 2008-12-12 13:41:07 MST --- I've checked in a patch that should fix this. Please try to reproduce with a KOTD kernel after tomorrow. The patch will be in the HEAD and SL110_BRANCH kernels, but not SL111_BRANCH yet. It should contain the following entry in the changelog: - patches.fixes/reiserfs-ensure-nonzero-transaction: reiserfs: ensure nonzero transaction (bnc#447406). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c42 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|gerberb@zenez.com | --- Comment #42 from Jeff Mahoney <jeffm@novell.com> 2008-12-23 12:23:25 MST --- Turns out that patch doesn't change anything. There is already an assertion after that call site where j_len == 0 would have been caught. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User gerberb@zenez.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c43 --- Comment #43 from Boyd Gerber <gerberb@zenez.com> 2008-12-23 16:49:30 MST --- I got it once, but the hard drive that I was saving the logs to failed. I have since upgraded this machine to 11.1 I have seen the error there as well. It only happens about 1 in 24 hours. This machine used to fail every couple hours. So it is better. I had a power outage here when a car hit the power pole outside. When the machine came backup the log was deleted. I will try and get it later. I need the machine up for my internet access, so I have it booting from ext3. Sorry -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User arnd@gronenberg.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c44 Arnd Gronenberg <arnd@gronenberg.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |arnd@gronenberg.com --- Comment #44 from Arnd Gronenberg <arnd@gronenberg.com> 2008-12-27 18:46:57 MST --- I upgraded a server a week ago from 10.2 to 11.0 (HW: IBM x345, Dual Xeon, 3GB, 6x 15k 36GB SCSI320 RAID5) and I encountered the above mentioned problem twice. Setup is MD RAID 5 on 6 15k SCSI320 disks with LVM and reiserfs on top. Kernel is 2.6.25.18-0.2-pae and system is current. The problem only occured during nightly scheduled backups (IBM TSM, client and server process on same system) and caused the TSM backup server process to hang. After killing the process and restarting, the following backup attempts (manual and scheduled) did not cause problems. Backups are performed as follows: LVM create snapshot, snapshot mounted read-only, backup performed from snapshot, umount of snapshot, LVM removal of snapshot. Based on timing comparison (failing backup to successful one), it seems the problem occured shortly before, around of after the time of ending the backup (ie. umount / lvremove)... Is there any possibility to find out which file was being processed at the time the error occurred? Please find attached the excerpt from the log: ============================================== Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): found reiserfs format "3.6" with standard journal Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): using ordered data mode Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): journal params: device dm-11, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30 Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): checking transaction log (dm-11) Dec 28 01:28:54 arndsrv kernel: REISERFS (device dm-11): Using r5 hash to sort names Dec 28 01:39:00 arndsrv kernel: ------------[ cut here ]------------ Dec 28 01:39:00 arndsrv kernel: kernel BUG at fs/reiserfs/journal.c:1036! Dec 28 01:39:00 arndsrv kernel: invalid opcode: 0000 [#1] SMP Dec 28 01:39:00 arndsrv kernel: last sysfs file: /sys/devices/system/cpu/cpu3/topology/core_siblings Dec 28 01:39:00 arndsrv kernel: Modules linked in: udf crc_itu_t ip6t_LOG ipt_MASQUERADE ipt_REDIRECT xt_mark xt_pkttype xt_TCPMSS xt_tcpudp ipt_LOG xt_limit xt_MARK tun af_packet 8021q cls_u32 sch_sfq sch_htb capidrv isdn slhc b1pci b1dma b1 ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 capi capifs kernelcapi snd_pcm_oss snd_mixer_oss snd_seq fuse dm_crypt crypto_blkcipher ext2 loop snd_usb_audio snd_pcm snd_timer snd_page_alloc snd_usb_lib st i2c_piix4 rtc_cmos i2c_core snd_rawmidi snd_seq_device joydev rtc_core snd_hwdep rtc_lib snd osst sg soundcore usb_storage usblp e1000 sr_mod button sworks_agp ibmasm agpgart cdrom usbhid hid ff_memless linear raid0 ehci_hcd ohci_hcd sd_mod usbcore dm_snapshot raid456 async_xor async_memcpy async_tx xor raid1 ext3 jbd mbcache aic7xxx mptsas scsi_transport_sas mptfc scsi_transport_fc scsi_tgt piix ide_core edd dm_mod reiserfs Dec 28 01:39:00 arndsrv kernel: fan pata_serverworks libata dock mptspi mptscsih mptbase scsi_transport_spi scsi_mod thermal processor [last unloaded: speedstep_lib] Dec 28 01:39:00 arndsrv kernel: Dec 28 01:39:00 arndsrv kernel: Pid: 429, comm: dsmserv Tainted: G N (2.6.25.18-0.2-pae #1) Dec 28 01:39:00 arndsrv kernel: EIP: 0060:[<f8fb6ee8>] EFLAGS: 00210246 CPU: 3 Dec 28 01:39:00 arndsrv kernel: EIP is at flush_commit_list+0x5e/0x58d [reiserfs] Dec 28 01:39:00 arndsrv kernel: EAX: f78640a0 EBX: f992b000 ECX: f6d21a00 EDX: f8fc2ed2 Dec 28 01:39:00 arndsrv kernel: ESI: f6d45e00 EDI: 007cc033 EBP: d6b7bf1c ESP: d6b7bedc Dec 28 01:39:00 arndsrv kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Dec 28 01:39:00 arndsrv kernel: Process dsmserv (pid: 429, ti=d6b7a000 task=f78640a0 task.ti=d6b7a000) Dec 28 01:39:00 arndsrv kernel: Stack: 00000001 f6d21a00 00000000 f992b000 00000000 00000000 08d456d8 04000001 Dec 28 01:39:00 arndsrv kernel: f4c632b4 c2832bb4 f7c7f120 00000000 00000000 f992b000 f6d21a00 007cc033 Dec 28 01:39:00 arndsrv kernel: d6b7bf64 f8fb9972 f2dcdc8c f6d45e00 00000000 00000000 00000001 00000000 Dec 28 01:39:00 arndsrv kernel: Call Trace: Dec 28 01:39:00 arndsrv kernel: [<f8fb9972>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Dec 28 01:39:00 arndsrv kernel: [<f8fa752d>] reiserfs_sync_file+0x36/0x74 [reiserfs] Dec 28 01:39:00 arndsrv kernel: [<c01958b2>] do_fsync+0x48/0x75 Dec 28 01:39:00 arndsrv kernel: [<c01958fe>] __do_fsync+0x1f/0x2f Dec 28 01:39:00 arndsrv kernel: [<c019592d>] sys_fsync+0xd/0xf Dec 28 01:39:01 arndsrv kernel: [<c01059e4>] sysenter_past_esp+0x6d/0xa9 Dec 28 01:39:01 arndsrv kernel: [<ffffe430>] 0xffffe430 Dec 28 01:39:01 arndsrv kernel: ======================= Dec 28 01:39:01 arndsrv kernel: Code: 45 c4 e8 c7 fd ff ff 83 7e 14 00 c7 45 d4 00 00 00 00 0f 85 14 05 00 00 64 a1 00 00 4d c0 f0 ff 80 b4 06 00 00 83 7e 08 00 75 04 <0f> 0b eb fe 8b 4d cc 8b 45 c8 3b 41 18 75 04 0f 0b eb fe ff 46 Dec 28 01:39:01 arndsrv kernel: EIP: [<f8fb6ee8>] flush_commit_list+0x5e/0x58d [reiserfs] SS:ESP 0068:d6b7bedc Dec 28 01:39:01 arndsrv kernel: ---[ end trace a8ee4669643ba7e6 ]--- Dec 28 01:39:01 arndsrv kernel: ------------[ cut here ]------------ Dec 28 01:39:01 arndsrv kernel: WARNING: at kernel/exit.c:892 do_exit+0x31/0x5c6() Dec 28 01:39:01 arndsrv kernel: Modules linked in: udf crc_itu_t ip6t_LOG ipt_MASQUERADE ipt_REDIRECT xt_mark xt_pkttype xt_TCPMSS xt_tcpudp ipt_LOG xt_limit xt_MARK tun af_packet 8021q cls_u32 sch_sfq sch_htb capidrv isdn slhc b1pci b1dma b1 ip6t_REJECT nf_conntrack_ipv6 ipt_REJECT xt_state iptable_mangle iptable_nat nf_nat iptable_filter ip6table_mangle nf_conntrack_ipv4 nf_conntrack ip_tables ip6table_filter ip6_tables x_tables ipv6 capi capifs kernelcapi snd_pcm_oss snd_mixer_oss snd_seq fuse dm_crypt crypto_blkcipher ext2 loop snd_usb_audio snd_pcm snd_timer snd_page_alloc snd_usb_lib st i2c_piix4 rtc_cmos i2c_core snd_rawmidi snd_seq_device joydev rtc_core snd_hwdep rtc_lib snd osst sg soundcore usb_storage usblp e1000 sr_mod button sworks_agp ibmasm agpgart cdrom usbhid hid ff_memless linear raid0 ehci_hcd ohci_hcd sd_mod usbcore dm_snapshot raid456 async_xor async_memcpy async_tx xor raid1 ext3 jbd mbcache aic7xxx mptsas scsi_transport_sas mptfc scsi_transport_fc scsi_tgt piix ide_core edd dm_mod reiserfs Dec 28 01:39:01 arndsrv kernel: fan pata_serverworks libata dock mptspi mptscsih mptbase scsi_transport_spi scsi_mod thermal processor [last unloaded: speedstep_lib] Dec 28 01:39:01 arndsrv kernel: Pid: 429, comm: dsmserv Tainted: G D N 2.6.25.18-0.2-pae #1 Dec 28 01:39:01 arndsrv kernel: [<c01071d9>] dump_trace+0x63/0x227 Dec 28 01:39:01 arndsrv kernel: [<c0107c8a>] show_trace+0x15/0x29 Dec 28 01:39:01 arndsrv kernel: [<c02e2e65>] dump_stack+0x5b/0x65 Dec 28 01:39:01 arndsrv kernel: [<c01257b9>] warn_on_slowpath+0x41/0x67 Dec 28 01:39:01 arndsrv kernel: [<c0128856>] do_exit+0x31/0x5c6 Dec 28 01:39:01 arndsrv kernel: [<c0107702>] die+0x15e/0x166 Dec 28 01:39:01 arndsrv kernel: [<c02e5909>] do_trap+0x8a/0xa3 Dec 28 01:39:01 arndsrv kernel: [<c0107b25>] do_invalid_op+0x6c/0x76 Dec 28 01:39:01 arndsrv kernel: [<c02e5252>] error_code+0x72/0x80 Dec 28 01:39:01 arndsrv kernel: [<f8fb6ee8>] flush_commit_list+0x5e/0x58d [reiserfs] Dec 28 01:39:01 arndsrv kernel: [<f8fb9972>] reiserfs_commit_for_inode+0x14f/0x17d [reiserfs] Dec 28 01:39:01 arndsrv kernel: [<f8fa752d>] reiserfs_sync_file+0x36/0x74 [reiserfs] Dec 28 01:39:01 arndsrv kernel: [<c01958b2>] do_fsync+0x48/0x75 Dec 28 01:39:01 arndsrv kernel: [<c01958fe>] __do_fsync+0x1f/0x2f Dec 28 01:39:01 arndsrv kernel: [<c019592d>] sys_fsync+0xd/0xf Dec 28 01:39:01 arndsrv kernel: [<c01059e4>] sysenter_past_esp+0x6d/0xa9 Dec 28 01:39:01 arndsrv kernel: [<ffffe430>] 0xffffe430 Dec 28 01:39:01 arndsrv kernel: ======================= Dec 28 01:39:01 arndsrv kernel: ---[ end trace a8ee4669643ba7e6 ]--- ============================================== Thanks, Arnd -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User swamp@suse.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c45 Swamp Script User <swamp@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status Whiteboard| |maint:released:11.0:21569 --- Comment #45 from Swamp Script User <swamp@suse.com> 2009-01-20 04:57:51 MST --- Update released for: kernel-debug, kernel-default, kernel-docs, kernel-kdump, kernel-pae, kernel-ppc64, kernel-ps3, kernel-rt, kernel-rt_debug, kernel-source, kernel-syms, kernel-vanilla, kernel-xen Products: openSUSE 11.0 (debug, i386, ppc, x86_64) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c46 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |sundance@ierne.eu.org --- Comment #46 from Jeff Mahoney <jeffm@novell.com> 2009-01-21 08:01:55 MST --- *** Bug 467815 has been marked as a duplicate of this bug. *** https://bugzilla.novell.com/show_bug.cgi?id=467815 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User sundance@ierne.eu.org added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c47 --- Comment #47 from Bali Greydragon <sundance@ierne.eu.org> 2009-01-21 09:49:30 MST --- Jeff: Is your test kernel package from comment #34 still available somewhere? I'm being hit by the bug and would like to help see to its resolution. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User bugproxy@us.ibm.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c48 LTC BugProxy <bugproxy@us.ibm.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |bugproxy@us.ibm.com --- Comment #48 from LTC BugProxy <bugproxy@us.ibm.com> 2009-01-21 17:51:31 MST --- Hello Jeff, IBM has opened bug 468163 to report an issue with the additional journal list debug patch added to SLES11 RC2. The good news is that we seem to have be seeing the condition you were hoping for fairly readily using an IO stress test. Please give us you opinion of that patch. Thanks! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c49 Vadim Krevs <vkrevs@yahoo.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |vkrevs@yahoo.com --- Comment #49 from Vadim Krevs <vkrevs@yahoo.com> 2009-01-23 02:39:13 MST --- Does comment #45 imply that this issue had been fixed in that openSUSE 11.0 kernel update? If so, can you release an updated kernel for openSUSE 11.1? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c50 --- Comment #50 from Jeff Mahoney <jeffm@novell.com> 2009-01-23 18:46:41 MST --- No, sorry. The 11.0 message was an error. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c51 --- Comment #51 from Vadim Krevs <vkrevs@yahoo.com> 2009-01-25 09:19:54 MST --- Hi Jeff, This error seems to reoccur every night around 1AM on my office desktop running openSUSE 11.1 for x86-64 since I had upgraded from 11.0 (presumably while oracle 11g performs nightly maintenance jobs). Perhaps you have a test/debug kernel package with extended tracing, etc - I don't mind installing it to capture any information that might help track down this issue. Regards, Vadim -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c52 --- Comment #52 from Jeff Mahoney <jeffm@novell.com> 2009-01-28 08:12:46 MST --- Can you try using kernel-debug to reproduce this? Another report using a kernel with more instrumentation indicated that memory corruption may be at fault. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c53 --- Comment #53 from Jeff Mahoney <jeffm@novell.com> 2009-02-11 08:11:48 MST --- *** Bug 467511 has been marked as a duplicate of this bug. *** https://bugzilla.novell.com/show_bug.cgi?id=467511 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c54 --- Comment #54 from Vadim Krevs <vkrevs@yahoo.com> 2009-02-11 09:50:48 MST --- Hmm, I've switched to kernel-debug two weeks ago and this error has not reoccurred even once. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c55 --- Comment #55 from Jeff Mahoney <jeffm@novell.com> 2009-02-11 10:22:00 MST --- Thanks for the feedback. That's consistent with what I've been hearing. I have another report where the reporter is actively testing kernels, so we hope to have this tracked down soon. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c56 --- Comment #56 from Vadim Krevs <vkrevs@yahoo.com> 2009-02-16 03:25:59 MST --- This morning I had to reboot my desktop and I forgot to select kernel-debug, so the desktop booted into kernel-default. And within an hour the problem reoccurred: Feb 16 10:21:05 stal-dev-lx1 kernel: ------------[ cut here ]------------ Feb 16 10:21:05 stal-dev-lx1 kernel: kernel BUG at fs/reiserfs/journal.c:1034! Feb 16 10:21:05 stal-dev-lx1 kernel: invalid opcode: 0000 [1] SMP Feb 16 10:21:05 stal-dev-lx1 kernel: last sysfs file: /sys/devices/system/cpu/cpu1/cache/index2/shared_cpu_map Feb 16 10:21:05 stal-dev-lx1 kernel: CPU 0 Feb 16 10:21:05 stal-dev-lx1 kernel: Modules linked in: nvidia(PN) raw snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device nfs lockd nfs_acl sunrpc ipv6 af_packet cisco_ipsec(PN) cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq fuse loop dm_mod ppdev wmi parport_pc button intel_agp parport pcspkr e1000e i2c_core snd_hda_intel snd_pcm snd_timer snd_page_alloc snd_hwdep snd soundcore rtc_cmos rtc_core rtc_lib iTCO_wdt iTCO_vendor_support sr_mod cdrom sg floppy sd_mod crc_t10dif ehci_hcd uhci_hcd usbcore edd reiserfs fan ide_pci_generic ide_core thermal processor thermal_sys hwmon ahci ata_generic libata scsi_mod dock [last unloaded: nvidia] Feb 16 10:21:05 stal-dev-lx1 kernel: Supported: Yes Feb 16 10:21:05 stal-dev-lx1 kernel: Pid: 29695, comm: oracle Tainted: P 2.6.27.7-9-default #1 Feb 16 10:21:05 stal-dev-lx1 kernel: RIP: 0010:[<ffffffffa00c997f>] [<ffffffffa00c997f>] flush_commit_list+0x5f/0x55e [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: RSP: 0018:ffff8800af78ba98 EFLAGS: 00010246 Feb 16 10:21:05 stal-dev-lx1 kernel: RAX: ffff8800b0490740 RBX: 0000000000bc638e RCX: ffff88008b03a880 Feb 16 10:21:05 stal-dev-lx1 kernel: RDX: 0000000000000001 RSI: ffffffffa00d6bf0 RDI: 0000000000000000 Feb 16 10:21:05 stal-dev-lx1 kernel: RBP: ffff88008b03a880 R08: ffff8800d9c53540 R09: 0000000000000019 Feb 16 10:21:05 stal-dev-lx1 kernel: R10: ffff88011916a0c0 R11: 0000000000000001 R12: 0000000000000000 Feb 16 10:21:05 stal-dev-lx1 kernel: R13: 0000000000000000 R14: ffff880118988000 R15: ffffc20005fe3000 Feb 16 10:21:05 stal-dev-lx1 kernel: FS: 00007f29a203b6f0(0000) GS:ffffffff8084aa00(0000) knlGS:0000000000000000 Feb 16 10:21:05 stal-dev-lx1 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Feb 16 10:21:05 stal-dev-lx1 kernel: CR2: 000000000244c538 CR3: 0000000090691000 CR4: 00000000000006e0 Feb 16 10:21:05 stal-dev-lx1 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Feb 16 10:21:05 stal-dev-lx1 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Feb 16 10:21:05 stal-dev-lx1 kernel: Process oracle (pid: 29695, threadinfo ffff8800af78a000, task ffff8800b0490740) Feb 16 10:21:05 stal-dev-lx1 kernel: Stack: 0000000000000000 0000000000000000 0000000000000000 0000000100000000 Feb 16 10:21:05 stal-dev-lx1 kernel: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 Feb 16 10:21:05 stal-dev-lx1 kernel: 0000000000000000 ffffc20005fe3000 ffff880118988000 0000000000000000 Feb 16 10:21:05 stal-dev-lx1 kernel: Call Trace: Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffffa00cc7fb>] __commit_trans_jl+0x16e/0x18c [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffffa00b79e5>] reiserfs_get_blocks_direct_io+0x79/0x95 [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802e3887>] do_direct_IO+0x147/0x369 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802e3d61>] direct_io_worker+0x174/0x309 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802e4167>] __blockdev_direct_IO+0x271/0x2c3 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffffa00b3e27>] reiserfs_direct_IO+0x4c/0x51 [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff80289f34>] generic_file_aio_read+0xba/0x16c Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802badd8>] do_sync_read+0xce/0x113 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802bb7d2>] vfs_read+0xaa/0x153 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802bb8d2>] sys_pread64+0x57/0x77 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff8020c37a>] system_call_fastpath+0x16/0x1b Feb 16 10:21:05 stal-dev-lx1 kernel: [<00007f29a08702f3>] 0x7f29a08702f3 Feb 16 10:21:05 stal-dev-lx1 kernel: Feb 16 10:21:05 stal-dev-lx1 kernel: Feb 16 10:21:05 stal-dev-lx1 kernel: Code: a0 4c 8b 78 18 e8 e2 fd ff ff 83 7d 20 00 0f 85 e5 04 00 00 65 48 8b 04 25 00 00 00 00 f0 ff 80 84 07 00 00 48 83 7d 10 00 75 04 <0f> 0b eb fe 41 3b 5f 30 75 04 0f0b eb fe ff 85 b0 00 00 00 83 Feb 16 10:21:05 stal-dev-lx1 kernel: RIP [<ffffffffa00c997f>] flush_commit_list+0x5f/0x55e [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: RSP <ffff8800af78ba98> Feb 16 10:21:05 stal-dev-lx1 kernel: ---[ end trace 8469c1c050692731 ]--- Feb 16 10:21:05 stal-dev-lx1 kernel: ------------[ cut here ]------------ Feb 16 10:21:05 stal-dev-lx1 kernel: WARNING: at kernel/exit.c:1008 do_exit+0x36/0x334() Feb 16 10:21:05 stal-dev-lx1 kernel: Modules linked in: nvidia(PN) raw snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device nfs lockd nfs_acl sunrpc ipv6 af_packet cisco_ipsec(PN) cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq fuse loop dm_mod ppdev wmi parport_pc button intel_agp parport pcspkr e1000e i2c_core snd_hda_intel snd_pcm snd_timer snd_page_alloc snd_hwdep snd soundcore rtc_cmos rtc_core rtc_lib iTCO_wdt iTCO_vendor_support sr_mod cdrom sg floppy sd_mod crc_t10dif ehci_hcd uhci_hcd usbcore edd reiserfs fan ide_pci_generic ide_core thermal processor thermal_sys hwmon ahci ata_generic libata scsi_mod dock [last unloaded: nvidia] Feb 16 10:21:05 stal-dev-lx1 kernel: Supported: Yes Feb 16 10:21:05 stal-dev-lx1 kernel: Pid: 29695, comm: oracle Tainted: P D 2.6.27.7-9-default #1 Feb 16 10:21:05 stal-dev-lx1 kernel: Feb 16 10:21:05 stal-dev-lx1 kernel: Call Trace: Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff8020e42e>] show_trace_log_lvl+0x41/0x58 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff804a1e97>] dump_stack+0x69/0x6f Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff80240eb2>] warn_on_slowpath+0x51/0x77 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802449ad>] do_exit+0x36/0x334 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff804a4b9b>] oops_begin+0x0/0x9e Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff8020edee>] do_invalid_op+0x94/0x9e Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff804a426a>] error_exit+0x0/0x70 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffffa00c997f>] flush_commit_list+0x5f/0x55e [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffffa00cc7fb>] __commit_trans_jl+0x16e/0x18c [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffffa00b79e5>] reiserfs_get_blocks_direct_io+0x79/0x95 [reiserfs] Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802e3887>] do_direct_IO+0x147/0x369 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802e3d61>] direct_io_worker+0x174/0x309 Feb 16 10:21:05 stal-dev-lx1 kernel: [<ffffffff802e4167>] __blockdev_direct_IO+0x271/0x2c3 Feb 16 10:21:06 stal-dev-lx1 kernel: [<ffffffffa00b3e27>] reiserfs_direct_IO+0x4c/0x51 [reiserfs] Feb 16 10:21:06 stal-dev-lx1 kernel: [<ffffffff80289f34>] generic_file_aio_read+0xba/0x16c Feb 16 10:21:06 stal-dev-lx1 kernel: [<ffffffff802badd8>] do_sync_read+0xce/0x113 Feb 16 10:21:06 stal-dev-lx1 kernel: [<ffffffff802bb7d2>] vfs_read+0xaa/0x153 Feb 16 10:21:06 stal-dev-lx1 kernel: [<ffffffff802bb8d2>] sys_pread64+0x57/0x77 Feb 16 10:21:06 stal-dev-lx1 kernel: [<ffffffff8020c37a>] system_call_fastpath+0x16/0x1b Feb 16 10:21:06 stal-dev-lx1 kernel: [<00007f29a08702f3>] 0x7f29a08702f3 Feb 16 10:21:06 stal-dev-lx1 kernel: Feb 16 10:21:06 stal-dev-lx1 kernel: ---[ end trace 8469c1c050692731 ]--- Feb 16 10:21:12 stal-dev-lx1 kerneloops: Submitted 1 kernel oopses to www.kerneloops.org -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c57 --- Comment #57 from Vadim Krevs <vkrevs@yahoo.com> 2009-02-17 14:29:50 MST --- And now it happened for the first time on my home Dell Inspiron 9400 noteboo running openSUSE 11.1 for x86-64 (kernel-default-2.6.27.7-9.1) and froze the entire machine: Feb 17 21:13:11 starfire kernel: ------------[ cut here ]------------ Feb 17 21:13:11 starfire kernel: kernel BUG at fs/reiserfs/journal.c:1034! Feb 17 21:13:11 starfire kernel: invalid opcode: 0000 [1] SMP Feb 17 21:13:11 starfire kernel: last sysfs file: /sys/devices/pci0000:00/0000:00:1c.1/0000:0c:00.0/rfkill/rfkill0/state Feb 17 21:13:11 starfire kernel: CPU 0 Feb 17 21:13:11 starfire kernel: Modules linked in: snd_pcm_oss snd_mixer_oss snd_seq cisco_ipsec(PN) snd_usb_audio snd_usb_lib snd_rawmidi snd_seq_device snd_hda_intel snd_pcm snd_timer snd_page_alloc snd_hwdep snd af_packet raw ipv6 microcode cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq fuse loop dm_mod arc4 ecb crypto_blkcipher iwl3945 b44 ssb sdhci_pci rfkill uvcvideo rtc_cmos pcmcia sdhci mac80211 iTCO_wdt compat_ioctl32 iTCO_vendor_support ohci1394 video rtc_core pcmcia_core ricoh_mmc mmc_core videodev sr_mod rtc_lib i2c_i801 led_class dcdbas(X) output ieee1394 mii v4l1_compat i2c_core intel_agp wmi button battery ac cdrom joydev fglrx(PX) cfg80211 pcspkr soundcore sg usbhid hid ff_memless sd_mod crc_t10dif uhci_hcd ehci_hcd usbcore edd reiserfs fan ata_piix libata scsi_mod dock thermal processor thermal_sys hwmon [last unloaded: cisco_ipsec] Feb 17 21:13:11 starfire kernel: Supported: Yes, External Feb 17 21:13:11 starfire kernel: Pid: 8746, comm: firefox31 Tainted: P 2.6.27.7-9-default #1 Feb 17 21:13:11 starfire kernel: RIP: 0010:[<ffffffffa00a297f>] [<ffffffffa00a297f>] flush_commit_list+0x5f/0x55e [reiserfs] Feb 17 21:13:11 starfire kernel: RSP: 0018:ffff8800bd6a7e08 EFLAGS: 00010246 Feb 17 21:13:11 starfire kernel: RAX: ffff8800371d44c0 RBX: 00000000006153f0 RCX: ffff880037644a80 Feb 17 21:13:11 starfire kernel: RDX: 0000000000000001 RSI: ffffffffa00afbf0 RDI: 0007ffffffffffff Feb 17 21:13:11 starfire kernel: RBP: ffff880037644a80 R08: 0000000000000000 R09: 0000000007cb76cd Feb 17 21:13:11 starfire kernel: R10: 0000000007cb752d R11: ffff880011e28678 R12: 0000000000000000 Feb 17 21:13:11 starfire kernel: R13: 0000000000000000 R14: ffff8800cd01b400 R15: ffffc20006675000 Feb 17 21:13:11 starfire kernel: FS: 00007fb504808700(0000) GS:ffffffff8084aa00(0000) knlGS:0000000000000000 Feb 17 21:13:11 starfire kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b Feb 17 21:13:11 starfire kernel: CR2: 00007fb4ff9f6e00 CR3: 000000009703f000 CR4: 00000000000006e0 Feb 17 21:13:11 starfire kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Feb 17 21:13:11 starfire kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Feb 17 21:13:11 starfire kernel: Process firefox31 (pid: 8746, threadinfo ffff8800bd6a6000, task ffff8800371d44c0) Feb 17 21:13:11 starfire kernel: Stack: 0000000000000000 0000000000000000 ffffe20000ec55c0 0000000101c446c0 Feb 17 21:13:11 starfire kernel: 0000000000000000 ffff8800371d44c0 0000000000000000 0000000680290961 Feb 17 21:13:11 starfire kernel: 0000000000000000 ffffc20006675000 ffff8800cd01b400 0000000000000000 Feb 17 21:13:11 starfire kernel: Call Trace: Feb 17 21:13:11 starfire kernel: [<ffffffffa00a57fb>] __commit_trans_jl+0x16e/0x18c [reiserfs] Feb 17 21:13:11 starfire kernel: [<ffffffffa0091545>] reiserfs_sync_file+0x3b/0x7d [reiserfs] Feb 17 21:13:11 starfire kernel: [<ffffffff802dc804>] do_fsync+0x52/0x87 Feb 17 21:13:11 starfire kernel: [<ffffffff802dc85d>] __do_fsync+0x24/0x36 Feb 17 21:13:11 starfire kernel: [<ffffffff8020c37a>] system_call_fastpath+0x16/0x1b Feb 17 21:13:11 starfire kernel: [<00007fb50441d037>] 0x7fb50441d037 Feb 17 21:13:11 starfire kernel: Feb 17 21:13:11 starfire kernel: Feb 17 21:13:11 starfire kernel: Code: a0 4c 8b 78 18 e8 e2 fd ff ff 83 7d 20 00 0f 85 e5 04 00 00 65 48 8b 04 25 00 00 00 00 f0 ff 80 84 07 00 00 48 83 7d 10 00 75 04 <0f> 0b eb fe 41 3b 5f 30 75 04 0f 0b eb fe ff 85 b0 00 00 00 83 Feb 17 21:13:11 starfire kernel: RIP [<ffffffffa00a297f>] flush_commit_list+0x5f/0x55e [reiserfs] Feb 17 21:13:11 starfire kernel: RSP <ffff8800bd6a7e08> Feb 17 21:13:11 starfire kernel: ---[ end trace 486686d0a6152d2e ]--- Feb 17 21:13:11 starfire kernel: ------------[ cut here ]------------ Feb 17 21:13:11 starfire kernel: WARNING: at kernel/exit.c:1008 do_exit+0x36/0x334() Feb 17 21:13:11 starfire kernel: Modules linked in: snd_pcm_oss snd_mixer_oss snd_seq cisco_ipsec(PN) snd_usb_audio snd_usb_lib snd_rawmidi snd_seq_device snd_hda_intel snd_pcm snd_timer snd_page_alloc snd_hwdep snd af_packet raw ipv6 microcode cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq fuse loop dm_mod arc4 ecb crypto_blkcipher iwl3945 b44 ssb sdhci_pci rfkill uvcvideo rtc_cmos pcmcia sdhci mac80211 iTCO_wdt compat_ioctl32 iTCO_vendor_support ohci1394 video rtc_core pcmcia_core ricoh_mmc mmc_core videodev sr_mod rtc_lib i2c_i801 led_class dcdbas(X) output ieee1394 mii v4l1_compat i2c_core intel_agp wmi button battery ac cdrom joydev fglrx(PX) cfg80211 pcspkr soundcore sg usbhid hid ff_memless sd_mod crc_t10dif uhci_hcd ehci_hcd usbcore edd reiserfs fan ata_piix libata scsi_mod dock thermal processor thermal_sys hwmon [last unloaded: cisco_ipsec] Feb 17 21:13:11 starfire kernel: Supported: Yes, External Feb 17 21:13:11 starfire kernel: Pid: 8746, comm: firefox31 Tainted: P D 2.6.27.7-9-default #1 Feb 17 21:13:11 starfire kernel: Feb 17 21:13:11 starfire kernel: Call Trace: Feb 17 21:13:11 starfire kernel: [<ffffffff8020e42e>] show_trace_log_lvl+0x41/0x58 Feb 17 21:13:11 starfire kernel: [<ffffffff804a1e97>] dump_stack+0x69/0x6f Feb 17 21:13:11 starfire kernel: [<ffffffff80240eb2>] warn_on_slowpath+0x51/0x77 Feb 17 21:13:11 starfire kernel: [<ffffffff802449ad>] do_exit+0x36/0x334 Feb 17 21:13:11 starfire kernel: [<ffffffff804a4b9b>] oops_begin+0x0/0x9e Feb 17 21:13:11 starfire kernel: [<ffffffff8020edee>] do_invalid_op+0x94/0x9e Feb 17 21:13:11 starfire kernel: [<ffffffff804a426a>] error_exit+0x0/0x70 Feb 17 21:13:11 starfire kernel: [<ffffffffa00a297f>] flush_commit_list+0x5f/0x55e [reiserfs] Feb 17 21:13:11 starfire kernel: [<ffffffffa00a57fb>] __commit_trans_jl+0x16e/0x18c [reiserfs] Feb 17 21:13:11 starfire kernel: [<ffffffffa0091545>] reiserfs_sync_file+0x3b/0x7d [reiserfs] Feb 17 21:13:11 starfire kernel: [<ffffffff802dc804>] do_fsync+0x52/0x87 Feb 17 21:13:11 starfire kernel: [<ffffffff802dc85d>] __do_fsync+0x24/0x36 Feb 17 21:13:11 starfire kernel: [<ffffffff8020c37a>] system_call_fastpath+0x16/0x1b Feb 17 21:13:11 starfire kernel: [<00007fb50441d037>] 0x7fb50441d037 Feb 17 21:13:11 starfire kernel: Feb 17 21:13:11 starfire kernel: ---[ end trace 486686d0a6152d2e ]--- -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c58 --- Comment #58 from Vadim Krevs <vkrevs@yahoo.com> 2009-02-19 04:10:50 MST --- Any progress on this issue? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User swamp@suse.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c59 Swamp Script User <swamp@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status Whiteboard|maint:released:11.0:21569 |maint:released:11.0:21569 | |maint:released:11.1:22712 --- Comment #59 from Swamp Script User <swamp@suse.com> 2009-02-26 07:31:28 MST --- Update released for: kernel-debug, kernel-debug-base, kernel-debug-debuginfo, kernel-debug-debugsource, kernel-debug-extra, kernel-default, kernel-default-base, kernel-default-debuginfo, kernel-default-debugsource, kernel-default-extra, kernel-docs, kernel-kdump, kernel-kdump-debuginfo, kernel-kdump-debugsource, kernel-pae, kernel-pae-base, kernel-pae-extra, kernel-ppc64, kernel-ppc64-base, kernel-ppc64-debuginfo, kernel-ppc64-debugsource, kernel-ppc64-extra, kernel-ps3, kernel-ps3-debuginfo, kernel-ps3-debugsource, kernel-source, kernel-source-debuginfo, kernel-syms, kernel-trace, kernel-trace-base, kernel-trace-debuginfo, kernel-trace-debugsource, kernel-trace-extra, kernel-vanilla, kernel-vanilla-debuginfo, kernel-vanilla-debugsource, kernel-xen, kernel-xen-base, kernel-xen-debuginfo, kernel-xen-debugsource, kernel-xen-extra Products: openSUSE 11.1 (debug, i586, ppc, x86_64) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c60 --- Comment #60 from Jeff Mahoney <jeffm@novell.com> 2009-02-26 07:45:39 MST --- The problem isn't fixed yet. The script is picking up the debugging patch that lists the bug number. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c61 --- Comment #61 from Vadim Krevs <vkrevs@yahoo.com> 2009-02-26 10:24:05 MST --- Created an attachment (id=275764) --> (https://bugzilla.novell.com/attachment.cgi?id=275764) contents of /var/log/messages for the new kernel Hi. I've just installed the just released kernel update and immediately reproduced the error. The attachment contains the relevant /var/log/messages portion. It seems to contain new debug information - hope this helps. rpm -q kernel-default kernel-default-2.6.27.19-3.2.1 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User hharun@cs.ubc.ca added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c62 H Harun <hharun@cs.ubc.ca> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |hharun@cs.ubc.ca --- Comment #62 from H Harun <hharun@cs.ubc.ca> 2009-02-26 12:54:11 MST --- Hi, I'm running openSuSE 11.1 (kernel-default-2.6.27.7-9.1). I get this error (below) every day around 02:13 every morning. In /etc/sysconfig/cron , I set DAILY_TIME="2:00". In /etc/cron.d , I have a file update.cron which has 00 07 * * * root /nfsshare/hosts-11.1 hosts-11.1 calls "zypper lu" I use cfengine to set DAILY_TIME and put update.cron in the proper place. (Any progress???) Feb 26 02:13:21 sapporo kernel: ------------[ cut here ]------------ Feb 26 02:13:21 sapporo kernel: kernel BUG at fs/reiserfs/journal.c:1034! Feb 26 02:13:21 sapporo kernel: invalid opcode: 0000 [#1] SMP Feb 26 02:13:21 sapporo kernel: last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:02:02.0/class Feb 26 02:13:21 sapporo kernel: Modules linked in: nfs lockd nfs_acl sunrpc vmsync(N) vmmemctl(N) vmblock(N) autofs4 binfmt_misc snd_pcm_oss snd_ mixer_oss vboxdrv(N) ipv6 af_packet microcode fuse loop dm_mod snd_intel8x0 snd_ac97_codec iTCO_wdt e100 nvidia(PX) ac97_bus iTCO_vendor_support ppdev i2c_i801 mii snd_pcm parport_pc snd_timer snd intel_rng shpchp rtc_cmos i2c_core parport button sr_mod intel_agp cdrom floppy pci_hotplug r tc_core pcspkr soundcore agpgart rtc_lib snd_page_alloc sg ehci_hcd uhci_hcd sd_mod crc_t10dif usbcore edd reiserfs fan thermal processor thermal _sys hwmon ide_pci_generic piix ide_core ata_generic ata_piix libata scsi_mod dock [last unloaded: speedstep_lib] Feb 26 02:13:21 sapporo kernel: Supported: No Feb 26 02:13:21 sapporo kernel: Feb 26 02:13:21 sapporo kernel: Pid: 9903, comm: cfagent Tainted: P (2.6.27.7-9-default #1) Feb 26 02:13:21 sapporo kernel: EIP: 0060:[<e1330678>] EFLAGS: 00010246 CPU: 0 Feb 26 02:13:21 sapporo kernel: EIP is at flush_commit_list+0x56/0x50f [reiserfs] Feb 26 02:13:21 sapporo kernel: EAX: db059040 EBX: 000178ce ECX: de260200 EDX: e133c57a Feb 26 02:13:21 sapporo kernel: ESI: db72ee00 EDI: 00000000 EBP: 00000000 ESP: dd3b1ef0 Feb 26 02:13:21 sapporo kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Feb 26 02:13:21 sapporo kernel: Process cfagent (pid: 9903, ti=dd3b0000 task=db059040 task.ti=dd3b0000) Feb 26 02:13:21 sapporo kernel: Stack: 00000000 00000001 ffffffff deed7a80 00000000 00000000 00000001 de260200 Feb 26 02:13:21 sapporo kernel: 00000004 e152e000 00000000 dd3b1f70 c2a68300 c016fba0 00000000 e152e000 Feb 26 02:13:21 sapporo kernel: de260200 00000000 000178ce e13330a9 db72ee00 c2a68254 00000000 00000001 Feb 26 02:13:21 sapporo kernel: Call Trace: Feb 26 02:13:21 sapporo kernel: [<e13330a9>] __commit_trans_jl+0x124/0x138 [reiserfs] Feb 26 02:13:21 sapporo kernel: [<e131f7e1>] reiserfs_sync_file+0x33/0x70 [reiserfs] Feb 26 02:13:21 sapporo kernel: [<c01ad158>] do_fsync+0x41/0x6e Feb 26 02:13:21 sapporo kernel: [<c01ad1a1>] __do_fsync+0x1c/0x2b Feb 26 02:13:21 sapporo kernel: [<c0104c9b>] sysenter_do_call+0x12/0x2f Feb 26 02:13:21 sapporo kernel: [<ffffe430>] 0xffffe430 Feb 26 02:13:21 sapporo kernel: ======================= Feb 26 02:13:21 sapporo kernel: Code: 0c 89 44 24 24 8b 44 24 1c e8 ba fd ff ff 83 7e 14 00 0f 85 a5 04 00 00 64 a1 00 10 58 c0 f0 ff 80 d4 04 00 00 83 7e 08 00 75 04 <0f> 0b eb fe 8b 44 24 24 3b 58 18 75 04 0f 0b eb fe ff 46 60 83 Feb 26 02:13:21 sapporo kernel: EIP: [<e1330678>] flush_commit_list+0x56/0x50f [reiserfs] SS:ESP 0068:dd3b1ef0 Feb 26 02:13:21 sapporo kernel: ---[ end trace 48eccf502af12bf3 ]--- Feb 26 02:13:21 sapporo kernel: ------------[ cut here ]------------ Feb 26 02:13:21 sapporo kernel: WARNING: at kernel/exit.c:1008 do_exit+0x2e/0x2a4() Feb 26 02:13:21 sapporo kernel: Modules linked in: nfs lockd nfs_acl sunrpc vmsync(N) vmmemctl(N) vmblock(N) autofs4 binfmt_misc snd_pcm_oss snd_ mixer_oss vboxdrv(N) ipv6 af_packet microcode fuse loop dm_mod snd_intel8x0 snd_ac97_codec iTCO_wdt e100 nvidia(PX) ac97_bus iTCO_vendor_support ppdev i2c_i801 mii snd_pcm parport_pc snd_timer snd intel_rng shpchp rtc_cmos i2c_core parport button sr_mod intel_agp cdrom floppy pci_hotplug r tc_core pcspkr soundcore agpgart rtc_lib snd_page_alloc sg ehci_hcd uhci_hcd sd_mod crc_t10dif usbcore edd reiserfs fan thermal processor thermal _sys hwmon ide_pci_generic piix ide_core ata_generic ata_piix libata scsi_mod dock [last unloaded: speedstep_lib] Feb 26 02:13:21 sapporo kernel: Supported: No Feb 26 02:13:21 sapporo kernel: Pid: 9903, comm: cfagent Tainted: P D 2.6.27.7-9-default #1 Feb 26 02:13:21 sapporo kernel: [<c0106570>] dump_trace+0x6b/0x249 Feb 26 02:13:21 sapporo kernel: [<c01070a5>] show_trace+0x20/0x39 Feb 26 02:13:21 sapporo kernel: [<c0343c02>] dump_stack+0x71/0x76 Feb 26 02:13:21 sapporo kernel: [<c012ad20>] warn_on_slowpath+0x4d/0x70 Feb 26 02:13:21 sapporo kernel: [<c012e2c8>] do_exit+0x2e/0x2a4 Feb 26 02:13:21 sapporo kernel: [<c0345fe8>] oops_end+0xad/0xb2 Feb 26 02:13:21 sapporo kernel: [<c0106eed>] do_invalid_op+0x81/0x8a Feb 26 02:13:21 sapporo kernel: [<c0345c62>] error_code+0x72/0x80 Feb 26 02:13:21 sapporo kernel: [<e1330678>] flush_commit_list+0x56/0x50f [reiserfs] Feb 26 02:13:21 sapporo kernel: [<e13330a9>] __commit_trans_jl+0x124/0x138 [reiserfs] Feb 26 02:13:21 sapporo kernel: [<e131f7e1>] reiserfs_sync_file+0x33/0x70 [reiserfs] Feb 26 02:13:21 sapporo kernel: [<c01ad158>] do_fsync+0x41/0x6e Feb 26 02:13:21 sapporo kernel: [<c01ad1a1>] __do_fsync+0x1c/0x2b Feb 26 02:13:21 sapporo kernel: [<ffffe430>] 0xffffe430 Feb 26 02:13:21 sapporo kernel: ======================= Feb 26 02:13:21 sapporo kernel: ---[ end trace 48eccf502af12bf3 ]--- -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c63 --- Comment #63 from Jeff Mahoney <jeffm@novell.com> 2009-02-26 13:11:21 MST --- We have a good idea of what's happening, just not _why_ it's happening. There appears to be a use after free of reiserfs_journal_lists, but the reference counting is simple. This is being actively debugged in our enterprise kernel where it can be reliably triggered on test machines. I *still* can't reproduce this on my own, so it's slow going. As a note, this morning's kernel updated switched the opensuse kernel to the SLE kernel, so any fixes developed there will be inherited by opensuse automatically. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c64 --- Comment #64 from Vadim Krevs <vkrevs@yahoo.com> 2009-02-26 13:24:05 MST --- Perhaps this will help - on my machine the error appears in syslog within several minutes of starting an oracle 11g instance after boot up, before any application even connects to that oracle instance. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c65 --- Comment #65 from Jeff Mahoney <jeffm@novell.com> 2009-02-26 13:28:01 MST --- Your log is interesting. It's still going through __commit_trans_jl, but isn't via fsync. That's a first. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c66 --- Comment #66 from Jeff Mahoney <jeffm@novell.com> 2009-03-02 08:57:23 MST --- I believe we've fixed this problem. For 11.1, a recent KOTD kernel (http://ftp.suse.com/pub/projects/kernel/kotd/SLE11_BRANCH/) will contain the fix. For 11.0, I'm committing the fix now. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c67 --- Comment #67 from Jeff Mahoney <jeffm@novell.com> 2009-03-02 09:43:41 MST --- The next 11.0 KOTD will have the fix at http://ftp.suse.com/pub/projects/kernel/kotd/SL110_BRANCH/ Please ensure it has the following in the changelog when reporting test results: - Disabled patches.suse/reiserfs-inode-init (bnc#399966) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User jeffm@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c68 Jeff Mahoney <jeffm@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |FIXED --- Comment #68 from Jeff Mahoney <jeffm@novell.com> 2009-03-16 11:21:25 MST --- Verified in FIXED in another report. Closing. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=399966 User vkrevs@yahoo.com added comment https://bugzilla.novell.com/show_bug.cgi?id=399966#c69 --- Comment #69 from Vadim Krevs <vkrevs@yahoo.com> 2009-03-16 13:13:12 MST --- So when will an updated kernel for openSUSE 11.1 be released? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com