[Bug 209593] New: Reproducable OOPS in journal_mark_freed [reiserfs] on online update
https://bugzilla.novell.com/show_bug.cgi?id=209593 Summary: Reproducable OOPS in journal_mark_freed [reiserfs] on online update Product: SUSE Linux 10.1 Version: Final Platform: i686 OS/Version: SuSE Linux 10.1 Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: nine@detonation.org QAContact: qa@suse.de I've got a reproducable OOPS with a following hang when running an online update (either with yast, the online updater or rug update). It seems like it's always the same call trace. Unfortunately I did not succeed in capturing it, not even with network syslog. I only get the Oops, but not the call trace, so I had to type it in manually: Call Trace: [<e0e77bea>] journal_mark_freed+0x13d/0x289 [reiserfs] reiserfs_free_block+0x27/0x3a [reiserfs] prepare_for_delete_or_cut+0x574/0x5f4 [reiserfs] reiserfs_cut_from_item+0xd8/0x756 [reiserfs] search_by_key+0x16f/0xcbb [reiserfs] reiserfs_do_truncate+0x374/0x4c2 [reiserfs] reiserfs_delete_object+0x30/0x5f [reiserfs] reiserfs_delete_inode+0x88/0xf0 [reiserfs] reiserfs_delete_inode+0x0/0xf0 [reiserfs] generic_delete_inode+0xa6/0xfe iput+0x63/0x65 dput+0xfb/0x113 sys_renameat+0x15f/0x1af current_fs_time+0x40/0x4b schedule+0x4c1/0x52e sys_rename+0x11/0x15 sysenter_past_esp+0x53/0x79 Code: eb 6f 8b 55 d8 8b 45 d4 8b 4d e0 8b 52 0c 89 55 f0 c1 e2 03 8b 49 04 89 d6 31 d2 f7 f6 89 4d ec 89 c1 8b 45 ec c1 e1 02 89 4d e8 <83> 3c 08 00 75 49 8b 55 d8 8b 82 5c 81 00 00 8b 50 0c 89 45 d0 Oct 2 23:38:50 sunshine kernel: Unable to handle kernel paging request at virtual address e082dad0 Badness in do_exit at kernel/exit.c:807 do_exit+0x44/0x617 printk+0x14/0x18 do_page_fault+0x0/0x51e3 show_stack+0x0/0xa do_page_fault+0x373/0x51e [<e0e77bea>] journal_mark_freed+0x13d/0x289 [reiserfs] do_page_fault+0x0/0x51e error_code+0x4f/0x60 text.lock.mmap+0x36/0x63 [<e0e77bea>] journal_mark_freed+0x13d/0x289 [reiserfs] reiserfs_free_block+0x27/0x3a [reiserfs] prepare_for_delete_or_cut+0x574/0x5f4 [reiserfs] reiserfs_cut_from_item+0xd8/0x756 [reiserfs] search_by_key+0x16f/0xcbb [reiserfs] reiserfs_do_truncate+0x374/0x4c2 [reiserfs] reiserfs_delete_object+0x30/0x5f [reiserfs] reiserfs_delete_inode+0x88/0xf0 [reiserfs] reiserfs_delete_inode+0x0/0xf0 [reiserfs] generic_delete_inode+0xa6/0xfe iput+0x63/0x65 dput+0xfb/0x113 sys_renameat+0x15f/0x1af current_fs_time+0x40/0x4b schedule+0x4c1/0x52e sys_rename+0x11/0x15 sysenter_past_esp+0x53/0x79 Oct 2 23:40:55 sunshine.lan kernel: Unable to handle kernel paging request at virtual address e082dad0x Oct 2 23:40:55 sunshine.lan kernel: printing eip:x Oct 2 23:40:55 sunshine.lan kernel: e0e77beax Oct 2 23:40:55 sunshine.lan kernel: *pde = 1ff5e067x Oct 2 23:40:55 sunshine.lan kernel: Oops: 0000 [#1]x Oct 2 23:40:57 sunshine.lan kernel: last sysfs file: /class/net/eth0/carrierx Oct 2 23:40:57 sunshine.lan kernel: Modules linked in: af_packet cpufreq_ondemand cpufreq_userspace cpufreq_powersave acpi_cpufreq freq_table ipv6 edd snd_pcm_oss snd_mixer_oss thermal processor fan button battery ac sg sd_mod loop dm_mod pcmcia firmware_class usb_storage scsi_mod shpchp pci_hotplug intel_agp agpgart snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm snd_timer snd soundcore snd_page_alloc uhci_hcd ehci_hcd yenta_socket rsrc_nonstatic pcmcia_core ohci1394 ieee1394 i2c_i801 i2c_core b44 usbcore i8xx_tco m ii ide_cd cdrom parport_pc lp parport reiserfs piix ide_disk ide_corex Oct 2 23:40:57 sunshine.lan kernel: CPU: 0x Oct 2 23:40:57 sunshine.lan kernel: EIP: 0060:[<e0e77bea>] Not tainted VLIx Oct 2 23:40:57 sunshine.lan kernel: EFLAGS: 00010202 (2.6.16.21-0.13-default #1)xx Oct 2 23:40:57 sunshine.lan kernel: EIP is at journal_mark_freed+0x13d/0x289 [reiserfs]x Oct 2 23:40:58 sunshine.lan kernel: eax: e0823000 ebx: 00000000 ecx: 0000aad0 edx: 000014e3x Oct 2 23:40:58 sunshine.lan kernel: esi: 00008000 edi: 00000001 ebp: c464fb40 esp: c464fb0cx Oct 2 23:40:58 sunshine.lan kernel: ds: 007b es: 007b ss: 0068x Oct 2 23:40:58 sunshine.lan kernel: Process rpm (pid: 5771, threadinfo=c464e000 task=dfa1d570)x Oct 2 23:40:59 sunshine.lan kernel: Stack: <0>c464fc0c dfaed200 155a14e3 dfaed200 e085b000 e085b0f4 00000f32 0000aad0xx Oct 2 23:40:59 sunshine.lan kernel: e0823000 00001000 c464fe8c 155a14e3 00000001 c917f84c e0e5847e 155a14e3xx Oct 2 23:40:59 sunshine.lan kernel: c0abcdd8 00000005 c464fe04 e0e713e3 00000001 00008012 00000000 c917f84cxx Oct 2 23:40:59 sunshine.lan kernel: Call Trace:x Today's try produced this in my logfile: Oct 3 08:57:44 sunshine.lan kernel: ReiserFS: hda2: warning: vs-4075: reiserfs_free_block: block 358225123 is out of range on hda2 x Oct 3 08:57:44 sunshine.lan kernel: Unable to handle kernel paging request at virtual address 72695f78x Oct 3 08:57:45 sunshine.lan kernel: printing eip:x Oct 3 08:57:45 sunshine.lan kernel: e0e77c5dx Oct 3 08:57:45 sunshine.lan kernel: *pde = 00000000x Oct 3 08:57:45 sunshine.lan kernel: Oops: 0000 [#1]x Oct 3 08:57:45 sunshine.lan kernel: last sysfs file: /class/net/eth0/carrierx Oct 3 08:57:46 sunshine.lan kernel: Modules linked in: af_packet cpufreq_ondemand cpufreq_userspace cpufreq_powersave acpi_cpufreq freq_table ipv6 edd snd_pcm_oss snd_mixer_oss thermal processor fan button battery ac sg sd_mod loop dm_mod usb_storage scsi_mod p cmcia firmware_class shpchp pci_hotplug i2c_i801 i2c_core ohci1394 ieee1394 ehci_hcd b44 i8xx_tco uhci_hcd ide_cd cdrom yenta_socke t rsrc_nonstatic pcmcia_core snd_intel8x0 snd_ac97_codec snd_ac97_bus intel_agp usbcore snd_pcm snd_timer snd soundcore snd_page_al loc mii agpgart parport_pc lp parport reiserfs piix ide_disk ide_corex Oct 3 08:57:46 sunshine.lan kernel: CPU: 0x -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 gregkh@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- AssignedTo|kernel- |jeffm@novell.com |maintainers@forge.provo.nove| |ll.com | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 jeffm@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO Info Provider| |nine@detonation.org ------- Comment #1 from jeffm@novell.com 2006-10-05 10:26 MST ------- Both of these oopsen are due to an invalid block being freed. The first is in set_bit_in_list_bitmap(): static int set_bit_in_list_bitmap(struct super_block *p_s_sb, int block, struct reiserfs_list_bitmap *jb) { int bmap_nr = block / (p_s_sb->s_blocksize << 3); int bit_nr = block % (p_s_sb->s_blocksize << 3); if (!jb->bitmaps[bmap_nr]) { <----- jb->bitmaps[bmap_nr] = get_bitmap_node(p_s_sb); } set_bit(bit_nr, (unsigned long *)jb->bitmaps[bmap_nr]->data); return 0; } .. but the second doesn't contain enough information to debug exactly where it occurs. Can you run reiserfsck --check on this file system? If it's your root fs, you'll need to boot from the rescue image. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 nine@detonation.org changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |ASSIGNED Info Provider|nine@detonation.org | ------- Comment #2 from nine@detonation.org 2006-10-05 14:20 MST ------- Thanks for your advise. reiserfsck found 26 errors in the filesystem. After fixing them and the more severe damage I inflicted because I forgot that I was in suspend to disk when booting the rescue system, the update now went through without any error or problem. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 ------- Comment #3 from nine@detonation.org 2006-10-05 14:21 MST ------- Created an attachment (id=100624) --> (https://bugzilla.novell.com/attachment.cgi?id=100624&action=view) the errors found by reiserfsck -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 ------- Comment #4 from jeffm@novell.com 2006-10-05 14:29 MST ------- Ok, glad you're up and running again. Those errors do tell me that you're going to have a corrupt file. You can use debugreiserfs -D <dev> on a mounted file system to dump the metadata. If you search through the output, look for a directory entry that points to the key [956483 956376]. That will tell you what file was corrupted. Sorry, the recovery tools don't yet support a file-id by key method. The file system should have handled this case more gracefully. I have a set of patches for handling errors better, so I'll add this case to the list. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 jeffm@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- OtherBugsDependingO| |248079 nThis| | -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 jeffm@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |NEEDINFO Info Provider| |jeffm@novell.com ------- Comment #5 from jeffm@novell.com 2007-02-22 16:06 MST ------- Setting NEEDINFO to me, putting this bug to sleep. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 jeffm@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |RESOLVED Info Provider|jeffm@novell.com | Resolution| |LATER ------- Comment #6 from jeffm@novell.com 2007-02-22 16:36 MST ------- Actually, closing as LATER. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=209593 User coolo@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=209593#c7 Stephan Kulow <coolo@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |REOPENED Resolution|LATER | --- Comment #7 from Stephan Kulow <coolo@novell.com> 2008-06-25 03:40:21 MDT --- mass reopening all SuSE Linux bugs that are set to REMIND+LATER to change the resolution to WONTFIX (adapting to new policy) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=209593 User coolo@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=209593#c8 --- Comment #8 from Stephan Kulow <coolo@novell.com> 2008-06-25 03:41:15 MDT --- mass reopening all SuSE Linux bugs that are set to REMIND+LATER to change the resolution to WONTFIX (adapting to new policy) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=209593 User coolo@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=209593#c9 --- Comment #9 from Stephan Kulow <coolo@novell.com> 2008-06-25 03:43:50 MDT --- mass reopening all SuSE Linux bugs that are set to REMIND+LATER to change the resolution to WONTFIX (adapting to new policy) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=209593 User coolo@novell.com added comment https://bugzilla.novell.com/show_bug.cgi?id=209593#c10 Stephan Kulow <coolo@novell.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|REOPENED |RESOLVED Resolution| |WONTFIX --- Comment #10 from Stephan Kulow <coolo@novell.com> 2008-06-25 03:56:49 MDT --- Closing old LATER+REMIND bugs as WONTFIX - if you still plan to work on it, feel free to reopen and set to ASSIGNED. In case the report saw repeated reopen comments, it's due to bugzilla timing out on the huge request ;( -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com