Mailinglist Archive: opensuse-bugs (7475 mails)

< Previous Next >
[Bug 358508] New: Aborting journal on device; ext3_abort called; EXT3-fs error (device sda3): ext3_journal_start_sb: Detected aborted journal; Remounting filesystem read-only
  • From: bugzilla_noreply@xxxxxxxxxx
  • Date: Mon, 4 Feb 2008 06:15:45 -0700 (MST)
  • Message-id: <bug-358508-21960@xxxxxxxxxxxxxxxxxxxxxxxxx/>
https://bugzilla.novell.com/show_bug.cgi?id=358508


Summary: Aborting journal on device; ext3_abort called; EXT3-fs
error (device sda3): ext3_journal_start_sb: Detected
aborted journal; Remounting filesystem read-only
Product: SUSE Linux 10.1
Version: Final
Platform: x86-64
OS/Version: SuSE Linux 10.1
Status: NEW
Severity: Critical
Priority: P5 - None
Component: Kernel
AssignedTo: kernel-maintainers@xxxxxxxxxxxxxxxxxxxxxx
ReportedBy: vik@xxxxxxx
QAContact: qa@xxxxxxx
Found By: Other


I get the following errors in dmesg and /var/log/messeges periodically:

Nov 3 21:13:23 emcgw kernel: Aborting journal on device sda3.
Nov 3 21:13:52 emcgw kernel: ext3_abort called.
Nov 3 21:13:52 emcgw kernel: EXT3-fs error (device sda3):
ext3_journal_start_sb: Detected aborted journal
Nov 3 21:13:52 emcgw kernel: Remounting filesystem read-only

After I stop the server and boot from a rescue SLES 10 SP1 and remove the
journal from the ext3 root file system, make an e2fsck, then put the journal
back and boot again than it's ok, but when I reboot again because of a new FC
SAN Disc assigned to the server the error message comes in a few days again.
Sometimes after two days sometimes after two weeks. I'm having this problem I
think three months and can't find any solution in google or anywhere else.

These are two dell power edge 1850 computers with the same SLES 10 SP1 having
the same problem. (fully updated) The problem comes on an ext3 file system
which is on an integrated RAID controller handled by the megaraid_mbox kernel
module. (RAID bus controller: Dell PowerEdge Expandable RAID controller 4)
There is also a two port Emulex FC card in both machines connecting to an EMC
CX-320 through EMC Powerpath software.

I think it's not a hardware error and the healths of the discs are ok.
I have done tests with dell diagnostics and I have been searching for bad
blocks but nothing found.
There are no I/O error messages only the few lines I copied here, so it is
impossible for me to find out what the problem is.
I'm also not able to generate this error it comes randomly.


Please tell me what information you need to solve this problem.


--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

< Previous Next >