http://bugzilla.novell.com/show_bug.cgi?id=566288
http://bugzilla.novell.com/show_bug.cgi?id=566288#c17
Robert Schweikert changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEEDINFO |NEW
InfoProvider|rschweikert@novell.com |
--- Comment #17 from Robert Schweikert 2010-04-19 12:11:37 UTC ---
Here is a trace from an oops that occurred with the debug kernel. The trace
occurred while heavy I/O was taking place.
Apr 17 01:18:35 triumph kernel: [629085.891576] page:ffffea0001e53000
flags:0020000000000000 count:-775699968 mapcount:-43821 mapping:(null) index:46
Apr 17 01:18:35 triumph kernel: [629085.891594] Pid: 1122, comm: tar Tainted: P
2.6.31.12-0.2-default #1
Apr 17 01:18:35 triumph kernel: [629085.891603] Call Trace:
Apr 17 01:18:35 triumph kernel: [629085.891624] [<ffffffff81011749>]
try_stack_unwind+0x189/0x1b0
Apr 17 01:18:35 triumph kernel: [629085.891638] [<ffffffff8101013d>]
dump_trace+0x9d/0x330
Apr 17 01:18:35 triumph kernel: [629085.891651] [<ffffffff81011254>]
show_trace_log_lvl+0x64/0x90
Apr 17 01:18:35 triumph kernel: [629085.891663] [<ffffffff810112a3>]
show_trace+0x23/0x40
Apr 17 01:18:35 triumph kernel: [629085.891676] [<ffffffff81554ebc>]
dump_stack+0x81/0x9e
Apr 17 01:18:35 triumph kernel: [629085.891690] [<ffffffff8110f739>]
bad_page+0xf9/0x160
Apr 17 01:18:35 triumph kernel: [629085.891702] [<ffffffff8110ff9b>]
prep_new_page+0x3b/0x190
Apr 17 01:18:35 triumph kernel: [629085.891714] [<ffffffff8111079f>]
get_page_from_freelist+0x35f/0x6e0
Apr 17 01:18:35 triumph kernel: [629085.891726] [<ffffffff811111e5>]
__alloc_pages_nodemask+0xe5/0x160
Apr 17 01:18:35 triumph kernel: [629085.891739] [<ffffffff81146f6e>]
alloc_pages_current+0x8e/0xe0
Apr 17 01:18:35 triumph kernel: [629085.891753] [<ffffffff8110a045>]
__page_cache_alloc+0x85/0x90
Apr 17 01:18:35 triumph kernel: [629085.891765] [<ffffffff81116209>]
__do_page_cache_readahead+0xd9/0x1a0
Apr 17 01:18:35 triumph kernel: [629085.891778] [<ffffffff811162ff>]
ra_submit+0x2f/0x50
Apr 17 01:18:35 triumph kernel: [629085.891790] [<ffffffff8111657d>]
ondemand_readahead+0x11d/0x260
Apr 17 01:18:35 triumph kernel: [629085.891802] [<ffffffff81116760>]
page_cache_async_readahead+0xa0/0xc0
Apr 17 01:18:35 triumph kernel: [629085.891815] [<ffffffff8110b321>]
T.778+0x1f1/0x440
Apr 17 01:18:35 triumph kernel: [629085.891827] [<ffffffff8110b636>]
generic_file_aio_read+0xc6/0x1f0
Apr 17 01:18:35 triumph kernel: [629085.891872] [<ffffffffa0145cd5>]
xfs_read+0x165/0x3c0 [xfs]
Apr 17 01:18:35 triumph kernel: [629085.891975] [<ffffffffa01405be>]
xfs_file_aio_read+0x6e/0x90 [xfs]
Apr 17 01:18:35 triumph kernel: [629085.892067] [<ffffffff8115a4e2>]
do_sync_read+0x102/0x160
Apr 17 01:18:35 triumph kernel: [629085.892080] [<ffffffff8115aa15>]
vfs_read+0xd5/0x1c0
Apr 17 01:18:35 triumph kernel: [629085.892091] [<ffffffff8115b13b>]
sys_read+0x5b/0xa0
Apr 17 01:18:35 triumph kernel: [629085.892103] [<ffffffff8100c602>]
system_call_fastpath+0x16/0x1b
Apr 17 01:18:35 triumph kernel: [629085.892120] [<00007f12acd05a90>]
0x7f12acd05a90
Apr 17 01:20:01 triumph kernel: [629171.986844] BUG: Bad page state in process
tar pfn:91492
Apr 17 01:20:01 triumph kernel: [629171.986865] page:ffffea0001fc7ff0
flags:0020000000000000 count:0 mapcount:0 mapping:000054d2d1c3c200 index:49f
Apr 17 01:20:01 triumph kernel: [629171.986876] Pid: 1122, comm: tar Tainted: P
B 2.6.31.12-0.2-default #1
Apr 17 01:20:01 triumph kernel: [629171.986885] Call Trace:
Apr 17 01:20:01 triumph kernel: [629171.986902] [<ffffffff81011749>]
try_stack_unwind+0x189/0x1b0
Apr 17 01:20:01 triumph kernel: [629171.986916] [<ffffffff8101013d>]
dump_trace+0x9d/0x330
Apr 17 01:20:01 triumph kernel: [629171.986928] [<ffffffff81011254>]
show_trace_log_lvl+0x64/0x90
Apr 17 01:20:01 triumph kernel: [629171.986940] [<ffffffff810112a3>]
show_trace+0x23/0x40
Apr 17 01:20:01 triumph kernel: [629171.986952] [<ffffffff81554ebc>]
dump_stack+0x81/0x9e
Apr 17 01:20:01 triumph kernel: [629171.986965] [<ffffffff8110f739>]
bad_page+0xf9/0x160
Apr 17 01:20:01 triumph kernel: [629171.986977] [<ffffffff8110ff9b>]
prep_new_page+0x3b/0x190
Apr 17 01:20:01 triumph kernel: [629171.986989] [<ffffffff8111079f>]
get_page_from_freelist+0x35f/0x6e0
Apr 17 01:20:01 triumph kernel: [629171.987001] [<ffffffff811111e5>]
__alloc_pages_nodemask+0xe5/0x160
Apr 17 01:20:01 triumph kernel: [629171.987014] [<ffffffff81146f6e>]
alloc_pages_current+0x8e/0xe0
Apr 17 01:20:01 triumph kernel: [629171.987027] [<ffffffff8110a045>]
__page_cache_alloc+0x85/0x90
Apr 17 01:20:01 triumph kernel: [629171.987040] [<ffffffff81116209>]
__do_page_cache_readahead+0xd9/0x1a0
Apr 17 01:20:01 triumph kernel: [629171.987053] [<ffffffff811162ff>]
ra_submit+0x2f/0x50
Apr 17 01:20:01 triumph kernel: [629171.987064] [<ffffffff8111657d>]
ondemand_readahead+0x11d/0x260
Apr 17 01:20:01 triumph kernel: [629171.987077] [<ffffffff81116760>]
page_cache_async_readahead+0xa0/0xc0
Apr 17 01:20:01 triumph kernel: [629171.987090] [<ffffffff8110b321>]
T.778+0x1f1/0x440
Apr 17 01:20:01 triumph kernel: [629171.987102] [<ffffffff8110b636>]
generic_file_aio_read+0xc6/0x1f0
Apr 17 01:20:01 triumph kernel: [629171.987145] [<ffffffffa0145cd5>]
xfs_read+0x165/0x3c0 [xfs]
Apr 17 01:20:01 triumph kernel: [629171.987248] [<ffffffffa01405be>]
xfs_file_aio_read+0x6e/0x90 [xfs]
Apr 17 01:20:01 triumph kernel: [629171.987328] [<ffffffff8115a4e2>]
do_sync_read+0x102/0x160
Apr 17 01:20:01 triumph kernel: [629171.987341] [<ffffffff8115aa15>]
vfs_read+0xd5/0x1c0
Apr 17 01:20:01 triumph kernel: [629171.987352] [<ffffffff8115b13b>]
sys_read+0x5b/0xa0
Apr 17 01:20:01 triumph kernel: [629171.987364] [<ffffffff8100c602>]
system_call_fastpath+0x16/0x1b
Apr 17 01:20:01 triumph kernel: [629171.987380] [<00007f12acd05a90>]
0x7f12acd05a90
Some more observations.
- When I run system repair from the install DVD it always complains about a
corrupted XFS partition, even if I run repair pretty much right after install
- System might also just hang and no dump is triggered, when the system goes
into never never land keyboard LEDs for caps lock and num lock blink, this
behavior has been observed when removing large directories from disk or
unpacking large tarballs (> 1GB)
- Running mem test on the machine triggers no errors (let mem test run for 16
passes)
- Running firmware test triggers 4 failing tests:
~ [FAIL] OS/2 memory hole test
The memory map has a memory hole between 15Mb and 16 Mb
~ [FAIL] MTRR validation
many addresses with "has incorrect attribute write-back
~ [FAIL] HPET configuration test
Failed to locate HPET
~ [FAIL] EDD Boot disk hinting
Boot device 0x80 does not support EDD
Additional info:
~ I run 11.2 on 2 laptops with Centrino Duo (i.e. 2 core processors) and am
also using XFS. Unpacking the same large tarball and remove operations similar
to those on the flacky desktop machine do not trigger these types of errors
The hardware:
http://www.evga.com/articles/389.asp
Intel Core 2 Quad Q6700
8 GB DDR3 memory
1 TB SATA drive
--
Configure bugmail: http://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.