[opensuse-kernel] Re: 2.6.34-rc3: simple du (on a big xfs tree) triggers oom killer [bisected: 57817c68229984818fea9e614d6f95249c3fb098]
Hi Dave, On Tuesday 06 April 2010, 01:06:00 Dave Chinner wrote:
On Mon, Apr 05, 2010 at 01:35:41PM +0200, Hans-Peter Jansen wrote:
Oh, this is a highmem box. You ran out of low memory, I think, which is where all the inodes are cached. Seems like a VM problem or a highmem/lowmem split config problem to me, not anything to do with XFS...
With all due respect, I disagree. See below.
Might be, I don't have a chance to test this on a different FS. Thanks for the answer anyway, Dave. I hope, you don't mind, that I keep you copied on this thread..
This matter is, I cannot locate the problem from the syslog output. Might be a "can't see the forest because all the trees" syndrome.
Well, I have to ask why you are running a 32bit PAE kernel when your CPU is:
<6>[ 0.085062] CPU0: Intel(R) Xeon(R) CPU X3460 @ 2.80GHz stepping 05
64bit capable. Use a 64 bit kernel and this problem should go away.
Sure, but for compatibility reasons with a customer setup, that I'm fully
responsible for and we strongly depend on, it is i586 still. (and it's a
system, that I've full access on only for a few hours on sundays, which
punishes my family..).
Dave, I really don't want to disappoint you, but a lengthy bisection session
points to:
57817c68229984818fea9e614d6f95249c3fb098 is the first bad commit
commit 57817c68229984818fea9e614d6f95249c3fb098
Author: Dave Chinner
It's hard to believe, that a current kernel on a current system with 12 GB, even if using the insane pae on i586 is not able to cope with an du on a 1.1 TB file tree. Since du is invokable by users, this creates a pretty ugly DOS attack for local users.
Agreed. And FWIW, don't let your filesystems get near ENOSPC on 2.6.34-rc, either....
(i.e. under sustained write load, 2.6.34-rc will hit the OOM killer on page cache allocation before the filesystem can report ENOSPC to the user application. Test 224 in the xfsqa suite on a VM w/ 1GB RAM will trigger this with > 90% reliability....)
Hmm, thanks for the warning. Will resort to 2.6.33.2 for now on my servers
and keep an eye on the xfs commit logs...
Cheers && greetings to the orbit ;-),
Pete
For the sake of completeness, here's the revert:
---
commit dfe0d292280ad21c9cf3f240bb415913715d8980
Author: Hans-Peter Jansen
participants (1)
-
Hans-Peter Jansen