[Bug 381873] New: utf-8 performace in grep/wc sux!
https://bugzilla.novell.com/show_bug.cgi?id=381873 User koenig@linux.de added comment https://bugzilla.novell.com/show_bug.cgi?id=381873#c308698 Summary: utf-8 performace in grep/wc sux! Product: openSUSE 11.0 Version: Beta 1 Platform: x86-64 OS/Version: Other Status: NEW Severity: Blocker Priority: P5 - None Component: Basesystem AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: koenig@linux.de QAContact: qa@suse.de Found By: --- I know about #308698 very well, but now it's still the same problem in 11.0-beta1, so I try to raise severity with a new bug report :-((( at least "grep" and "wc" show very poor performace with UTF-8 locale, here for a small stace-logfile: harald > ll O6 -rw-r--r-- 1 harald users 15819620 Apr 21 10:34 O6 harald > LC_ALL=C time grep 29618 O6 > /dev/null 0:00.36 real, 0.01 user, 0.02 sys, 11% cpu harald > LC_ALL=de_DE.utf-8 time grep 29618 O6 > /dev/null 0:03.11 real, 2.97 user, 0.01 sys, 95% cpu harald > LC_ALL=de_DE.utf-8 time wc O6 96071 687927 15819620 O6 1.088 0m1.088s real, 1.076 user, 0.012 sys, 99.99 cpu harald > LC_ALL=C time wc O6 96071 687927 15819620 O6 0:00.16 real, 0.15 user, 0.00 sys, 99% cpu you can't really think about releasing a new openSUSE version without fixing this issue???? it's a *BIG* shame for SUSE/NOVELL anyway that this issue hasn't been fixed in 10.3 for such a long time now :-(( -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=381873
Cyril Hrubis
https://bugzilla.novell.com/show_bug.cgi?id=381873
User jengelh@gmx.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=381873#c1
Jan Engelhardt
https://bugzilla.novell.com/show_bug.cgi?id=381873
User koenig@linux.de added comment
https://bugzilla.novell.com/show_bug.cgi?id=381873#c2
--- Comment #2 from Harald Koenig
you can't really think about releasing a new openSUSE version without fixing this issue????
hello!!! anyone out there?!?!? I'm shocked that it's already beta-2 and there seems to be no significant improvement in this case. what else can be done to escalade this issue ??? I'm not sure if anything was changed in grep at all (maybe?), but at least the "-i" option still sux, so does "wc" and likely other tools and options. my current data looks like this: harald > rpm -qf /usr/bin/grep grep-2.5.2-64 harald > ll -L /var/lib/pin/ARCHIVES.gz -rw-r--r-- 1 root root 40230619 Apr 30 15:50 /var/lib/pin/ARCHIVES.gz good: harald > zcat /var/lib/pin/ARCHIVES.gz | LC_ALL=de_DE.utf-8 time grep foobar /inst-source1/suse/noarch/texlive-latex-doc-2007-161.noarch.rpm: -rw-r--r-- 1 root root 209 Jan 9 2006 /usr/share/texmf/doc/latex/examdesign/foobar.tex 0:05.57 real, 0.41 user, 0.16 sys, 10% cpu harald > zcat /var/lib/pin/ARCHIVES.gz | LC_ALL=C time grep foobar /inst-source1/suse/noarch/texlive-latex-doc-2007-161.noarch.rpm: -rw-r--r-- 1 root root 209 Jan 9 2006 /usr/share/texmf/doc/latex/examdesign/foobar.tex 0:05.56 real, 0.57 user, 0.15 sys, 13% cpu harald > zcat /var/lib/pin/ARCHIVES.gz | LC_ALL=C time grep -i foobar /inst-source1/suse/ppc/gnupod-0.99.5-34.ppc.rpm: -r--r--r-- 1 root root 11636 Apr 26 00:51 /usr/lib/perl5/vendor_perl/5.10.0/ppc-linux-thread-multi-64int/GNUpod/FooBar.pm /inst-source1/suse/i586/gnupod-0.99.5-34.i586.rpm: -r--r--r-- 1 root root 11636 Apr 26 00:32 /usr/lib/perl5/vendor_perl/5.10.0/i586-linux-thread-multi/GNUpod/FooBar.pm /inst-source1/suse/noarch/texlive-latex-doc-2007-161.noarch.rpm: -rw-r--r-- 1 root root 209 Jan 9 2006 /usr/share/texmf/doc/latex/examdesign/foobar.tex /inst-source1/suse/x86_64/gnupod-0.99.5-34.x86_64.rpm: -r--r--r-- 1 root root 11636 Apr 25 22:16 /usr/lib/perl5/vendor_perl/5.10.0/x86_64-linux-thread-multi/GNUpod/FooBar.pm 0:05.59 real, 0.49 user, 0.16 sys, 11% cpu BAD: harald > zcat /var/lib/pin/ARCHIVES.gz | LC_ALL=de_DE.utf-8 time grep -i foobar /inst-source1/suse/ppc/gnupod-0.99.5-34.ppc.rpm: -r--r--r-- 1 root root 11636 Apr 26 00:51 /usr/lib/perl5/vendor_perl/5.10.0/ppc-linux-thread-multi-64int/GNUpod/FooBar.pm /inst-source1/suse/i586/gnupod-0.99.5-34.i586.rpm: -r--r--r-- 1 root root 11636 Apr 26 00:32 /usr/lib/perl5/vendor_perl/5.10.0/i586-linux-thread-multi/GNUpod/FooBar.pm /inst-source1/suse/noarch/texlive-latex-doc-2007-161.noarch.rpm: -rw-r--r-- 1 root root 209 Jan 9 2006 /usr/share/texmf/doc/latex/examdesign/foobar.tex /inst-source1/suse/x86_64/gnupod-0.99.5-34.x86_64.rpm: -r--r--r-- 1 root root 11636 Apr 25 22:16 /usr/lib/perl5/vendor_perl/5.10.0/x86_64-linux-thread-multi/GNUpod/FooBar.pm 0:34.48 real, 29.28 user, 0.16 sys, 85% cpu good: harald > zcat /var/lib/pin/ARCHIVES.gz | LC_ALL=C time wc 3331696 30052221 535205599 0:09.43 real, 4.50 user, 0.13 sys, 49% cpu BAD: harald > zcat /var/lib/pin/ARCHIVES.gz | LC_ALL=de_DE.utf-8 time wc 3331696 30052221 535205599 0:41.74 real, 36.58 user, 0.14 sys, 87% cpu BTW: this last "good" example with "zcat | wc" shows another (for me) surprise with bad performace: I'm running this on a intel T7200 Core2Duo CPU (with SMP kernel;) but only one CPU gets used:-(( more details about the pipe stuff go here: https://bugzilla.novell.com/show_bug.cgi?id=387347 -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=381873
Andreas Schwab
https://bugzilla.novell.com/show_bug.cgi?id=381873
User coolo@novell.com added comment
https://bugzilla.novell.com/show_bug.cgi?id=381873#c3
Stephan Kulow
participants (1)
-
bugzilla_noreply@novell.com