All,

I have a huge text file (1.7 million lines) full of Unicode and ASCII text (about half and half).

Note: disk space for copies is not a problem if I need to manipulate this file.

Also, I have a 30-line file full of ASCII text. I need to search the large file for any occurrences of the keywords in the 30-line file.

Ignoring the Unicode issue, I could use grep (or fgrep, egrep) with appropriate args. I have no idea how to handle the Unicode issue.

Any suggestions?

Thanks
Greg

--
Greg Freemyer
Litigation Triage Solutions Specialist
http://www.linkedin.com/in/gregfreemyer
First 99 Days Litigation White Paper -
http://www.norcrossgroup.com/forms/whitepapers/99%20Days%20whitepaper.pdf
The Norcross Group
The Intersection of Evidence & Technology
http://www.norcrossgroup.com

--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
For additional commands, e-mail: opensuse+help@opensuse.org
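[Editor's note: ignoring the encoding question, the keyword search itself can be sketched like this; the filenames and contents are stand-ins for the real 30-line keyword list and the 1.7-million-line file.]

```shell
# Stand-in keyword list (one keyword per line) and target file.
printf 'invoice\ncontract\n' > keywords.txt
printf 'line with invoice\nnothing here\nsigned CONTRACT\n' > big.txt

# -F: treat patterns as fixed strings (no regex surprises)
# -f: read the patterns from a file
# -i: case-insensitive match
grep -F -i -f keywords.txt big.txt
```

With the sample data above this prints the first and third lines; fgrep is the historical spelling of grep -F.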
Greg Freemyer wrote:
All,
I have a huge text file (1.7 million lines) full of unicode and ascii text (half and half).
note: disk space for copies is not a problem, if I need to manipulate this file
Also, I have a 30 line file full of ascii text.
I need to search the large file for any occurrences of the keywords in the 30 line file.
Ignoring the unicode issue, I could use grep (or fgrep, egrep) with appropriate args.
I have no idea how to handle the unicode issue.
The same way, but be prepared to use \123-style escapes in your patterns.
Any suggestions?
Handle what, specifically? (I'm not sure what you mean -- grepping Unicode, or converting it to ASCII, or what.)
Thanks Greg
Greg Freemyer wrote:
All,
I have a huge text file (1.7 million lines) full of unicode and ascii text (half and half).
note: disk space for copies is not a problem, if I need to manipulate this file
Also, I have a 30 line file full of ascii text.
I need to search the large file for any occurrences of the keywords in the 30 line file.
Ignoring the unicode issue, I could use grep (or fgrep, egrep) with appropriate args.
I have no idea how to handle the unicode issue.
ASCII is a subset of UTF-8, so if your Unicode encoding is UTF-8 there should be no problem. Otherwise I wonder how you managed to create a file with interspersed single-byte and (fixed-size) multibyte encodings... I guess you would have to split the file somehow.

Best regards
Petr
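[Editor's note: a quick sketch of the "ASCII is a subset of UTF-8" point; the sample file is made up for illustration.]

```shell
# A UTF-8 file containing a multibyte character (e with acute accent,
# bytes \303\251) alongside plain ASCII lines.
printf 'caf\303\251 menu\nkeyword here\n' > utf8.txt

# Plain-ASCII keywords can be grepped directly, no conversion needed:
# every ASCII byte means the same thing in UTF-8.
grep -c keyword utf8.txt
```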
On Feb 19, 2008 8:01 AM, Petr Cerny
Greg Freemyer wrote:
All,
I have a huge text file (1.7 million lines) full of unicode and ascii text (half and half).
note: disk space for copies is not a problem, if I need to manipulate this file
Also, I have a 30 line file full of ascii text.
I need to search the large file for any occurrences of the keywords in the 30 line file.
Ignoring the unicode issue, I could use grep (or fgrep, egrep) with appropriate args.
I have no idea how to handle the unicode issue.
ASCII is a subset of UTF-8, so if your Unicode encoding is UTF-8 there should be no problem. Otherwise I wonder how you managed to create a file with interspersed single-byte and (fixed-size) multibyte encodings... I guess you would have to split the file somehow.
Best regards Petr
You can see I'm a bit confused. I have not worked much with Unicode, and not at all on Linux. To me the Unicode file looks like normal ASCII with a null between every byte; I think that's normal, from my limited exposure to Unicode.

Status: I'm not at the office, but they recreated the 1.7 million line file last night as pure Unicode. I can't test it for a couple of hours, but is the claim that I can use grep to search a Unicode file for an ASCII text pattern?

Yesterday I looked for a grep switch that would tell it the file being searched was Unicode, but I didn't see one. I guess it could be automatic? I did not actually attempt it.

If that is not feasible, a tool to convert that Unicode file into an ASCII file would be great. All the text I am interested in should convert to ASCII in the obvious way; no need for any mappings. Basically, I just need to get rid of the nulls.

Greg
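[Editor's note: a minimal sketch of the "just get rid of the nulls" approach, assuming the file really is ASCII text stored as UTF-16LE; the filenames and sample text are placeholders.]

```shell
# Build a tiny UTF-16LE sample (ASCII bytes each followed by a NUL),
# standing in for the real exported file.
printf 'hello keyword world\n' | iconv -f ASCII -t UTF-16LE > unicode.txt

# Quick-and-dirty conversion: delete the NUL bytes, leaving plain ASCII.
tr -d '\000' < unicode.txt > ascii.txt

# The ASCII pattern now matches.
grep -c keyword ascii.txt
```

This only works cleanly if the file contains nothing outside ASCII; anything genuinely multibyte should go through iconv instead, as described below in the thread.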
Greg Freemyer wrote:
You can see I'm a bit confused. I have not worked much with Unicode, and not at all on Linux. To me the Unicode file looks like normal ASCII with a null between every byte. I think that's normal from my limited exposure to Unicode.
Status: I'm not at the office, but they recreated the 1.7 million line file last night as pure Unicode.
This is good.
I can't test it for a couple hours, but is the claim:
I can use grep to search a Unicode file for an ASCII text pattern?
Very likely not. Convert it and then grep (see below).
Yesterday I looked for a grep switch that would tell it the file being searched was Unicode but I didn't see one. I guess it could be automatic? I did not actually attempt it.
If that is not feasible, a tool to convert that Unicode file into an ASCII file would be great. All the text I am interested in should convert to ASCII in the obvious way. No need for any mappings. Basically, just need to get rid of the nulls.
iconv is your friend. Basically you would need to do something like:

iconv -f some_unicode_encoding -t ascii < unicode_file > ascii_file

This will choke on any non-ASCII characters (unless you make it transliterate unknown characters by appending "//TRANSLIT" to the output encoding). You are probably looking at UCS-2LE if the NULL is the second byte of each pair (Little Endian), or UCS-2BE if the NULL comes first (Big Endian).

As for a basic overview: Unicode is a standard for encoding national characters in multiple bytes. Several encodings (all of them "Unicode") exist:
1) UTF-8, which encodes characters in 1-4 bytes (e.g. 'a' takes 1 byte, while some Chinese characters may need, say, 3 bytes);
2) UTF-16, which uses 2 or 4 bytes;
3) UCS-2LE and UCS-2BE, which use 2 bytes per character; for ASCII characters one of these bytes is always null (which one depends on whether it is the Big Endian or Little Endian variant);
4) UCS-4/UTF-32 (which differ slightly), which use 4 bytes per character.

For more see e.g.: http://en.wikipedia.org/wiki/Unicode

Best regards
Petr
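[Editor's note: the iconv suggestion above can be sketched end to end, assuming the file is UTF-16LE with a leading byte-order mark as Windows tools often write; the sample data is made up.]

```shell
# Byte-order mark for UTF-16LE (bytes FF FE), then sample UTF-16LE content.
printf '\377\376' > big16.txt
printf 'alpha keyword beta\nplain line\n' | iconv -f ASCII -t UTF-16LE >> big16.txt

# Convert to UTF-8 (ASCII-compatible) and search in one pipeline,
# without materializing the converted copy on disk.
iconv -f UTF-16LE -t UTF-8 big16.txt | grep -c keyword
```

For the one-shot search this pipeline avoids the intermediate file entirely; to keep a converted copy, redirect the iconv output to a file first, as in Petr's command.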
On Tuesday 19 February 2008 20:27, Petr Cerny wrote:
iconv is your friend. Basically you would need to do something like: 'iconv -f some_unicode_encoding -t ascii < unicode_file > ascii_file' This will also choke on any non-ascii characters (unless you make it translate unknown characters with "//TRANSLIT" appended to output encoding).
For lots of files, I'd rather use "recode", which can operate on multiple files
in-place without the need for a temp file for each one:
recode unicode..utf8 *.txt
CU
--
Stefan Hundhammer