graphics converter util needed
I'm not sure if this is the right list for this kind of question so please point me in the right direction if it isn't. The scenario: We have a large archive of documents that have been scanned and stored as GIF or JPEG files. Some are in full color, some are grey-scale and some are black and white. It has now been decided to try using an OCR tool to extract the text from the images in an attempt to make it easier to search and use this source of information. The OCR tools we have been looking at require a black and white or grey-scale TIFF or a PGM or PBM as input. So we need a utillity that can convert all the GIF and JPEG images to one of those formats. And since the archive contains 50,000+ images then it would be highly desireable to find a utillity that can run from the command line so that we can write some script to automate the entire process. Thanks in advance, Claus
On Friday 21 February 2003 9:58 am, Claus Lund wrote:
I'm not sure if this is the right list for this kind of question so please point me in the right direction if it isn't.
The scenario: We have a large archive of documents that have been scanned and stored as GIF or JPEG files. Some are in full color, some are grey-scale and some are black and white. It has now been decided to try using an OCR tool to extract the text from the images in an attempt to make it easier to search and use this source of information. The OCR tools we have been looking at require a black and white or grey-scale TIFF or a PGM or PBM as input. So we need a utillity that can convert all the GIF and JPEG images to one of those formats. And since the archive contains 50,000+ images then it would be highly desireable to find a utillity that can run from the command line so that we can write some script to automate the entire process.
Thanks in advance, Claus
man convert And it should take you about 10 mins to write a script for it. -- +----------------------------------------------------------------------------+ + Bruce S. Marshall bmarsh@bmarsh.com Bellaire, MI 02/21/03 11:11 + +----------------------------------------------------------------------------+ "The first thing we do, let's kill all the lawyers." - Shakespeare: Henry VI, Part 2, act ii
On Friday 21 February 2003 8:12 am, Bruce Marshall wrote:
On Friday 21 February 2003 9:58 am, Claus Lund wrote:
We have a large archive of documents that have been scanned and stored as GIF or JPEG files. ... So we need a utillity that can convert all the GIF and JPEG images to one of those formats...
man convert
What package is that from? I have a similar need [rotating images I took with my digital camera to make "portrait" shots really "portrait"...] so I just tried to read the page you suggested, appearently it isn't installed on my system... [however, I do have "gimp" installed, which I know will do this "one at a time"] -- Yet another Blog: http://osnut.homelinux.net
On Friday 21 February 2003 11:27 am, Tom Emerson wrote:
On Friday 21 February 2003 8:12 am, Bruce Marshall wrote:
On Friday 21 February 2003 9:58 am, Claus Lund wrote:
We have a large archive of documents that have been scanned and stored as GIF or JPEG files. ... So we need a utillity that can convert all the GIF and JPEG images to one of those formats...
man convert
What package is that from? I have a similar need [rotating images I took with my digital camera to make "portrait" shots really "portrait"...] so I just tried to read the page you suggested, appearently it isn't installed on my system...
[however, I do have "gimp" installed, which I know will do this "one at a time"]
convert is from imagemagik (or something close to that) -- +----------------------------------------------------------------------------+ + Bruce S. Marshall bmarsh@bmarsh.com Bellaire, MI 02/21/03 11:41 + +----------------------------------------------------------------------------+ "Man is a rational animal who always loses his temper when he is called upon to act in accordance with the dictates of reason." - Oscar Wilde, British playwright, poet, and novelist (1854-1900)
Thanks for all the answers. It looks like convert is going to be able to do
what we need.
-Claus
----- Original Message -----
From: "Bruce Marshall"
On Friday 21 February 2003 11:27 am, Tom Emerson wrote:
On Friday 21 February 2003 8:12 am, Bruce Marshall wrote:
On Friday 21 February 2003 9:58 am, Claus Lund wrote:
We have a large archive of documents that have been scanned and stored as GIF or JPEG files. ... So we need a utillity that can convert all the GIF and JPEG images to one of those formats...
man convert
What package is that from? I have a similar need [rotating images I took with my digital camera to make "portrait" shots really "portrait"...] so I just tried to read the page you suggested, appearently it isn't installed on my system...
[however, I do have "gimp" installed, which I know will do this "one at a time"]
convert is from imagemagik (or something close to that)
--
+--------------------------------------------------------------------------- -+
+ Bruce S. Marshall bmarsh@bmarsh.com Bellaire, MI 02/21/03 11:41 +
+--------------------------------------------------------------------------- -+
"Man is a rational animal who always loses his temper when he is called upon to act in accordance with the dictates of reason." - Oscar Wilde, British playwright, poet, and novelist (1854-1900)
-- Check the headers for your unsubscription address For additional commands send e-mail to suse-linux-e-help@suse.com Also check the archives at http://lists.suse.com Please read the FAQs: suse-linux-e-faq@suse.com
On Friday 21 February 2003 2:58 pm, Claus Lund wrote:
we need a utillity that can convert all the GIF and JPEG images to one of those formats.
Have a look at installing "ImageMagick". It contains a utitlity called "convert" that can do pretty much anything ;o) It is uses the command line as its interface, and can accept wildcards IIRC. I just used it to convert a load of JPEGs to thumbnail PNGs. Hope this helps! Jon
On Friday 21 February 2003 08:58 am, Claus Lund wrote:
I'm not sure if this is the right list for this kind of question so please point me in the right direction if it isn't.
The scenario: We have a large archive of documents that have been scanned and stored as GIF or JPEG files. Some are in full color, some are grey-scale and some are black and white. It has now been decided to try using an OCR tool to extract the text from the images in an attempt to make it easier to search and use this source of information. The OCR tools we have been looking at require a black and white or grey-scale TIFF or a PGM or PBM as input. So we need a utillity that can convert all the GIF and JPEG images to one of those formats. And since the archive contains 50,000+ images then it would be highly desireable to find a utillity that can run from the command line so that we can write some script to automate the entire process.
Thanks in advance, Claus
Hi, I'm not sure I understand correctly what you're wanting to do, but thought I'd at least try to help somehow, heh. Perhaps the app 'imgseek' is what you can use? http://imgseek.sourceforge.net/ Sorry if this isn't what you need, but at least you know people are trying to help you. <S> Take care and be good. John
participants (5)
-
Bruce Marshall
-
Claus Lund
-
John
-
The Purple Tiger
-
Tom Emerson