Mailinglist Archive: opensuse-programming (16 mails)

[opensuse-programming] extracting text from html
  • From: Per Jessen <per@xxxxxxxxxxxx>
  • Date: Tue, 25 May 2010 16:47:52 +0200
  • Message-id: <htgnuo$58r$1@xxxxxxxxxxxxxxxx>
I need to extract text from HTML for indexing purposes; the
implementation language is C or C++. So far I've come across html2text,
which is written in C++. It looks pretty good, but I will need to make
some changes to make it fit my purposes. Does any other library come to
mind for extracting text from HTML?


/Per Jessen, Zürich

