Mailinglist Archive: opensuse-programming (16 mails)
| < Previous | Next > |
[opensuse-programming] extracting text from html
- From: Per Jessen <per@xxxxxxxxxxxx>
- Date: Tue, 25 May 2010 16:47:52 +0200
- Message-id: <htgnuo$58r$1@xxxxxxxxxxxxxxxx>
I need to extract text from html for purposes of indexing -
implementation language is C or C++. Sofar I've come across html2text
which is written in C++ - it looks pretty good, but I will need to make
some changes to make it fit my prposes. Does any other library come to
mind for extracting text from html?
/Per Jessen, Zürich
--
To unsubscribe, e-mail: opensuse-programming+unsubscribe@xxxxxxxxxxxx
For additional commands, e-mail: opensuse-programming+help@xxxxxxxxxxxx
implementation language is C or C++. Sofar I've come across html2text
which is written in C++ - it looks pretty good, but I will need to make
some changes to make it fit my prposes. Does any other library come to
mind for extracting text from html?
/Per Jessen, Zürich
--
To unsubscribe, e-mail: opensuse-programming+unsubscribe@xxxxxxxxxxxx
For additional commands, e-mail: opensuse-programming+help@xxxxxxxxxxxx
| < Previous | Next > |