[opensuse-programming] extracting text from html

I need to extract text from HTML for purposes of indexing - the implementation language is C or C++. So far I've come across html2text, which is written in C++ - it looks pretty good, but I will need to make some changes to make it fit my purposes. Does any other library come to mind for extracting text from HTML?

/Per Jessen, Zürich

* Per Jessen <per@computer.org> [05-25-10 10:49]:
w3m -dump <url>

lynx has a similar function.

--
Patrick Shanahan
Plainfield, Indiana, USA

Patrick Shanahan wrote:
Yeah, something like that will be the last way out - I'd prefer not having to fork() for such a simple operation. (I'll be indexing millions of documents). Maybe it's worth checking out what those two utilities use for the extraction.

/Per Jessen, Zürich
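
For completeness, a rough sketch of the shell-out approach being set aside here - piping a file through "w3m -dump" via popen() and reading the rendered text back. The helper name and the lack of shell quoting are illustrative, not from the thread. It works, but it pays for a process start per document, which is exactly the overhead that hurts at millions of documents.

    #include <cstdio>
    #include <string>

    // Run "w3m -dump" on a local HTML file and return whatever it prints.
    // Hypothetical helper; 'path' is passed to the shell unquoted for brevity.
    static std::string dump_with_w3m(const std::string &path)
    {
        std::string text;
        std::string cmd = "w3m -dump " + path;
        FILE *p = popen(cmd.c_str(), "r");
        if (p == NULL)
            return text;
        char buf[4096];
        size_t n;
        while ((n = fread(buf, 1, sizeof buf, p)) > 0)
            text.append(buf, n);
        pclose(p);
        return text;
    }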

On Tue, May 25, 2010 at 11:09 AM, Per Jessen <per@computer.org> wrote:
Per,

If you have millions of documents to index/search, you're getting into my world. Indexing and searching is a core part of my day job, and we sell that capability as a service. We use an extremely fast but rather expensive engine that runs on Linux. Unless you have lots of money to spend ($100K+) and are willing to run RH, I won't describe it in detail, but the speed is incredible: ~800GB/hr indexing speed has been measured at the vendor's lab. We have a smaller solution, but we routinely see a couple hundred GB/hr. The hardest part is getting an I/O system fast enough to feed it - i.e. how do you load 800GB/hr onto a computer for indexing? A 1 Gbit LAN is way too slow, and the same goes for USB, FireWire, etc. In the vendor's lab I think they used a high-end NAS with 4x1Gbit ports bonded together as the data source. (Use private email if you want to know more - no need to bore everyone here.)

Assuming that's way overkill, would you consider a low-cost commercial solution? DTsearch is definitely a market leader and it runs at reasonable speeds. For Linux I believe they only offer a library/engine, not a full GUI, but that may work for you. I have no idea what they charge for the library/engine. http://www.dtsearch.com/PLF_engine_2.html

We have DTsearch, but the Windows Desktop version, so I can't say much about the Linux engine they sell.

Greg
--
Greg Freemyer
The Norcross Group - http://www.norcrossgroup.com

Greg Freemyer wrote:
Hi Greg - there's no real upper limit, but design-wise I'm working on 50-60 million documents per index - I will have many indexes on different collections of documents.

I'm currently aiming at indexing up to 1 million new documents per index per day, so about 10/sec on average. Later on, we will also be removing (de-indexing?) the same amount per index per day.

For the time being we've pretty much decided to use Xapian, but if that turns out to have significant performance issues, products such as DTsearch could well come under consideration - thanks for the link. I stumbled over Xapian more or less by accident, but Gmane is using it for indexing millions of emails, which is exactly what I will be doing too.

/Per Jessen, Zürich

On Tue, May 25, 2010 at 10:47 AM, Per Jessen <per@computer.org> wrote:
One way I've seen to do text extraction in the Windows world is to have a printer driver that extracts the text as it comes in. (It is not a perfect process with PDFs sometimes, because of the way PDFs print part of a word, then a command, then more of the word, etc.) Are there equivalent text-extracting printer drivers on Linux?

Greg

I would use a SAX parser that handles HTML (libxml2?). Then all you might need to do is handle the TEXT nodes.

Cheers
Justin
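
A minimal sketch of that SAX idea with libxml2's HTML parser (not taken from the thread; the command-line handling is illustrative). Only the characters() callback is filled in, so element markup is simply ignored; note that text inside <script> and <style> also arrives through this callback and would need filtering before indexing.

    #include <cstdio>
    #include <cstring>
    #include <string>
    #include <libxml/HTMLparser.h>

    // Called for every run of character data; ctx is the std::string passed below.
    static void on_characters(void *ctx, const xmlChar *ch, int len)
    {
        static_cast<std::string *>(ctx)->append(
            reinterpret_cast<const char *>(ch), len);
    }

    int main(int argc, char **argv)
    {
        if (argc < 2)
            return 1;

        std::string text;
        htmlSAXHandler sax;
        std::memset(&sax, 0, sizeof(sax));
        sax.characters = on_characters;

        // libxml2's HTML parser recovers from broken markup instead of rejecting it.
        htmlSAXParseFile(argv[1], NULL, &sax, &text);
        xmlCleanupParser();

        std::printf("%s\n", text.c_str());
        return 0;
    }

Build with something like: g++ extract.cc $(xml2-config --cflags --libs)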

justin finnerty wrote:
Something like that was indeed my first thought, but I'm pretty certain it would require the HTML to be well-formed, which is far from guaranteed :-( I also took a quick look at BeautifulSoup, but I still need a C or C++ interface. Essentially I'm looking for something that will provide a function:

int html2text( char *in, char *out );

It might be feasible, or even easier, to simply write something to strip out HTML tags - I'm still contemplating that.

/Per Jessen, Zürich
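
A deliberately naive sketch of that signature (not Per's code): copy everything that sits outside a <...> pair and drop the rest. It ignores entities, comments, <script>/<style> contents and literal '<' or '>' in text, so it is only a starting point, and it assumes 'out' is at least as large as 'in'.

    // Strip anything between '<' and '>' and return the number of bytes written.
    int html2text( char *in, char *out )
    {
        int in_tag = 0;
        char *p = out;
        for (; *in != '\0'; ++in) {
            if (*in == '<')
                in_tag = 1;
            else if (*in == '>')
                in_tag = 0;
            else if (!in_tag)
                *p++ = *in;
        }
        *p = '\0';
        return (int)(p - out);
    }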

Per Jessen wrote:
Hmm, http://www.xmlsoft.org/html/libxml-HTMLparser.html might just be useful: "this module implements an HTML 4.0 non-verifying parser with API compatible with the XML parser ones. It should be able to parse "real world" HTML, even if severely broken from a specification point of view."

/Per Jessen, Zürich
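
A small sketch against that module (htmlReadFile and the HTML_PARSE_* flags are libxml2 API; the traversal itself is illustrative, not from the thread). The parser builds a tree even from badly broken HTML, and the walk below just prints the content of every text node.

    #include <cstdio>
    #include <libxml/HTMLparser.h>
    #include <libxml/tree.h>

    // Depth-first walk over the parsed tree, printing text nodes.
    static void dump_text(xmlNode *node)
    {
        for (xmlNode *cur = node; cur != NULL; cur = cur->next) {
            if (cur->type == XML_TEXT_NODE && cur->content != NULL)
                std::printf("%s ", reinterpret_cast<const char *>(cur->content));
            dump_text(cur->children);
        }
    }

    int main(int argc, char **argv)
    {
        if (argc < 2)
            return 1;

        // RECOVER/NOERROR/NOWARNING: parse what we can, quietly.
        htmlDocPtr doc = htmlReadFile(argv[1], NULL,
                                      HTML_PARSE_RECOVER | HTML_PARSE_NOERROR |
                                      HTML_PARSE_NOWARNING);
        if (doc == NULL)
            return 1;

        dump_text(xmlDocGetRootElement(doc));
        xmlFreeDoc(doc);
        xmlCleanupParser();
        return 0;
    }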

We have used Apache Lucene for quite some time to index our online HTML manuals. Not sure, though, whether it would be suitable for the particular purpose mentioned in this email thread.

Cheers, Th.

Per Jessen wrote:
A quick update for those who might be following this thread - I got this to work fairly quickly with libxml. My documents are read from disk, the HTML is parsed and I then traverse the tree looking for text elements. I collect the text and feed it to Xapian, then I start the next document. I have some work left on getting a status listing (documents processed, what was indexed, etc.) as well as getting errors returned properly, but I think this is the right way.

/Per Jessen, Zürich
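
A sketch of the Xapian side of that pipeline, assuming a collect_text() helper along the lines of the libxml2 traversal shown earlier in the thread (collect_text, the database path and the file name are hypothetical, not Per's actual code). One document's extracted text goes through a TermGenerator into a writable database.

    #include <string>
    #include <xapian.h>

    // Hypothetical: parse the HTML file at 'path' with libxml2 and return the
    // concatenated contents of its text nodes.
    std::string collect_text(const std::string &path);

    // Index one document's extracted text into the given database.
    void index_one(Xapian::WritableDatabase &db, const std::string &path)
    {
        Xapian::Document doc;
        Xapian::TermGenerator indexer;
        indexer.set_stemmer(Xapian::Stem("english"));
        indexer.set_document(doc);
        indexer.index_text(collect_text(path));
        doc.set_data(path);              // remember where the text came from
        db.add_document(doc);
    }

    int main()
    {
        Xapian::WritableDatabase db("myindex", Xapian::DB_CREATE_OR_OPEN);
        index_one(db, "example.html");
        db.commit();
        return 0;
    }

Link with -lxapian; committing in batches rather than per document should help with the indexing rates discussed above.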

Hello: I do not normally work in Linux, but I ran into the same problem a while ago and solved it by building a remover in Basic. It works very well for filtering the HTML produced by Word (Word is one of the dirtiest HTML creators). The source code is below, if you want it. It is pretty simple to convert to C++. The problem is that my function doesn't filter well when you write "<" or ">" as text in the document: the normal way is to write "&lt;" and "&gt;" instead.

Greetings.
----------------------------------------------------
' Strips the text out of an HTML string.
' Filters all the HTML code from the cIn parameter, with the goal
' of presenting only plain text.
' WARNING: the function treats as an HTML tag every expression
' that starts with '<' and ends with '>'. In other words, an expression
' like:
'   .... blah, blah 3 < 5 and 5 > 9, it is known that ....
' will be filtered as:
'   .... blah, blah 3 9, it is known that ....
Public Function stripHTML_Re(ByVal cIn As String) As String
    Dim cOut As String
    Dim nFoundlt As Long    ' found <
    Dim nFoundgt As Long    ' found >
    Dim nCurrent As Long

    nFoundlt = 0
    nFoundgt = 0
    nCurrent = 1
    cOut = ""

    Do While nCurrent <= Len(cIn)
        If Mid(cIn, nCurrent, 1) = "<" Then
            ' if this is the second "<" we have found,
            ' copy the first one to the output
            If nFoundlt <> 0 Then
                cOut = cOut + Mid(cIn, nFoundlt, nCurrent - nFoundlt)
            End If
            nFoundlt = nCurrent
        End If
        If Mid(cIn, nCurrent, 1) = ">" And nFoundlt > 0 Then
            nFoundgt = nCurrent
        End If
        If nFoundlt = 0 And nFoundgt = 0 Then
            cOut = cOut + Mid(cIn, nCurrent, 1)
        End If
        If nFoundlt <> 0 And nFoundgt <> 0 Then
            nCurrent = nFoundgt
            nFoundlt = 0
            nFoundgt = 0
        End If
        nCurrent = nCurrent + 1
    Loop

    ' replace the HTML entities with their equivalents
    ' in the Windows character set
    Dim aHtmlCh() As String
    Dim nCount As Long
    Dim cTemp As String

    aHtmlCh = ArrayCaracteres
    For nCount = LBound(aHtmlCh) To UBound(aHtmlCh)
        cOut = Replace(cOut, aHtmlCh(nCount, 0), aHtmlCh(nCount, 1))
        ' some HTML editors write the entity without the trailing ";",
        ' so that case is handled too
        cTemp = Replace(aHtmlCh(nCount, 0), ";", "")
        cOut = Replace(cOut, cTemp, aHtmlCh(nCount, 1))
    Next

    stripHTML_Re = cOut
End Function

' Returns an array with the correspondence between the HTML entities
' and the corresponding "real" characters.
Private Function ArrayCaracteres() As String()
    Static aHtmlCh(119, 1) As String

    If IsEmpty(aHtmlCh(0, 0)) Or aHtmlCh(0, 0) = "" Then
        aHtmlCh(0, 0) = "!": aHtmlCh(1, 0) = "#": aHtmlCh(2, 0) = "$": aHtmlCh(3, 0) = "%"
        aHtmlCh(4, 0) = "&": aHtmlCh(5, 0) = """": aHtmlCh(6, 0) = "(": aHtmlCh(7, 0) = ")"
        aHtmlCh(8, 0) = "*": aHtmlCh(9, 0) = "+": aHtmlCh(10, 0) = ",": aHtmlCh(11, 0) = "‐"
        aHtmlCh(12, 0) = ".": aHtmlCh(13, 0) = "/": aHtmlCh(14, 0) = ":": aHtmlCh(15, 0) = ";"
        aHtmlCh(16, 0) = "<": aHtmlCh(17, 0) = "=": aHtmlCh(18, 0) = ">": aHtmlCh(19, 0) = "?"
        aHtmlCh(20, 0) = "@": aHtmlCh(21, 0) = "[": aHtmlCh(22, 0) = "\": aHtmlCh(23, 0) = "]"
        aHtmlCh(24, 0) = "ˆ": aHtmlCh(25, 0) = "_": aHtmlCh(26, 0) = "`": aHtmlCh(27, 0) = "{"
        aHtmlCh(28, 0) = "|": aHtmlCh(29, 0) = "}": aHtmlCh(30, 0) = "˜": aHtmlCh(31, 0) = " "
        aHtmlCh(32, 0) = "¡": aHtmlCh(33, 0) = "¢": aHtmlCh(34, 0) = "£": aHtmlCh(35, 0) = "¤"
        aHtmlCh(36, 0) = "¥": aHtmlCh(37, 0) = "¦": aHtmlCh(38, 0) = "§": aHtmlCh(39, 0) = "¨"
        aHtmlCh(40, 0) = "©": aHtmlCh(41, 0) = "ª": aHtmlCh(42, 0) = "«": aHtmlCh(43, 0) = "¬"
        aHtmlCh(44, 0) = "": aHtmlCh(45, 0) = "®": aHtmlCh(46, 0) = "¯": aHtmlCh(47, 0) = "°"
        aHtmlCh(48, 0) = "±": aHtmlCh(49, 0) = "²": aHtmlCh(50, 0) = "³": aHtmlCh(51, 0) = "´"
        aHtmlCh(52, 0) = "µ": aHtmlCh(53, 0) = "¶": aHtmlCh(54, 0) = "·": aHtmlCh(55, 0) = "¸"
        aHtmlCh(56, 0) = "¹": aHtmlCh(57, 0) = "º": aHtmlCh(58, 0) = "»": aHtmlCh(59, 0) = "&fr;"
        aHtmlCh(60, 0) = "&fr;": aHtmlCh(61, 0) = "&fr;": aHtmlCh(62, 0) = "¿": aHtmlCh(63, 0) = "À"
        aHtmlCh(64, 0) = "Á": aHtmlCh(65, 0) = "Â": aHtmlCh(66, 0) = "Ã": aHtmlCh(67, 0) = "Ä"
        aHtmlCh(68, 0) = "Å": aHtmlCh(69, 0) = "Æ": aHtmlCh(70, 0) = "&il;": aHtmlCh(71, 0) = "È"
        aHtmlCh(72, 0) = "É": aHtmlCh(73, 0) = "Ê": aHtmlCh(74, 0) = "Ë": aHtmlCh(75, 0) = "Ì"
        aHtmlCh(76, 0) = "Í": aHtmlCh(77, 0) = "Î": aHtmlCh(78, 0) = "Ï": aHtmlCh(79, 0) = "Ð"
        aHtmlCh(80, 0) = "Ñ": aHtmlCh(81, 0) = "Ò": aHtmlCh(82, 0) = "Ó": aHtmlCh(83, 0) = "Ô"
        aHtmlCh(84, 0) = "Õ": aHtmlCh(85, 0) = "Ö": aHtmlCh(86, 0) = "×": aHtmlCh(87, 0) = "Ø"
        aHtmlCh(88, 0) = "Ù": aHtmlCh(89, 0) = "Ú": aHtmlCh(90, 0) = "Û": aHtmlCh(91, 0) = "Ü"
        aHtmlCh(92, 0) = "Ý": aHtmlCh(93, 0) = "Þ": aHtmlCh(94, 0) = "ß": aHtmlCh(95, 0) = "à"
        aHtmlCh(96, 0) = "á": aHtmlCh(97, 0) = "â": aHtmlCh(98, 0) = "ã": aHtmlCh(99, 0) = "ä"
        aHtmlCh(100, 0) = "è": aHtmlCh(101, 0) = "é": aHtmlCh(102, 0) = "ê": aHtmlCh(103, 0) = "&etilde;"
        aHtmlCh(104, 0) = "ë": aHtmlCh(105, 0) = "ì": aHtmlCh(106, 0) = "í": aHtmlCh(107, 0) = "î"
        aHtmlCh(108, 0) = "ĩ": aHtmlCh(109, 0) = "ï": aHtmlCh(110, 0) = "ò": aHtmlCh(111, 0) = "ó"
        aHtmlCh(112, 0) = "ô": aHtmlCh(113, 0) = "õ": aHtmlCh(114, 0) = "ö": aHtmlCh(115, 0) = "ù"
        aHtmlCh(116, 0) = "ú": aHtmlCh(117, 0) = "û": aHtmlCh(118, 0) = "ũ": aHtmlCh(119, 0) = "ü"

        aHtmlCh(0, 1) = "¡": aHtmlCh(1, 1) = "º": aHtmlCh(2, 1) = "$": aHtmlCh(3, 1) = "%"
        aHtmlCh(4, 1) = "&": aHtmlCh(5, 1) = """": aHtmlCh(6, 1) = "(": aHtmlCh(7, 1) = ")"
        aHtmlCh(8, 1) = "*": aHtmlCh(9, 1) = "+": aHtmlCh(10, 1) = ",": aHtmlCh(11, 1) = "-"
        aHtmlCh(12, 1) = ".": aHtmlCh(13, 1) = "Sol": aHtmlCh(14, 1) = "Colon": aHtmlCh(15, 1) = "*"
        aHtmlCh(16, 1) = "<": aHtmlCh(17, 1) = "=": aHtmlCh(18, 1) = ">": aHtmlCh(19, 1) = "?"
        aHtmlCh(20, 1) = ",": aHtmlCh(21, 1) = "*": aHtmlCh(22, 1) = "*": aHtmlCh(23, 1) = "*"
        aHtmlCh(24, 1) = "*": aHtmlCh(25, 1) = "_": aHtmlCh(26, 1) = "'": aHtmlCh(27, 1) = "*"
        aHtmlCh(28, 1) = "*": aHtmlCh(29, 1) = "*": aHtmlCh(30, 1) = "'": aHtmlCh(31, 1) = " "
        aHtmlCh(32, 1) = "¡": aHtmlCh(33, 1) = "cent": aHtmlCh(34, 1) = "L": aHtmlCh(35, 1) = "*"
        aHtmlCh(36, 1) = "Y": aHtmlCh(37, 1) = "*": aHtmlCh(38, 1) = "*": aHtmlCh(39, 1) = "."
        aHtmlCh(40, 1) = "(c)": aHtmlCh(41, 1) = "*": aHtmlCh(42, 1) = "*": aHtmlCh(43, 1) = "!"
        aHtmlCh(44, 1) = "*": aHtmlCh(45, 1) = "(r)": aHtmlCh(46, 1) = "*": aHtmlCh(47, 1) = "*"
        aHtmlCh(48, 1) = "*": aHtmlCh(49, 1) = "*": aHtmlCh(50, 1) = "*": aHtmlCh(51, 1) = "á"
        aHtmlCh(52, 1) = "u": aHtmlCh(53, 1) = "*": aHtmlCh(54, 1) = "·": aHtmlCh(55, 1) = "ç"
        aHtmlCh(56, 1) = "*": aHtmlCh(57, 1) = "*": aHtmlCh(58, 1) = "*": aHtmlCh(59, 1) = "*"
        aHtmlCh(60, 1) = "*": aHtmlCh(61, 1) = "*": aHtmlCh(62, 1) = "¿": aHtmlCh(63, 1) = "È"
        aHtmlCh(64, 1) = "Á": aHtmlCh(65, 1) = "Ä": aHtmlCh(66, 1) = "Á": aHtmlCh(67, 1) = "*"
        aHtmlCh(68, 1) = "*": aHtmlCh(69, 1) = "AE": aHtmlCh(70, 1) = "*": aHtmlCh(71, 1) = "È"
        aHtmlCh(72, 1) = "É": aHtmlCh(73, 1) = "*": aHtmlCh(74, 1) = "*": aHtmlCh(75, 1) = "Ì"
        aHtmlCh(76, 1) = "Í": aHtmlCh(77, 1) = "Î": aHtmlCh(78, 1) = "*": aHtmlCh(79, 1) = "*"
        aHtmlCh(80, 1) = "N'": aHtmlCh(81, 1) = "Ò": aHtmlCh(82, 1) = "Ó": aHtmlCh(83, 1) = "Ô"
        aHtmlCh(84, 1) = "O'": aHtmlCh(85, 1) = "*": aHtmlCh(86, 1) = "*": aHtmlCh(87, 1) = "O/"
        aHtmlCh(88, 1) = "Ù": aHtmlCh(89, 1) = "Ú": aHtmlCh(90, 1) = "Û": aHtmlCh(91, 1) = "*"
        aHtmlCh(92, 1) = "*": aHtmlCh(93, 1) = "*": aHtmlCh(94, 1) = "*": aHtmlCh(95, 1) = "à"
        aHtmlCh(96, 1) = "á": aHtmlCh(97, 1) = "â": aHtmlCh(98, 1) = "a'": aHtmlCh(99, 1) = "*"
        aHtmlCh(100, 1) = "è": aHtmlCh(101, 1) = "é": aHtmlCh(102, 1) = "ê": aHtmlCh(103, 1) = "e'"
        aHtmlCh(104, 1) = "*": aHtmlCh(105, 1) = "ì": aHtmlCh(106, 1) = "í": aHtmlCh(107, 1) = "î"
        aHtmlCh(108, 1) = "i'": aHtmlCh(109, 1) = "*": aHtmlCh(110, 1) = "ò": aHtmlCh(111, 1) = "ó"
        aHtmlCh(112, 1) = "ô": aHtmlCh(113, 1) = "o'": aHtmlCh(114, 1) = "*": aHtmlCh(115, 1) = "ù"
        aHtmlCh(116, 1) = "ú": aHtmlCh(117, 1) = "û": aHtmlCh(118, 1) = "u'": aHtmlCh(119, 1) = "*"
    End If

    ArrayCaracteres = aHtmlCh
End Function
----------------------------------------------------
participants (6)
- Greg Freemyer
- justin finnerty
- Luna Rodríguez, Raúl
- Patrick Shanahan
- Per Jessen
- Thomas Hertweck