Hi all,
I need to convert an HTML page, downloaded with
a URLConnection object, into plain text.
I wonder whether I can do it in Java, maybe using Servlet Filters
(filtering the response), or do I have to use complex XML/XSLT transformations?
Thanks
Francesco
-
How to convert an HTML page into plain text? (1 message)
- Posted by: fmarchioni fmarchioni
- Posted on: September 06 2002 04:10 EDT
-
How to convert an HTML page into plain text?
- Posted by: Lasse Koskela
- Posted on: September 06 2002 05:02 EDT
- in response to fmarchioni fmarchioni
I'm not aware of any freely available library that does HTML-to-plain-text conversion, but that doesn't mean none exists, so keep searching.
If you decide to implement the conversion logic yourself, XSL is probably out of the question unless your HTML is actually XHTML or was generated from XML, because XSLT needs well-formed XML as input and ordinary HTML pages usually aren't.
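If you do end up rolling your own, one possible starting point is the HTML parser that ships with Swing in the JDK, which tolerates HTML that isn't well-formed. What follows is only a rough sketch under a few assumptions: the class names HtmlToText and TextCollector and the method convert are made up for illustration, the text fragments are simply joined with single spaces, and no attempt is made to preserve line breaks or layout. The Reader could just as well wrap the InputStream you get from your URLConnection.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper class, named here for illustration only.
public class HtmlToText {

    // Collects the text fragments the parser reports and ignores the tags.
    private static class TextCollector extends HTMLEditorKit.ParserCallback {
        private final StringBuilder text = new StringBuilder();

        @Override
        public void handleText(char[] data, int pos) {
            // Assumption: separate fragments with a single space.
            if (text.length() > 0) {
                text.append(' ');
            }
            text.append(data);
        }
    }

    // Parses the HTML from the reader and returns the collected text.
    public static String convert(Reader html) throws IOException {
        TextCollector collector = new TextCollector();
        // The third argument tells the parser to ignore charset declarations
        // inside the page, since the Reader has already decoded the bytes.
        new ParserDelegator().parse(html, collector, true);
        return collector.text.toString();
    }

    public static void main(String[] args) throws IOException {
        String html = "<html><body><h1>Title</h1><p>Hello <b>world</b></p></body></html>";
        System.out.println(convert(new StringReader(html)));
    }
}
```

For real pages you would probably want to override the tag callbacks as well (for example, to emit a newline at paragraph and line-break tags), but that depends on how much of the layout you need to keep.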