- Posted by: Martin Straus
- Posted on: December 14 2004 09:02 EST
I'm building a simple forum-like application. I want to parse the text of each post in order to eliminate or eventually replace html tags, so that no html ends up it the generated view of each thread.
Does anyone know a library, package or algorithm with this funtionality implemented? I'd hate to code it myself.
Thanks in advance!
Maybe not what you want, but I've used a regular expression like this successfully before to remove html/xml code from a string:
str = str.replaceAll("<[^>]*>", "");