
[ Download ]
This tool retrieves a page in html format and extracts the text content (by stripping the html tags). Requires the CPAN modules Unicode::String, HTML::TagFilter. Specifies subroutines to be called from a perl script.
Usage:
$htmlText = getURLContent("www.whatever.url...");
$cleanText = cleanHTML($htmlText)