HTML extractorcan open an HTML page give the file name or remote URL to retrieve its contents. It can search the page for HTML tags given the tag name or the ID attribute. The contents of the search tag or any given occurrence number of the tag will be returned. Requirements: PHP 4.0 or higher
CRIOSWEB_HTMLCleaner can be used to remove unwanted tags and data from HTML document. It takes a string with the HTML document to clean and parses it assuming a given character set encoding.CRIOSWEB_HTMLCleaner can perform several types of clean-up operations like:- Removing style definitions- Remove tags or attributes based on white lists or blacklists- Use the HTML tidy extension to clean ...