An API for extracting data from HTML and XML documents.
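
The description does not name its functions, so the sketch below is only an illustration of what "extracting data" from HTML and XML can look like. It uses Python's standard library (`html.parser` and `xml.etree.ElementTree`) rather than this API, and the names `LinkCollector`, `extract_links`, and `extract_titles` are hypothetical helpers introduced for the example.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects the href of every <a> start tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html_text):
    """Return all link targets found in an HTML string, in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links


def extract_titles(xml_text):
    """Return the text of every <title> element in an XML string."""
    root = ET.fromstring(xml_text)
    return [element.text for element in root.iter("title")]
```

For instance, `extract_links` could be called on a fetched page's markup and `extract_titles` on a feed document. A dedicated extraction API would typically offer richer selection (such as CSS selectors or XPath) than this standard-library sketch.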