Parsing and extracting information from (possibly malformed) HTML/XML documents
https://github.com/ndmitchell/tagsoup#readme