HTML parsers are software for automated Hypertext Markup Language (HTML) parsing. They have two main purposes:
Parser | License | Implementation language(s) | Latest date* | HTML parsing[1] | HTML5-compliant parsing | Clean HTML** | Update HTML*** |
---|---|---|---|---|---|---|---|
HTML Tidy | W3C license | ANSI C | 2021-03-24[2] | Yes[3] | Yes | Yes[3] | Yes |
HtmlUnit | Apache License 2.0 | Java | 2021-05-16[4] | Yes | ? | No | No |
libxml2 HTMLparser | MIT License | C | 2021-05-13[5] | Yes | No | ? | ? |
Parser | License | Implementation language(s) | Latest date* | HTML Parsing | HTML5-compliant Parsing | Clean HTML** | Update HTML*** |
By: Wikipedia.org
Edited: 2021-06-18 19:12:27
Source: Wikipedia.org