Age | Commit message | Author |
|
than before as happens with lxml.
|
the BeautifulSoup class itself. lxml uses Unicode, Dammit; html5lib uses its internal algorithms.
|
rely on handle_starttag to trigger it.
|
html5lib writer isn't setting up the charset substitution.
|
comments and tests.
|
tables.
|
characters on the way in.
|
Unicode characters during parsing.
|
though there are still some underlying problems.
|