diff options
author | Leonard Richardson <leonard.richardson@canonical.com> | 2011-02-11 09:10:56 -0500 |
---|---|---|
committer | Leonard Richardson <leonard.richardson@canonical.com> | 2011-02-11 09:10:56 -0500 |
commit | d0531c4204a67a4289025bf7108a922f680fa057 (patch) | |
tree | cdad3f97812e658d84a611b6017b7198fd97d818 /TODO | |
parent | 3366ad67dc2dfdd508267efc87dfc851b612fb0d (diff) | |
parent | d89c8878ea86a2575c87e9fad8081cfcd81e0bcd (diff) |
Ported some more tests, fixed an encoding problem, and added rudimentary doctype handling.
Diffstat (limited to 'TODO')
-rw-r--r-- | TODO | 5 |
1 files changed, 5 insertions, 0 deletions
@@ -2,6 +2,11 @@ html5lib has its own Unicode, Dammit-like system. Converting the input to Unicode should be up to the builder. The lxml builder would use Unicode, Dammit, and the html5lib builder would be a no-op. +Bare ampersands should be converted to HTML entities upon output. + +It should also be possible to convert certain Unicode characters to +HTML entities upon output. + --- Here are some unit tests that fail with HTMLParser. |