diff options
author | Leonard Richardson <leonard.richardson@canonical.com> | 2011-02-20 09:54:42 -0500 |
---|---|---|
committer | Leonard Richardson <leonard.richardson@canonical.com> | 2011-02-20 09:54:42 -0500 |
commit | ae349fd47c627f8166526fed8906811707d2f4b2 (patch) | |
tree | 116edd8c1d9a7cf6348f784162fd2291608833c2 /CHANGELOG | |
parent | 158e76fd3e1005f6f5f932414cb741083d114cb6 (diff) | |
parent | 9f437ea591aeaf16d593350baf081315e56a8b73 (diff) |
Greatly improved the handling of empty-element tags.
Diffstat (limited to 'CHANGELOG')
-rw-r--r-- | CHANGELOG | 16 |
1 files changed, 16 insertions, 0 deletions
@@ -17,6 +17,10 @@ work. Here are the renames: * findPreviousSibling -> find_previous_sibling * findPreviousSiblings -> find_previous_siblings +Some attributes have also been renamed: + + * Tag.isSelfClosing -> Tag.is_empty_element + == Generators are now properties == The generators have been given more sensible (and PEP 8-compliant) @@ -51,6 +55,18 @@ and nothing else, then A.string is the same as B.string. So: The value of a.string used to be None, and now it's "foo". +== Empty-element tags == + +Beautiful Soup's handling of empty-element tags (aka self-closing +tags) has been improved, especially when parsing XML. Previously you +had to explicitly specify a list of empty-element tags when parsing +XML. You can still do that, but if you don't, Beautiful Soup now +considers any empty tag to be an empty-element tag. + +The determination of empty-element-ness is now made at runtime rather +than parse time. If you add a child to an empty-element tag, it stops +being an empty-element tag. + == Entities are always converted to Unicode == An HTML or XML entity is always converted into the corresponding |