summaryrefslogtreecommitdiff
path: root/beautifulsoup/builder
AgeCommit message (Collapse)Author
2023-01-25Removed very copy of the code that was imported as part of the bzr import ↵Leonard Richardson
but not removed.
2011-02-26Emit an XML declaration when appropriate.Leonard Richardson
2011-02-21Removed the now-useless Entities class.Leonard Richardson
2011-02-20Renamed the registry variable to builder_registry.Leonard Richardson
2011-02-20Started using the builder registry.Leonard Richardson
2011-02-20Renamed constructor arguments to comply with PEP 8.Leonard Richardson
2011-02-20Added tests for the default builder registry.Leonard Richardson
2011-02-20Tree builders now advertise their features.Leonard Richardson
2011-02-20Started work on a tagging system that should make it easy to find a tree ↵Leonard Richardson
builder that meets your needs.
2011-02-20Simplified the builder registration.Leonard Richardson
2011-02-20Greatly simplified the module import code by making it take a module, not a ↵Leonard Richardson
module name.
2011-02-20Fixed up the code to register builders from a module.Leonard Richardson
2011-02-20Use registration code to register builders. The registration code will be ↵Leonard Richardson
expanded later.
2011-02-20Discovered that html5lib can't be made to support SoupStrainers, and changed ↵Leonard Richardson
the test suite appropriately.
2011-02-20Made the XML treebuilder able to handle basic invalid XML.Leonard Richardson
2011-02-20Added an empty-element tag test.Leonard Richardson
2011-02-20Tag.is_empty_element is determined dynamically, based on a) whether the ↵Leonard Richardson
builder used to create the tag had an explicit list of empty-element tags, and b) whether the tag actually contains anything.
2011-02-20Why is the test failing? Because I'm asserting the wrong thing.Leonard Richardson
2011-02-19Hacked in something to get lxml's behavior where any empty tag is treated as ↵Leonard Richardson
self-closing. This may or may not stay as is.
2011-02-19Made it easier to pass a custom lxml parser object into the treebuilder.Leonard Richardson
2011-02-19Set up an lxml parser that only parses XML, though it's not very functional yet.Leonard Richardson
2011-02-18Made Unicode, Dammit more PEP-8 compliant.Leonard Richardson
2011-02-18Made Unicode, Dammit more PEP-8 compliant.Leonard Richardson
2011-02-18Don't let html5lib set the original encoding to UTF-8 if the input was ↵Leonard Richardson
actually Unicode.
2011-02-18Pass the user-specified encoding in to html5lib rather than dropping it on ↵Leonard Richardson
the floor.
2011-02-18Have the html5lib builder set the sniffed encoding after parsing, rather ↵Leonard Richardson
than before as happens with lxml.
2011-02-18Made conversion of markup to Unicode the responsibility of the builder, not ↵Leonard Richardson
the BeautifulSoup class itself. lxml uses Unicode, Dammit; html5lib uses its internal algorithms.
2011-02-18Yay, meta tag rewrites now work with html5lib.Leonard Richardson
2011-02-18Still trying to get html5lib to rewrite the META tag.Leonard Richardson
2011-02-18Moved the substitution code to the Tag constructor so that we don't have to ↵Leonard Richardson
rely on handle_starttag to trigger it.
2011-02-18Got rid of now-useless builder configuration.Leonard Richardson
2011-02-13Got the doctype tests to work for html5lib.Leonard Richardson
2011-02-13Added tests for namespaced doctypes.Leonard Richardson
2011-02-13Clarified lxml's behavior w/r/t CDATA sections.Leonard Richardson
2011-02-13Figured out the deal with CDATA sections in lxml and html5lib, and added ↵Leonard Richardson
comments and tests.
2011-02-10Added some elementary doctype handling.Leonard Richardson