summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2011-02-27Renamed the beautifulsoup module to bs4 to save typing.Leonard Richardson
2011-02-26Updated TODO.Leonard Richardson
2011-02-26Added docstring.Leonard Richardson
2011-02-26Made EntitySubstitution a utility class, not the kind of class you subclass.Leonard Richardson
2011-02-26Made EntitySubstitution a utility class, not the kind of class you subclass.Leonard Richardson
2011-02-26The attribute list comes in as a dictionary, so stop turning it into a list ↵Leonard Richardson
for no reason. Saves code and a little time. Sort outgoing attributes so that the tests will run consistently.
2011-02-26Minor cleanup and adding docstrings.Leonard Richardson
2011-02-26Removed the redundant pretty_print argument except for the top level.Leonard Richardson
2011-02-26Cleaned up decodeContents, and removed encodeContents, which isn't used.Leonard Richardson
2011-02-26Refactored the code that makes a string output-ready.Leonard Richardson
2011-02-26Renamed replace_with_html_entities to substitute_html_entities.Leonard Richardson
2011-02-26First stab at HTML entity replacement.Leonard Richardson
2011-02-26Emit an XML declaration when appropriate.Leonard Richardson
2011-02-22Fixed comment.Leonard Richardson
2011-02-22Solved the question of how to decide between ' (XML) and &squot; (HTML) ↵Leonard Richardson
by cutting the Gordian knot: quote the *double* quotes, which are always ".
2011-02-22Don't turn " into " except in attribute values.Leonard Richardson
2011-02-21Added a class for converting certain characters into XML or HTML entities, ↵Leonard Richardson
though it's not usable by the end-user yet.
2011-02-21Removed useless code.Leonard Richardson
2011-02-21PEP-8-ified more argument names.Leonard Richardson
2011-02-21Renamed prettyPrint to pretty_print.Leonard Richardson
2011-02-21Minor cleanup.Leonard Richardson
2011-02-21Removed the now-useless Entities class.Leonard Richardson
2011-02-21Got rid of isString.Leonard Richardson
2011-02-21Switched Tag.decode to use EntitySubstitution.substitute_xml.Leonard Richardson
2011-02-21Created an EntitySubstitution class that's going to take code away from ↵Leonard Richardson
UnicodeDammit, Entities, and BeautifulSoup.
2011-02-20Added a registry for tree builders and made it possible to find a tree ↵Leonard Richardson
builder that has the features you want from the BeautifulSoup constructor.
2011-02-20Fixed bug in the BS constructor lookup, and added the test file I've been ↵Leonard Richardson
working on this whole time.
2011-02-20Renamed the registry variable to builder_registry.Leonard Richardson
2011-02-20Started using the builder registry.Leonard Richardson
2011-02-20Renamed constructor arguments to comply with PEP 8.Leonard Richardson
2011-02-20Added tests for the default builder registry.Leonard Richardson
2011-02-20Tree builders now advertise their features.Leonard Richardson
2011-02-20Started work on a tagging system that should make it easy to find a tree ↵Leonard Richardson
builder that meets your needs.
2011-02-20Started work on a tagging system that should make it easy to find a tree ↵Leonard Richardson
builder that meets your needs.
2011-02-20Created a function that puts all tree-builders in a module into ↵Leonard Richardson
beautifulsoup.builders.
2011-02-20Simplified the builder registration.Leonard Richardson
2011-02-20Greatly simplified the module import code by making it take a module, not a ↵Leonard Richardson
module name.
2011-02-20Fixed up the code to register builders from a module.Leonard Richardson
2011-02-20Use registration code to register builders. The registration code will be ↵Leonard Richardson
expanded later.
2011-02-20Discovered that html5lib can't be made to support SoupStrainers, and changed ↵Leonard Richardson
the test suite appropriately.
2011-02-20Discovered that html5lib can't be made to support SoupStrainers, and changed ↵Leonard Richardson
the test suite appropriately.
2011-02-20Removed extraneous newlines.Leonard Richardson
2011-02-20I couldn't get the XML parser to parse CDATA as CData objects, but at least ↵Leonard Richardson
I documented the current behavior.
2011-02-20Since we can't parse in CData objects ATM, added a test for CData objects ↵Leonard Richardson
created manually, to keep the bits from rotting.
2011-02-20Made the XML treebuilder able to handle basic invalid XML.Leonard Richardson
2011-02-20Greatly improved the handling of empty-element tags.Leonard Richardson
2011-02-20Added a test showing weird behavior when you .insert contents into an ↵Leonard Richardson
empty-element tag.
2011-02-20Refactored some empty-element tests and added more.Leonard Richardson
2011-02-20Test that empty-element tags that get children stop being empty-element tags.Leonard Richardson
2011-02-20Added tests of custom lists of empty-element tags.Leonard Richardson