summaryrefslogtreecommitdiff
path: root/bs4/builder
AgeCommit message (Collapse)Author
2012-09-28Fixed package name.Leonard Richardson
2012-08-21We don't need a special insertComment method, we just need to make ↵Leonard Richardson
Element.appendChild call object_was_parsed.
2012-08-21Fixed a problem with the html5lib builder not handling comments correctly.Leonard Richardson
2012-08-16Use namespace prefixes for namespaced attribute names, instead ofLeonard Richardson
the fully-qualified names given by the lxml parser. [bug=1037597]
2012-06-30Fixed an html5lib tree builder crash which happened when html5libLeonard Richardson
moved a tag with a multivalued attribute from one part of the tree to another. [bug=1019603]
2012-05-29Removed breakpoints.Leonard Richardson
2012-05-29Prep for release.Leonard Richardson
2012-05-24Fixed a bug with the lxml treebuilder that prevented the user from adding ↵Leonard Richardson
attributes to a tag that didn't originally have any. [bug=1002378] Thanks to Oliver Beattie for the patch.
2012-04-26The test suite now passes when lxml is not installed, whether or not ↵Leonard Richardson
html5lib is installed. [bug=987004]
2012-04-18Got rid of contains_substitutions.Leonard Richardson
2012-04-18Made encoding substitution in <meta> tags completely transparent (no more ↵Leonard Richardson
%SOUP-ENCODING%).
2012-04-18Changed wording slightly.Leonard Richardson
2012-04-18Print a warning on HTMLParseErrors to let people know they should install an ↵Leonard Richardson
external parser.
2012-04-18Fixed a bug that made the HTMLParser treebuilder generate XML definitions ↵Leonard Richardson
ending with two question marks instead of one. [bug=984258]
2012-04-03Got rid of the 4.0.2 workaround for HTML documents--it was unnecessary and ↵Leonard Richardson
the workaround was triggering a (possibly different, but related) bug in lxml. [bug=972466]
2012-04-03Don't split up the markup into chunks when using the lxml HTML parser, which ↵Leonard Richardson
doesn't have the problems of the XML parser.
2012-03-30Fixed a typo that caused some versions of Python 3 to convert the Beautiful ↵Leonard Richardson
Soup codebase incorrectly.
2012-03-24Pass data into XMLParser.feed() in chunks. [bug=963880]Leonard Richardson
2012-03-01In HTML5-style <meta charset="foo"> tags, the value of the "charset" ↵Leonard Richardson
attribute is now replaced with the appropriate encoding on output. [bug=942714]
2012-03-01Added missing __len__ method that stopped html5lib tree builder from working ↵Leonard Richardson
on nested formatting elements. [bug=943246]
2012-02-28Fixed the generated XML declaration.Leonard Richardson
2012-02-24Warn when SoupStrainer is used with the html5lib tree builder.Leonard Richardson
2012-02-23Fixed handling of the closing of namespaced tags.Leonard Richardson
2012-02-23Bumped version number.Leonard Richardson
2012-02-23Merge from trunk and added tests.Leonard Richardson
2012-02-22Added comments.Leonard Richardson
2012-02-22Treat a new namespace mapping as a set of attributes on the tag that defines ↵Leonard Richardson
it, so we don't lose the mappings.
2012-02-21Have lxml invert namespace maps as they come in and set each tag's prefix ↵Leonard Richardson
appropriately.
2012-02-21Added nsprefix argument to the tag class.Leonard Richardson
2012-02-21Merged from trunk.Leonard Richardson
2012-02-20It's now possible to copy a BeautifulSoup object created with the ↵Leonard Richardson
html.parser treebuilder.
2012-02-20Changd the class structure so that the default parser test class uses ↵Leonard Richardson
html.parser.
2012-02-16It's a start, at least.Leonard Richardson
2012-02-15Some cdata-list attributes are only cdata lists for certain tags.Leonard Richardson
2012-02-15Clarified comment.Leonard Richardson
2012-02-15Removed _nodeIndex, because the misfeature it works around is now gone.Leonard Richardson
2012-02-15Minor cleanup.Leonard Richardson
2012-02-15Tested and cleaned up html5lib insertBefore.Leonard Richardson
2012-02-15Use append instead of insert.Leonard Richardson
2012-02-15Minor cleanup.Leonard Richardson
2012-02-09As a last-ditch attempt to turn data into Unicode, use errors=replace ↵Leonard Richardson
instead of errors=strict.
2012-02-09Patched over a bug in html5lib (?) that was crashing Beautiful Soup on ↵Leonard Richardson
certain kinds of markup. [bug=838800]
2012-02-09Minor Unicode, Dammit cleanup.Leonard Richardson
2012-02-08Added missing import.Leonard Richardson
2012-02-08Rationalized the treatment of multi-valued HTML attributes such as 'class'Leonard Richardson
2012-02-07Newly created tags use the same empty-element rules as the builder used to ↵Leonard Richardson
originally create the soup.
2012-02-06Monkeypatch Python 3.2 versions prior to 3.2.3 to solve some major ↵Leonard Richardson
HTMLParser bugs.
2012-02-01LXML's HTML parser is pretty permissive.Leonard Richardson
2012-01-20Made it easier to convert BS3 code to BS4.Leonard Richardson
2012-01-20Got the test suite to pass on Python 3.2 (skipping the html5lib stuff, which ↵Leonard Richardson
doesn't seem to have Python 3 support yet.)