summaryrefslogtreecommitdiff
path: root/bs4/builder
AgeCommit message (Expand)Author
2017-05-06HTML parsers treat all HTML4 and HTML5 empty element tags (aka void element t...Leonard Richardson
2016-12-19Fixed foster parenting when html5lib is the tree builder. Thanks to Geoffrey ...Leonard Richardson
2016-12-19Fixed yet another problem that caused the html5lib tree builder toLeonard Richardson
2016-07-30Explained why we test both unicode and bytestring processing instructions.Leonard Richardson
2016-07-26Fixed a reported (but not duplicated) bug involving processing instructions f...Leonard Richardson
2016-07-16Beautiful Soup will now work with versions of html5lib greater thanLeonard Richardson
2016-07-16Removed imports to pdb, since pdb is not available in some environments. [bug...Leonard Richardson
2016-07-16The contents of <textarea> tags will no longer be modified when theLeonard Richardson
2016-07-16Added a separate class for XML processing instructions, which have a slightly...Leonard Richardson
2016-07-16Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file.Leonard Richardson
2015-12-08Fix foster parenting with html5lib.Geoffrey Sneddon
2015-12-08Make TreeBuilderForHtml5lib strictly follow the html5lib API.Geoffrey Sneddon
2015-09-28Fixed a parse bug with the html5lib tree-builder. Thanks to RoelLeonard Richardson
2015-06-28 It's now possible to pickle a BeautifulSoup object no matter whichLeonard Richardson
2015-06-28Changed the way soup objects work under copy.copy(). Copying aLeonard Richardson
2015-06-28Fixed a bug where Element.extract() could create an infinite loop inLeonard Richardson
2015-06-28Accept 'xml' as an unambiguous identifier for the lxml XML parser, since it's...Leonard Richardson
2015-06-27Added an exclude_encodings argument to UnicodeDammit and to theLeonard Richardson
2015-06-26Added a sanity check helper method that makes sure all the elements of a tree...Leonard Richardson
2015-06-24Fixed an import error in Python 3.5 caused by the removal of theLeonard Richardson
2015-06-24Made double sure that we don't use the 'strict' constructor argument when it'...Leonard Richardson
2015-06-24If the initial <html> tag contains a CDATA list attribute such asLeonard Richardson
2015-06-23Got a hacky fix for the latest html5lib problem.Leonard Richardson
2014-12-11Improved the lxml tree builder's handling of processingLeonard Richardson
2014-12-07In Python 3.4 and above, set the new convert_charrefs argument toLeonard Richardson
2014-12-07Tweaked the parser warning.Leonard Richardson
2014-12-07Issue a warning if the BeautifulSoup constructor arguments do not explicitly ...Leonard Richardson
2013-10-18Fixed yet another problem that caused the html5lib tree builder toLeonard Richardson
2013-10-01Fixed a bug in which short Unicode input was improperly encoded to ASCII when...Leonard Richardson
2013-08-13* Fixed yet another problem with the html5lib tree builder, caused byLeonard Richardson
2013-06-03Save another Element creation.Leonard Richardson
2013-06-03Improved performance for html5lib.Leonard Richardson
2013-06-03Improved performance of _replace_cdata_list_attribute_values, and greatly red...Leonard Richardson
2013-06-02Merged in big encoding-detection refactoring branch.Leonard Richardson
2013-06-02Turns out we had two bits of code to strip byte-order marks.Leonard Richardson
2013-06-02It turns out most of the untested code wasn't doing anything useful.Leonard Richardson
2013-06-02Treat an lxml ParserError as a ParserRejectedMarkup.Leonard Richardson
2013-05-31The html.parser treebuilder can now handle numeric attributes inLeonard Richardson
2013-05-31Create a new lxml parser object for every new parsing strategy.Leonard Richardson
2013-05-20The default XML formatter will now replace ampersands even if they appear to ...Leonard Richardson
2013-05-20The .next_element attribute used during parsing was confusingly similar to th...Leonard Richardson
2013-05-20Fixed another bug by which the html5lib tree builder could create aLeonard Richardson
2013-05-09Changed lxml.feed() to handle the eventuality that it may be given a bytestring.Leonard Richardson
2013-05-09Added a diagnostic function for randomly generating a simple, invalid HTML do...Leonard Richardson
2013-05-07Now that lxml's segfault on invalid doctype has been fixed, fix aLeonard Richardson
2012-10-11Fix a bug in the lxml treebuilder which crashed when a tag includedLeonard Richardson
2012-09-28Fixed package name.Leonard Richardson
2012-08-21We don't need a special insertComment method, we just need to make Element.ap...Leonard Richardson
2012-08-21Fixed a problem with the html5lib builder not handling comments correctly.Leonard Richardson
2012-08-16Use namespace prefixes for namespaced attribute names, instead ofLeonard Richardson