path: root/bs4/builder
Age | Commit message | Author
2019-11-11 | The html.parser tree builder now correctly handles DOCTYPEs that are | Leonard Richardson
2019-11-11 | Added a Brazilian Portuguese translation by Cezar Peixeiro. | Leonard Richardson
2019-09-02 | Avoid a crash when trying to detect the declared encoding of a | Leonard Richardson
2019-07-21 | Implemented line number tracking for html5lib. | Leonard Richardson
2019-07-21 | Adapt Chris Mayo's code to track line number and position when using html.par... | Leonard Richardson
2019-07-14 | Give the Formatter class more control over formatting decisions. | Leonard Richardson
2019-07-07 | Renamed the cdata_list_attributes argument to multi_valued_attributes since i... | Leonard Richardson
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes dictionar... | Leonard Richardson
2019-01-06 | Don't track un-prefixed namespaces | Isaac Muse
2018-12-30 | Fixed a problem with multi-valued attributes where the value | Leonard Richardson
2018-12-24 | Clarified the software license. | Leonard Richardson
2018-12-24 | Keep track of the namespace abbreviations found while parsing the document. T... | Leonard Richardson
2018-12-22 | Fix next and previous linkage issues. Fixes issues #1806598 and #1782928. | Isaac Muse
2018-08-12 | Converted README to Markdown format. | Leonard Richardson
2018-07-28 | Correctly handle invalid HTML numeric character entities like “ | Leonard Richardson
2018-07-21 | Fixed a problem where the html.parser tree builder interpreted | Leonard Richardson
2018-07-18 | Preserve XML namespaces when they are introduced inside an XML | Leonard Richardson
2018-07-15 | Introduced the Formatter system. [bug=1716272]. | Leonard Richardson
2018-07-15 | It's possible for a TreeBuilder subclass to specify that void | Leonard Richardson
2018-07-15 | Stop data loss when encountering an empty numeric entity, and | Leonard Richardson
2018-07-14 | Stopped HTMLParser from raising an exception in very rare cases of | Leonard Richardson
2017-05-06 | Improved the handling of empty-element tags like <br> when using the | Leonard Richardson
2017-05-06 | HTML parsers treat all HTML4 and HTML5 empty element tags (aka void element t... | Leonard Richardson
2016-12-19 | Fixed foster parenting when html5lib is the tree builder. Thanks to Geoffrey ... | Leonard Richardson
2016-12-19 | Fixed yet another problem that caused the html5lib tree builder to | Leonard Richardson
2016-07-30 | Explained why we test both unicode and bytestring processing instructions. | Leonard Richardson
2016-07-26 | Fixed a reported (but not duplicated) bug involving processing instructions f... | Leonard Richardson
2016-07-16 | Beautiful Soup will now work with versions of html5lib greater than | Leonard Richardson
2016-07-16 | Removed imports to pdb, since pdb is not available in some environments. [bug... | Leonard Richardson
2016-07-16 | The contents of <textarea> tags will no longer be modified when the | Leonard Richardson
2016-07-16 | Added a separate class for XML processing instructions, which have a slightly... | Leonard Richardson
2016-07-16 | Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file. | Leonard Richardson
2015-12-08 | Fix foster parenting with html5lib. | Geoffrey Sneddon
2015-12-08 | Make TreeBuilderForHtml5lib strictly follow the html5lib API. | Geoffrey Sneddon
2015-09-28 | Fixed a parse bug with the html5lib tree-builder. Thanks to Roel | Leonard Richardson
2015-06-28 | It's now possible to pickle a BeautifulSoup object no matter which | Leonard Richardson
2015-06-28 | Changed the way soup objects work under copy.copy(). Copying a | Leonard Richardson
2015-06-28 | Fixed a bug where Element.extract() could create an infinite loop in | Leonard Richardson
2015-06-28 | Accept 'xml' as an unambiguous identifier for the lxml XML parser, since it's... | Leonard Richardson
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the | Leonard Richardson
2015-06-26 | Added a sanity check helper method that makes sure all the elements of a tree... | Leonard Richardson
2015-06-24 | Fixed an import error in Python 3.5 caused by the removal of the | Leonard Richardson
2015-06-24 | Made double sure that we don't use the 'strict' constructor argument when it'... | Leonard Richardson
2015-06-24 | If the initial <html> tag contains a CDATA list attribute such as | Leonard Richardson
2015-06-23 | Got a hacky fix for the latest html5lib problem. | Leonard Richardson
2014-12-11 | Improved the lxml tree builder's handling of processing | Leonard Richardson
2014-12-07 | In Python 3.4 and above, set the new convert_charrefs argument to | Leonard Richardson
2014-12-07 | Tweaked the parser warning. | Leonard Richardson
2014-12-07 | Issue a warning if the BeautifulSoup constructor arguments do not explicitly ... | Leonard Richardson
2013-10-18 | Fixed yet another problem that caused the html5lib tree builder to | Leonard Richardson