path: root/bs4/builder/_lxml.py
Age | Commit message | Author
2021-11-29 | Do a better job of keeping track of namespaces as an XML document is | Leonard Richardson
2021-10-24 | Issue a warning when an HTML parser is used to parse a document that | Leonard Richardson
2021-10-23 | Added a workaround for an lxml bug (https://bugs.launchpad.net/lxml/+bug/19... | Leonard Richardson
2021-09-07 | Goodbye, Python 2. [bug=1942919] | Leonard Richardson
2021-02-13 | Added a second way to specify encodings to UnicodeDammit and | Leonard Richardson
2019-12-24 | Added docstrings for some but not all tree builders. | Leonard Richardson
2019-11-11 | Added a Brazilian Portuguese translation by Cezar Peixeiro. | Leonard Richardson
2019-09-02 | Avoid a crash when trying to detect the declared encoding of a | Leonard Richardson
2019-07-21 | Implemented line number tracking for html5lib. | Leonard Richardson
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes dictionar... | Leonard Richardson
2019-01-06 | Don't track un-prefixed namespaces | Isaac Muse
2018-12-24 | Clarified the software license. | Leonard Richardson
2018-12-24 | Keep track of the namespace abbreviations found while parsing the document. T... | Leonard Richardson
2018-07-18 | Preserve XML namespaces when they are introduced inside an XML | Leonard Richardson
2018-07-14 | Stopped HTMLParser from raising an exception in very rare cases of | Leonard Richardson
2016-07-30 | Explained why we test both unicode and bytestring processing instructions. | Leonard Richardson
2016-07-26 | Fixed a reported (but not duplicated) bug involving processing instructions f... | Leonard Richardson
2016-07-16 | Removed imports to pdb, since pdb is not available in some environments. [bug... | Leonard Richardson
2016-07-16 | Added a separate class for XML processing instructions, which have a slightly... | Leonard Richardson
2016-07-16 | Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file. | Leonard Richardson
2015-06-28 | Accept 'xml' as an unambiguous identifier for the lxml XML parser, since it's... | Leonard Richardson
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the | Leonard Richardson
2014-12-11 | Improved the lxml tree builder's handling of processing | Leonard Richardson
2014-12-07 | Tweaked the parser warning. | Leonard Richardson
2014-12-07 | Issue a warning if the BeautifulSoup constructor arguments do not explicitly ... | Leonard Richardson
2013-06-02 | Turns out we had two bits of code to strip byte-order marks. | Leonard Richardson
2013-06-02 | It turns out most of the untested code wasn't doing anything useful. | Leonard Richardson
2013-06-02 | Treat an lxml ParserError as a ParserRejectedMarkup. | Leonard Richardson
2013-05-31 | Create a new lxml parser object for every new parsing strategy. | Leonard Richardson
2013-05-09 | Changed lxml.feed() to handle the eventuality that it may be given a bytestring. | Leonard Richardson
2013-05-09 | Added a diagnostic function for randomly generating a simple, invalid HTML do... | Leonard Richardson
2012-10-11 | Fix a bug in the lxml treebuilder which crashed when a tag included | Leonard Richardson
2012-09-28 | Fixed package name. | Leonard Richardson
2012-08-16 | Use namespace prefixes for namespaced attribute names, instead of | Leonard Richardson
2012-05-29 | Removed breakpoints. | Leonard Richardson
2012-05-29 | Prep for release. | Leonard Richardson
2012-05-24 | Fixed a bug with the lxml treebuilder that prevented the user from adding att... | Leonard Richardson
2012-04-03 | Got rid of the 4.0.2 workaround for HTML documents--it was unnecessary and th... | Leonard Richardson
2012-04-03 | Don't split up the markup into chunks when using the lxml HTML parser, which ... | Leonard Richardson
2012-03-24 | Pass data into XMLParser.feed() in chunks. [bug=963880] | Leonard Richardson
2012-02-28 | Fixed the generated XML declaration. | Leonard Richardson
2012-02-23 | Fixed handling of the closing of namespaced tags. | Leonard Richardson
2012-02-23 | Merge from trunk and added tests. | Leonard Richardson
2012-02-22 | Added comments. | Leonard Richardson
2012-02-22 | Treat a new namespace mapping as a set of attributes on the tag that defines ... | Leonard Richardson
2012-02-21 | Have lxml invert namespace maps as they come in and set each tag's prefix app... | Leonard Richardson
2012-02-21 | Added nsprefix argument to the tag class. | Leonard Richardson
2012-02-16 | It's a start, at least. | Leonard Richardson
2012-02-09 | As a last-ditch attempt to turn data into Unicode, use errors=replace instead... | Leonard Richardson
2012-02-09 | Minor Unicode, Dammit cleanup. | Leonard Richardson