Age | Commit message (Collapse) | Author | |
---|---|---|---|
2024-01-17 | Added the correct stacklevel to instances of the XMLParsedAsHTMLWarning. | Leonard Richardson | |
[bug=2034451] | |||
2023-01-27 | Warnings now do their best to provide an appropriate stacklevel, | Leonard Richardson | |
improving the usefulness of the message. [bug=1978744] | |||
2022-04-10 | Fixed another crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] | |||
2021-10-24 | Issue a warning when an HTML parser is used to parse a document that | Leonard Richardson | |
looks like XML but not XHTML. [bug=1939121] | |||
2021-10-23 | Fixed a crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] | |||
2021-09-07 | Goodbye, Python 2. [bug=1942919] | Leonard Richardson | |
2020-05-17 | Switch entirely to Python 3-style print statements, even in Python 2. | Leonard Richardson | |
2020-04-05 | Embedded CSS and Javascript is now stored in distinct Stylesheet and | Leonard Richardson | |
Script tags, which are ignored by methods like get_text(). This feature is not supported by the html5lib treebuilder. [bug=1868861] | |||
2019-11-11 | Added a Brazilian Portuguese translation by Cezar Peixeiro. | Leonard Richardson | |
2019-07-21 | Implemented line number tracking for html5lib. | Leonard Richardson | |
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes ↵ | Leonard Richardson | |
dictionary by passing in a replacement. None will disable the feature altogether. [bug=1832978] | |||
2018-12-30 | Fixed a problem with multi-valued attributes where the value | Leonard Richardson | |
contained whitespace. Thanks to Jens Svalgaard for the fix. [bug=1787453] | |||
2018-12-24 | Clarified the software license. | Leonard Richardson | |
2018-12-22 | Fix next and previous linkage issues. Fixes issues #1806598 and #1782928. | Isaac Muse | |
2016-12-19 | Fixed foster parenting when html5lib is the tree builder. Thanks to Geoffrey ↵ | Leonard Richardson | |
Sneddon for a patch and test. | |||
2016-12-19 | Fixed yet another problem that caused the html5lib tree builder to | Leonard Richardson | |
2016-07-16 | Beautiful Soup will now work with versions of html5lib greater than | Leonard Richardson | |
0.99999999. [bug=1603299] | |||
2016-07-16 | Removed imports to pdb, since pdb is not available in some environments. ↵ | Leonard Richardson | |
[bug=1491700] | |||
2016-07-16 | Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file. | Leonard Richardson | |
2015-12-08 | Fix foster parenting with html5lib. | Geoffrey Sneddon | |
This makes all of the html5lib tests pass. Yay! | |||
2015-12-08 | Make TreeBuilderForHtml5lib strictly follow the html5lib API. | Geoffrey Sneddon | |
This slightly changes the constructor (to make soup optional), and adds a testSerializer method so the tests can be run against it. | |||
2015-09-28 | Fixed a parse bug with the html5lib tree-builder. Thanks to Roel | Leonard Richardson | |
Kramer for the patch. [bug=1483781] | |||
2015-06-28 | Changed the way soup objects work under copy.copy(). Copying a | Leonard Richardson | |
NavigableString or a Tag will give you a new NavigableString that's equal to the old one but not connected to the parse tree. Patch by Martijn Peters. [bug=1307490] | |||
2015-06-28 | Fixed a bug where Element.extract() could create an infinite loop in | Leonard Richardson | |
the remaining tree. | |||
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the | Leonard Richardson | |
Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408] | |||
2015-06-26 | Added a sanity check helper method that makes sure all the elements of a ↵ | Leonard Richardson | |
tree are properly connected via .next_element and .previous_element. | |||
2015-06-24 | If the initial <html> tag contains a CDATA list attribute such as | Leonard Richardson | |
'class', the html5lib tree builder will now turn its value into a list, as it would with any other tag. [bug=1296481] | |||
2015-06-23 | Got a hacky fix for the latest html5lib problem. | Leonard Richardson | |
2014-12-07 | Issue a warning if the BeautifulSoup constructor arguments do not explicitly ↵ | Leonard Richardson | |
name a parser. | |||
2013-10-18 | Fixed yet another problem that caused the html5lib tree builder to | Leonard Richardson | |
create a disconnected parse tree. [bug=1237763] | |||
2013-08-13 | * Fixed yet another problem with the html5lib tree builder, caused by | Leonard Richardson | |
html5lib's tendency to rearrange the tree during parsing. [bug=1189267] | |||
2013-06-03 | Save another Element creation. | Leonard Richardson | |
2013-06-03 | Improved performance for html5lib. | Leonard Richardson | |
2013-05-31 | Create a new lxml parser object for every new parsing strategy. | Leonard Richardson | |
2013-05-20 | The .next_element attribute used during parsing was confusingly similar to ↵ | Leonard Richardson | |
the .next_element navigation attribute. Renamed the former to _most_recent_element. | |||
2013-05-20 | Fixed another bug by which the html5lib tree builder could create a | Leonard Richardson | |
disconnected tree. [bug=1182089] | |||
2012-08-21 | We don't need a special insertComment method, we just need to make ↵ | Leonard Richardson | |
Element.appendChild call object_was_parsed. | |||
2012-08-21 | Fixed a problem with the html5lib builder not handling comments correctly. | Leonard Richardson | |
2012-04-26 | The test suite now passes when lxml is not installed, whether or not ↵ | Leonard Richardson | |
html5lib is installed. [bug=987004] | |||
2012-04-18 | Got rid of contains_substitutions. | Leonard Richardson | |
2012-03-01 | Added missing __len__ method that stopped html5lib tree builder from working ↵ | Leonard Richardson | |
on nested formatting elements. [bug=943246] | |||
2012-02-24 | Warn when SoupStrainer is used with the html5lib tree builder. | Leonard Richardson | |
2012-02-23 | Bumped version number. | Leonard Richardson | |
2012-02-16 | It's a start, at least. | Leonard Richardson | |
2012-02-15 | Clarified comment. | Leonard Richardson | |
2012-02-15 | Removed _nodeIndex, because the misfeature it works around is now gone. | Leonard Richardson | |
2012-02-15 | Minor cleanup. | Leonard Richardson | |
2012-02-15 | Tested and cleaned up html5lib insertBefore. | Leonard Richardson | |
2012-02-15 | Use append instead of insert. | Leonard Richardson | |
2012-02-15 | Minor cleanup. | Leonard Richardson | |