Age | Commit message | Author | |
---|---|---|---|
2018-12-23 | Merging Isaac Muse's Soup Sieve branch as-is before making some modifications. | Leonard Richardson | |
2018-12-22 | Fix next and previous linkage issues. Fixes issues #1806598 and #1782928. | Isaac Muse | |
2018-12-19 | Add Soup Sieve support | Isaac Muse | |
2018-07-30 | Fix an exception when a custom formatter was asked to format a void element. [bug=1784408] | Leonard Richardson | |
2018-07-28 | When markup contains duplicate elements, a select() call that includes multiple match clauses will match all relevant elements. [bug=1770596] | Leonard Richardson | |
2018-07-28 | Correctly handle invalid HTML numeric character entities like `&#147;` which reference code points that are not Unicode code points. Note that this is only fixed when Beautiful Soup is used with the html.parser parser -- html5lib already worked and I couldn't fix it with lxml. [bug=1782933] | Leonard Richardson | |
2018-07-15 | You can pass a dictionary of into BeautifulSoup.new_tag. This makes it possible to create a tag with an attribute like 'name' that would otherwise be masked by another argument of new_tag. [bug=1779276] | Leonard Richardson | |
2018-07-15 | Introduced the Formatter system. [bug=1716272]. | Leonard Richardson | |
2018-07-15 | Stop data loss when encountering an empty numeric entity, and possibly in other cases. Thanks to tos.kamiya for the fix. [bug=1698503] | Leonard Richardson | |
2018-07-14 | Fixed a disconnected parse tree when one BeautifulSoup object was inserted into another. [bug=1105148] | Leonard Richardson | |
2018-07-14 | Stopped HTMLParser from raising an exception in very rare cases of bad markup. [bug=1708831] | Leonard Richardson | |
2018-07-14 | Fixed code that was causing deprecation warnings in recent Python 3 versions. Includes a patch from Ville Skyttä. [bug=1778909] [bug=1689496] | Leonard Richardson | |
2017-05-06 | Replace get_attribute_text with get_attribute_list. | Leonard Richardson | |
2017-05-06 | Improved the handling of empty-element tags like <br> when using the html.parser parser. [bug=1676935] | Leonard Richardson | |
2017-05-06 | Renamed convenience method to get_attribute_text. | Leonard Richardson | |
2017-05-06 | Added the method, which acts like for getting the value of an attribute, but which joins attribute multi-values into a single string value. [bug=1678589] | Leonard Richardson | |
2017-05-06 | It's now possible to use a tag's namespace prefix when searching, e.g. soup.find('namespace:tag') [bug=1655332] | Leonard Richardson | |
2016-12-19 | Fixed foster parenting when html5lib is the tree builder. Thanks to Geoffrey Sneddon for a patch and test. | Leonard Richardson | |
2016-12-19 | Remove breakpoint. | Leonard Richardson | |
2016-12-19 | Fixed yet another problem that caused the html5lib tree builder to | Leonard Richardson | |
2016-07-26 | Clarify that Beautiful Soup is no longer compatible with versions of Python pre-2.7. Contributed by Ville Skyttä. | Leonard Richardson | |
2016-07-26 | Renamed deprecated method. Contributed by Ville Skyttä. | Leonard Richardson | |
2016-07-26 | Spelling fixes | Ville Skyttä | |
2016-07-26 | Use assertEqual instead of deprecated assertEquals | Ville Skyttä | |
2016-07-26 | Clarify Python 2(.7) support status | Ville Skyttä | |
2016-07-19 | Fixed test that fails in Python 3.5. | Leonard Richardson | |
2016-07-18 | Pass in bytes so that the BeautifulSoup object always has an original_encoding. | Leonard Richardson | |
2016-07-18 | If a search against each individual value of a multi-valued attribute fails, the search will be run one final time against the complete attribute value considered as a single string. [bug=1476868] | Leonard Richardson | |
2016-07-18 | Corrected an encoding error that happened when a BeautifulSoup object was copied. [bug=1554439] | Leonard Richardson | |
2016-07-18 | Added support for CSS selector values that contain quoted spaces, such as tag[style="display: foo"]. [bug=1540588] | Leonard Richardson | |
2016-07-18 | The argument to now works correctly, though it's not implemented very efficiently. [bug=1520530] | Leonard Richardson | |
2016-07-17 | Fixed a bug in the html5lib treebuilder that deranged the tree when a whitespace element was reparented into a tag that contained an identical whitespace element. [bug=1505351] | Leonard Richardson | |
2016-07-16 | Fixed a Python 3 ByteWarning when a URL was passed in as though it were markup. Thanks to James Salter for a patch and test. [bug=1533762] | Leonard Richardson | |
2015-12-08 | Fix foster parenting with html5lib. This makes all of the html5lib tests pass. Yay! | Geoffrey Sneddon | |
2015-09-28 | Fixed a parse bug with the html5lib tree-builder. Thanks to Roel Kramer for the patch. [bug=1483781] | Leonard Richardson | |
2015-09-28 | Improved the implementation of CSS selector grouping. Thanks to Orangain for the patch. [bug=1484543] | Leonard Richardson | |
2015-09-28 | Corrected the output of Declaration objects. [bug=1477847] | Leonard Richardson | |
2015-09-28 | Fixed a bug that deranged the tree when part of it was removed. Thanks to Eric Weiser for the patch and John Wiseman for a test. [bug=1481520] | Leonard Richardson | |
2015-07-05 | Fixed the test_detect_utf8 test so that it works when chardet is installed. [bug=1471359] | Leonard Richardson | |
2015-06-28 | It's now possible to pickle a BeautifulSoup object no matter which tree builder was used to create it. However, the only tree builder that survives the pickling process is the HTMLParserTreeBuilder ('html.parser'). If you unpickle a BeautifulSoup object created with some other tree builder, soup.builder will be None. [bug=1231545] | Leonard Richardson | |
2015-06-28 | Changed the way soup objects work under copy.copy(). Copying a NavigableString or a Tag will give you a new NavigableString that's equal to the old one but not connected to the parse tree. Patch by Martijn Peters. [bug=1307490] | Leonard Richardson | |
2015-06-28 | Copying a NavigableString will give you a new NavigableString that is not connected to the parse tree. | Leonard Richardson | |
2015-06-28 | Raise a NotImplementedError whenever an unsupported CSS pseudoclass is used in select(). Previously some cases did not result in a NotImplementedError. | Leonard Richardson | |
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408] | Leonard Richardson | |
2015-06-25 | Introduced the select_one() method, which uses a CSS selector but only returns the first match, instead of a list of matches. [bug=1349367] | Leonard Richardson | |
2015-06-25 | The text argument to the find_* methods is now called string, which is more accurate. text still works, but is the argument described in the documentation. text may eventually change its meaning, but not for a very long time. [bug=1366856] | Leonard Richardson | |
2015-06-25 | Improved the exception raised when you call .unwrap() or .replace_with() on an element that's not attached to a tree. | Leonard Richardson | |
2015-06-25 | __repr__ now returns an ASCII bytestring in Python 2, and a Unicode string in Python 3, instead of a UTF8-encoded bytestring in both versions. [bug=1420131] | Leonard Richardson | |
2015-06-25 | Fixed a crash in Unicode, Dammit's encoding detector when the name of the encoding itself contained invalid bytes. [bug=1360913] | Leonard Richardson | |
2015-06-24 | The select() method can now find tags with attributes whose names contain dashes. Patch by Marek Kapolka. [bug=1304007] | Leonard Richardson | |
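
The 2018-07-28 entry on invalid HTML numeric character entities is easier to follow with a concrete case. Numeric reference 147 falls in a range that HTML5 parsers remap rather than decode literally, and references above U+10FFFF have no code point at all. The sketch below uses only Python's standard-library `html.unescape`, which applies those HTML5 rules; it is not Beautiful Soup's own code, and the helper name `decode_entities` is made up for illustration.

```python
import html


def decode_entities(text: str) -> str:
    """Decode character references using the HTML5 rules in html.unescape.

    Hypothetical helper for illustration; Beautiful Soup's tree builders
    handle entities internally and do not expose this function.
    """
    return html.unescape(text)


# 147 (0x93) is remapped by the HTML5 rules to U+201C, a left double quote.
left_quote = decode_entities("&#147;")

# 1114112 is 0x110000, one past the last Unicode code point, so it is
# replaced with U+FFFD (REPLACEMENT CHARACTER).
out_of_range = decode_entities("&#1114112;")

print(repr(left_quote))
print(repr(out_of_range))
```

Whether a given parser backend produces the same characters is a separate question; the log entry itself notes that the fix applied to html.parser and not to lxml.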