Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-01-31 | Consistently use pytest.mark.skipif to skip tests when the corresponding ↵ | Leonard Richardson | |
libraries are not installed. | |||
2023-01-27 | Parametrize the 'string is deprecated' warning test so we can test all of ↵ | Leonard Richardson | |
the relevant methods. | |||
2023-01-27 | Check the associated filename for more warnings. | Leonard Richardson | |
2023-01-25 | The HTMLFormatter and XMLFormatter constructors no longer return a | Leonard Richardson | |
value. [bug=1992693] | |||
2023-01-25 | Passing a Tag's .contents into PageElement.extend() now works the | Leonard Richardson | |
same way as passing the Tag itself. | |||
2021-10-24 | Used a warning to formally deprecate the 'text' argument in favor of 'string'. | Leonard Richardson | |
2021-10-23 | Changing find* tests to use string instead of text, except for one test that ↵ | Leonard Richardson | |
specifically checks that text is an alias for string. | |||
2021-10-11 | More test refactoring. | Leonard Richardson | |
2021-10-11 | Broke up some monolithic unit test files. | Leonard Richardson | |
2021-10-11 | Moved the test classes to tests/__init__.py. | Leonard Richardson | |
2021-10-09 | Moved testing.py into the same package as the tests. | Leonard Richardson | |
2021-09-12 | Ported unit tests to use pytest. | Leonard Richardson | |
2021-09-07 | Goodbye, Python 2. [bug=1942919] | Leonard Richardson | |
2021-06-01 | The 'replace_with()' method now takes a variable number of arguments, | Leonard Richardson | |
and can be used to replace a single element with a sequence of elements. Patch by Bill Chandos. | |||
2021-02-14 | NavigableString and its subclasses now implement the get_text() | Leonard Richardson | |
method, as well as the properties .strings and .stripped_strings. These methods will either return the string itself, or nothing, so the only reason to use this is when iterating over a list of mixed Tag and NavigableString objects. [bug=1904309] | |||
2021-02-14 | The 'html5' formatter now treats attributes whose values are the | Leonard Richardson | |
empty string as HTML boolean attributes. Previously (and in other formatters), an attribute value must be set as None to be treated as a boolean attribute. In a future release, I plan to also give this behavior to the 'html' formatter. Patch by Isaac Muse. [bug=1915424] | |||
2021-02-13 | The behavior of methods like .get_text() and .strings now differs | Leonard Richardson | |
depending on the type of tag. The change is visible with HTML tags like <script>, <style>, and <template>. Starting in 4.9.0, methods like get_text() returned no results on such tags, because the contents of those tags are not considered 'text' within the document as a whole. But a user who calls script.get_text() is working from a different definition of 'text' than a user who calls div.get_text()--otherwise there would be no need to call script.get_text() at all. In 4.10.0, the contents of (e.g.) a <script> tag are considered 'text' during a get_text() call on the tag itself, but not considered 'text' during a get_text() call on the tag's parent. Because of this change, calling get_text() on each child of a tag may now return a different result than calling get_text() on the tag itself. That's because different tags now have different understandings of what counts as 'text'. [bug=1906226] [bug=1868861] | |||
2021-02-13 | Corrected the use of special string container classes in cases when a | Leonard Richardson | |
single tag may contain strings with different containers; such as the <template> tag, which may contain both TemplateString objects and Comment objects. [bug=1913406] | |||
2020-09-26 | Fixed a bug that inconsistently moved elements over when passing | Leonard Richardson | |
a Tag, rather than a list, into Tag.extend(). [bug=1885710] | |||
2020-04-12 | Fixed test failures when run against soupselect 2.0. Patch by Tomáš | Leonard Richardson | |
Chvátal. [bug=1872279] | |||
2020-04-05 | Embedded CSS and Javascript is now stored in distinct Stylesheet and | Leonard Richardson | |
Script tags, which are ignored by methods like get_text(). This feature is not supported by the html5lib treebuilder. [bug=1868861] | |||
2020-01-01 | API CHANGE - Added PageElement.decomposed, a new property which lets you | Leonard Richardson | |
check whether you've already called decompose() on a Tag or NavigableString. | |||
2019-12-29 | Fixed an unhandled exception when formatting a Tag that had been ↵ | Leonard Richardson | |
decomposed.[bug=1857767] | |||
2019-08-21 | Copying a Tag preserves information that was originally obtained from | Leonard Richardson | |
the TreeBuilder used to build the original Tag. [bug=1838903] | |||
2019-08-21 | Fixed a crash when pretty-printing tags that were not created | Leonard Richardson | |
during initial parsing. [bug=1838903] | |||
2019-07-15 | Implemented Tag.smooth. | Leonard Richardson | |
2019-07-15 | Moved the formatter to its own class and updated its documentation. | Leonard Richardson | |
2019-07-15 | Improved comments in tests. | Leonard Richardson | |
2019-07-14 | Give the Formatter class more control over formatting decisions. | Leonard Richardson | |
2019-07-07 | A Formatter can now decide how (or whether) to order the attributes | Leonard Richardson | |
inside a tag. [bug=1812422] | |||
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes ↵ | Leonard Richardson | |
dictionary by passing in a replacement. None will disable the feature altogether. [bug=1832978] | |||
2019-01-06 | Fixed an incorrectly raised exception when inserting a tag before or | Leonard Richardson | |
after an identical tag. [bug=1810692] | |||
2018-12-31 | Improved and tested error checking for insert_before and insert_after. | Leonard Richardson | |
2018-12-30 | Add convienances for inserting multiple tags | Isaac Muse | |
Add extend method to append a list of tags. Make insert_before and insert_after accept multiple arguments | |||
2018-12-19 | Add Soup Sieve support | Isaac Muse | |
2018-07-30 | Fix an exception when a custom formatter was asked to format a void | Leonard Richardson | |
element. [bug=1784408] | |||
2018-07-28 | When markup contains duplicate elements, a select() call that | Leonard Richardson | |
includes multiple match clauses will match all relevant elements. [bug=1770596] | |||
2018-07-28 | Correctly handle invalid HTML numeric character entities like “ | Leonard Richardson | |
which reference code points that are not Unicode code points. Note that this is only fixed when Beautiful Soup is used with the html.parser parser -- html5lib already worked and I couldn't fix it with lxml. [bug=1782933] | |||
2018-07-15 | You can pass a dictionary of into | Leonard Richardson | |
BeautifulSoup.new_tag. This makes it possible to create a tag with an attribute like 'name' that would otherwise be masked by another argument of new_tag. [bug=1779276] | |||
2018-07-15 | Introduced the Formatter system. [bug=1716272]. | Leonard Richardson | |
2018-07-14 | Fixed a disconnected parse tree when one BeautifulSoup object was | Leonard Richardson | |
inserted into another. [bug=1105148] | |||
2018-07-14 | Fixed code that was causing deprecation warnings in recent Python 3 | Leonard Richardson | |
versions. Includes a patch from Ville Skyttä. [bug=1778909] [bug=1689496] | |||
2017-05-06 | Replace get_attribute_text with get_attribute_list. | Leonard Richardson | |
2017-05-06 | Renamed convenience method to get_attribute_text. | Leonard Richardson | |
2017-05-06 | Added the method, which acts like for | Leonard Richardson | |
getting the value of an attribute, but which joins attribute multi-values into a single string value. [bug=1678589] | |||
2017-05-06 | It's now possible to use a tag's namespace prefix when searching, | Leonard Richardson | |
e.g. soup.find('namespace:tag') [bug=1655332] | |||
2016-07-26 | Spelling fixes | Ville Skyttä | |
2016-07-19 | Fixed test that fails in Python 3.5. | Leonard Richardson | |
2016-07-18 | Pass in bytes so that the BeautifulSoup object always has an original_encoding. | Leonard Richardson | |
2016-07-18 | If a search against each individual value of a multi-valued | Leonard Richardson | |
attribute fails, the search will be run one final time against the complete attribute value considered as a single string. [bug=1476868] |