Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-03-20 | Added fuzz tests. | Leonard Richardson | |
2023-03-20 | Added a test harness for clusterfuzz test cases. | Leonard Richardson | |
2023-02-15 | Removed some error checking code from diagnose(), which is redundant with | Leonard Richardson | |
similar (but more Pythonic) code in the BeautifulSoup constructor. [bug=2007344] | |||
2023-02-15 | When the html.parser parser decides it can't parse a document, Beautiful | Leonard Richardson | |
Soup now consistently propagates this fact by raising a ParserRejectedMarkup error. [bug=2007343] | |||
2023-02-15 | Added missing import. | Leonard Richardson | |
2023-02-12 | Improved documentation of new method. | Leonard Richardson | |
2023-02-10 | Add compile method | facelessuser | |
2023-02-10 | Remove unnecessary test and don't pass namespace on precompiled select | facelessuser | |
2023-02-07 | Removed Soup Sieve fallback method, added documentation. | Leonard Richardson | |
2023-02-04 | Added a __getattr__ fallback. | Leonard Richardson | |
2023-02-03 | Added tests of CSS.escape. | Leonard Richardson | |
2023-02-03 | Move the Soup Sieve proxy and its tests into separate files. | Leonard Richardson | |
2023-02-03 | Consistently use the name 'tag' instead of 'element,' since CSS selectors ↵ | Leonard Richardson | |
only operate on tags. Verify that select() and filter() return ResultSets. | |||
2023-02-03 | Removed redundant whitespace. | Leonard Richardson | |
2023-02-03 | Added some docstrings and made the return values more consistent. | Leonard Richardson | |
2023-02-02 | Test implementation. | Leonard Richardson | |
2023-01-31 | Fixed missing import that caused test failures when Soup Sieve is installed. | Leonard Richardson | |
2023-01-31 | Consistently use pytest.mark.skipif to skip tests when the corresponding ↵ | Leonard Richardson | |
libraries are not installed. | |||
2023-01-29 | Reworded the 'multi-valued attributes' portion of the documentation to make ↵ | Leonard Richardson | |
it more clear. [bug=1970767] | |||
2023-01-28 | Incremented version number. | Leonard Richardson | |
2023-01-27 | Parametrize the 'string is deprecated' warning test so we can test all of ↵ | Leonard Richardson | |
the relevant methods. | |||
2023-01-27 | Check the associated filename for more warnings. | Leonard Richardson | |
2023-01-27 | Change the tests that check warnings to also (indirectly) verify that the ↵ | Leonard Richardson | |
stacklevel associated with the warning is more or less correct. | |||
2023-01-27 | Implemented the more complicated case of providing an appropriate stacklevel ↵ | Leonard Richardson | |
for the warning issued when the deprecated 'text' argument is passed in. | |||
2023-01-27 | Got rid of some more warnings by removing code that's not relevant anymore, ↵ | Leonard Richardson | |
now that the minimum supported Python version is 3.6. | |||
2023-01-27 | Warnings now do their best to provide an appropriate stacklevel, | Leonard Richardson | |
improving the usefulness of the message. [bug=1978744] | |||
2023-01-25 | Tag.interesting_string_types is now propagated when a tag is | Leonard Richardson | |
copied. [bug=1990400] | |||
2023-01-25 | Made the ISO-8859 test robust in a less hacky way. | Leonard Richardson | |
2023-01-25 | Made the ISO-8859-1 smoke test more robust. | Leonard Richardson | |
2023-01-25 | Removed very old documentation that was imported as part of the bzr import ↵ | Leonard Richardson | |
but not removed. | |||
2023-01-25 | The HTMLFormatter and XMLFormatter constructors no longer return a | Leonard Richardson | |
value. [bug=1992693] | |||
2023-01-25 | Passing a Tag's .contents into PageElement.extend() now works the | Leonard Richardson | |
same way as passing the Tag itself. | |||
2022-05-15 | Fixed a test failure when cchardet is not installed but | Leonard Richardson | |
charset_normalizer is. [bug=1973072] | |||
2022-04-10 | Fixed another crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] | |||
2022-04-08 | Some cleanup work to get more consistent and complete about what gets ↵ | Leonard Richardson | |
packaged with the Beautiful Soup release. | |||
2022-04-07 | Omit untrusted input when issuing warnings. | Leonard Richardson | |
2021-12-21 | Standardized the wording of the MarkupResemblesLocatorWarning | Leonard Richardson | |
warnings to to make them less judgemental about what you ought to be doing. [bug=1955450] | |||
2021-12-21 | It's now possible to customize the way output is indented by | Leonard Richardson | |
providing a value for the 'indent' argument to the Formatter constructor. The 'indent' argument works very similarly to the argument of the same name in the Python standard library's json.dump() method. [bug=1955497] | |||
2021-12-19 | Remove a huge list of HTML entities that was only necessary under Python 2. | Leonard Richardson | |
2021-12-19 | Removed support for the iconv_codec library, which doesn't seem | Leonard Richardson | |
to exist anymore and was never put up on PyPI. (The closest replacement on PyPI, iconv_codecs, is GPL-licensed, so we can't use it.) | |||
2021-12-19 | If the charset-normalizer Python module | Leonard Richardson | |
(https://pypi.org/project/charset-normalizer/) is installed, Beautiful Soup will use it to detect the character sets of incoming documents. This is also the module used by newer versions of the Requests library. For the sake of backwards compatibility, chardet and cchardet both take precedence if installed. [bug=1955346] | |||
2021-12-17 | Fix a crash when pickling a BeautifulSoup object that has no | Leonard Richardson | |
tree builder. [bug=1934003] | |||
2021-11-29 | Do a better job of keeping track of namespaces as an XML document is | Leonard Richardson | |
parsed, so that CSS selectors that use namespaces will do the right thing more often. [bug=1946243] | |||
2021-10-24 | Added test of warn_if_markup_looks_like_xml. | Leonard Richardson | |
2021-10-24 | Issue a warning when an HTML parser is used to parse a document that | Leonard Richardson | |
looks like XML but not XHTML. [bug=1939121] | |||
2021-10-24 | Used a warning to formally deprecate the 'text' argument in favor of 'string'. | Leonard Richardson | |
2021-10-23 | Changing find* tests to use string instead of text, except for one test that ↵ | Leonard Richardson | |
specifically checks that text is an alias for string. | |||
2021-10-23 | Renamed the 'text' field to 'string' for real. Tests are not changed in this ↵ | Leonard Richardson | |
commit to demonstrate that the renaming doesn't break anything. [bug=1947038] | |||
2021-10-23 | Added a workaround for an lxml bug ↵ | Leonard Richardson | |
(https://bugs.launchpad.net/lxml/+bug/1948551) that caused problems when parsing a Unicode string beginning with BYTE ORDER MARK. [bug=1947768] | |||
2021-10-23 | Fixed a crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] |