Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-02-03 | Consistently use the name 'tag' instead of 'element,' since CSS selectors ↵ | Leonard Richardson | |
only operate on tags. Verify that select() and filter() return ResultSets. | |||
2023-02-03 | Removed redundant whitespace. | Leonard Richardson | |
2023-02-03 | Added some docstrings and made the return values more consistent. | Leonard Richardson | |
2023-02-02 | Test implementation. | Leonard Richardson | |
2023-01-31 | Fixed missing import that caused test failures when Soup Sieve is installed. | Leonard Richardson | |
2023-01-31 | Consistently use pytest.mark.skipif to skip tests when the corresponding ↵ | Leonard Richardson | |
libraries are not installed. | |||
2023-01-29 | Reworded the 'multi-valued attributes' portion of the documentation to make ↵ | Leonard Richardson | |
it more clear. [bug=1970767] | |||
2023-01-28 | Incremented version number. | Leonard Richardson | |
2023-01-27 | Parametrize the 'string is deprecated' warning test so we can test all of ↵ | Leonard Richardson | |
the relevant methods. | |||
2023-01-27 | Check the associated filename for more warnings. | Leonard Richardson | |
2023-01-27 | Change the tests that check warnings to also (indirectly) verify that the ↵ | Leonard Richardson | |
stacklevel associated with the warning is more or less correct. | |||
2023-01-27 | Implemented the more complicated case of providing an appropriate stacklevel ↵ | Leonard Richardson | |
for the warning issued when the deprecated 'text' argument is passed in. | |||
2023-01-27 | Got rid of some more warnings by removing code that's not relevant anymore, ↵ | Leonard Richardson | |
now that the minimum supported Python version is 3.6. | |||
2023-01-27 | Warnings now do their best to provide an appropriate stacklevel, | Leonard Richardson | |
improving the usefulness of the message. [bug=1978744] | |||
2023-01-25 | Tag.interesting_string_types is now propagated when a tag is | Leonard Richardson | |
copied. [bug=1990400] | |||
2023-01-25 | Made the ISO-8859 test robust in a less hacky way. | Leonard Richardson | |
2023-01-25 | Made the ISO-8859-1 smoke test more robust. | Leonard Richardson | |
2023-01-25 | Removed very old documentation that was imported as part of the bzr import ↵ | Leonard Richardson | |
but not removed. | |||
2023-01-25 | The HTMLFormatter and XMLFormatter constructors no longer return a | Leonard Richardson | |
value. [bug=1992693] | |||
2023-01-25 | Passing a Tag's .contents into PageElement.extend() now works the | Leonard Richardson | |
same way as passing the Tag itself. | |||
2022-05-15 | Fixed a test failure when cchardet is not installed but | Leonard Richardson | |
charset_normalizer is. [bug=1973072] | |||
2022-04-10 | Fixed another crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] | |||
2022-04-08 | Some cleanup work to get more consistent and complete about what gets ↵ | Leonard Richardson | |
packaged with the Beautiful Soup release. | |||
2022-04-07 | Omit untrusted input when issuing warnings. | Leonard Richardson | |
2021-12-21 | Standardized the wording of the MarkupResemblesLocatorWarning | Leonard Richardson | |
warnings to to make them less judgemental about what you ought to be doing. [bug=1955450] | |||
2021-12-21 | It's now possible to customize the way output is indented by | Leonard Richardson | |
providing a value for the 'indent' argument to the Formatter constructor. The 'indent' argument works very similarly to the argument of the same name in the Python standard library's json.dump() method. [bug=1955497] | |||
2021-12-19 | Remove a huge list of HTML entities that was only necessary under Python 2. | Leonard Richardson | |
2021-12-19 | Removed support for the iconv_codec library, which doesn't seem | Leonard Richardson | |
to exist anymore and was never put up on PyPI. (The closest replacement on PyPI, iconv_codecs, is GPL-licensed, so we can't use it.) | |||
2021-12-19 | If the charset-normalizer Python module | Leonard Richardson | |
(https://pypi.org/project/charset-normalizer/) is installed, Beautiful Soup will use it to detect the character sets of incoming documents. This is also the module used by newer versions of the Requests library. For the sake of backwards compatibility, chardet and cchardet both take precedence if installed. [bug=1955346] | |||
2021-12-17 | Fix a crash when pickling a BeautifulSoup object that has no | Leonard Richardson | |
tree builder. [bug=1934003] | |||
2021-11-29 | Do a better job of keeping track of namespaces as an XML document is | Leonard Richardson | |
parsed, so that CSS selectors that use namespaces will do the right thing more often. [bug=1946243] | |||
2021-10-24 | Added test of warn_if_markup_looks_like_xml. | Leonard Richardson | |
2021-10-24 | Issue a warning when an HTML parser is used to parse a document that | Leonard Richardson | |
looks like XML but not XHTML. [bug=1939121] | |||
2021-10-24 | Used a warning to formally deprecate the 'text' argument in favor of 'string'. | Leonard Richardson | |
2021-10-23 | Changing find* tests to use string instead of text, except for one test that ↵ | Leonard Richardson | |
specifically checks that text is an alias for string. | |||
2021-10-23 | Renamed the 'text' field to 'string' for real. Tests are not changed in this ↵ | Leonard Richardson | |
commit to demonstrate that the renaming doesn't break anything. [bug=1947038] | |||
2021-10-23 | Added a workaround for an lxml bug ↵ | Leonard Richardson | |
(https://bugs.launchpad.net/lxml/+bug/1948551) that caused problems when parsing a Unicode string beginning with BYTE ORDER MARK. [bug=1947768] | |||
2021-10-23 | Fixed a crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] | |||
2021-10-23 | Fix a Python 3-specific problem in diagnose.lxml_trace. | Leonard Richardson | |
2021-10-11 | Added special string classes, RubyParenthesisString and RubyTextString, | Leonard Richardson | |
to make it possible to treat ruby text specially in get_text() calls. [bug=1941980] | |||
2021-10-11 | More test refactoring. | Leonard Richardson | |
2021-10-11 | Broke up some monolithic unit test files. | Leonard Richardson | |
2021-10-11 | Moved the test classes to tests/__init__.py. | Leonard Richardson | |
2021-10-09 | Moved testing.py into the same package as the tests. | Leonard Richardson | |
2021-09-12 | Ported unit tests to use pytest. | Leonard Richardson | |
2021-09-07 | Goodbye, Python 2. [bug=1942919] | Leonard Richardson | |
2021-06-01 | The 'replace_with()' method now takes a variable number of arguments, | Leonard Richardson | |
and can be used to replace a single element with a sequence of elements. Patch by Bill Chandos. | |||
2021-05-31 | The html.parser tree builder can now handles named entities | Leonard Richardson | |
found in the HTML5 spec in much the same way that the html5lib tree builder does. Note that the lxml tree builder still handles named entities differently. [bug=1924908] | |||
2021-04-08 | Brought in fuzz tests from the oss-project into Beautiful Soup's unit test ↵ | Leonard Richardson | |
suite. | |||
2021-02-14 | NavigableString and its subclasses now implement the get_text() | Leonard Richardson | |
method, as well as the properties .strings and .stripped_strings. These methods will either return the string itself, or nothing, so the only reason to use this is when iterating over a list of mixed Tag and NavigableString objects. [bug=1904309] |