Age | Commit message | Author
2023-01-27 | Parametrize the 'string is deprecated' warning test so we can test all of the relevant methods. | Leonard Richardson
2023-01-27 | Check the associated filename for more warnings. | Leonard Richardson
2023-01-27 | Change the tests that check warnings to also (indirectly) verify that the stacklevel associated with the warning is more or less correct. | Leonard Richardson
2023-01-27 | Implemented the more complicated case of providing an appropriate stacklevel for the warning issued when the deprecated 'text' argument is passed in. | Leonard Richardson
2023-01-27 | Got rid of some more warnings by removing code that's not relevant anymore, now that the minimum supported Python version is 3.6. | Leonard Richardson
2023-01-27 | Warnings now do their best to provide an appropriate stacklevel, improving the usefulness of the message. [bug=1978744] | Leonard Richardson
2023-01-25 | Tag.interesting_string_types is now propagated when a tag is copied. [bug=1990400] | Leonard Richardson
2023-01-25 | Added missing Fish-Footman image from documentation. | Leonard Richardson
2023-01-25 | Made the ISO-8859 test robust in a less hacky way. | Leonard Richardson
2023-01-25 | Made the ISO-8859-1 smoke test more robust. | Leonard Richardson
2023-01-25 | Removed very old copy of the code that was imported as part of the bzr import but not removed. | Leonard Richardson
2023-01-25 | Removed very old copy of the code that was imported as part of the bzr import but not removed. | Leonard Richardson
2023-01-25 | Added sphinx Makefile, which was not originally under version control. | Leonard Richardson
2023-01-25 | Removed very old documentation that was imported as part of the bzr import but not removed. | Leonard Richardson
2023-01-25 | The HTMLFormatter and XMLFormatter constructors no longer return a value. [bug=1992693] | Leonard Richardson
2023-01-25 | Passing a Tag's .contents into PageElement.extend() now works the same way as passing the Tag itself. | Leonard Richardson
2023-01-25 | Removed very old tests that were imported as part of the bzr import but not removed. | Leonard Richardson
2023-01-23 | Corrected a typo in several translations of the documentation. | Leonard Richardson
2023-01-23 | Corrected typo in Russian translation of the documentation. | Leonard Richardson
2022-05-15 | Fixed a test failure when cchardet is not installed but charset_normalizer is. [bug=1973072] | Leonard Richardson
2022-04-10 | Fixed another crash when overriding multi_valued_attributes and using the html5lib parser. [bug=1948488] | Leonard Richardson
2022-04-10 | Install more dependencies before running the pre-release tests. | Leonard Richardson
2022-04-08 | Some cleanup work to get more consistent and complete about what gets packaged with the Beautiful Soup release. | Leonard Richardson
2022-04-07 | Incremented version number in documentation. | Leonard Richardson
2022-04-07 | Redid the increasingly irrelevant test-all-versions script to use pytest. | Leonard Richardson
2022-04-07 | Omit untrusted input when issuing warnings. | Leonard Richardson
2021-12-22 | Corrected error in documentation (patch by Frank Dana). | Leonard Richardson
2021-12-22 | Correct documentation on parser differences | FeRD (Frank Dana)
2021-12-21 | Added a bit about not modifying the .contents list directly. | Leonard Richardson
2021-12-21 | Corrected typo. | Leonard Richardson
2021-12-21 | Standardized the wording of the MarkupResemblesLocatorWarning warnings to make them less judgemental about what you ought to be doing. [bug=1955450] | Leonard Richardson
2021-12-21 | Fixed typo in documentation spotted by a reader. | Leonard Richardson
2021-12-21 | I guess that's not a method. | Leonard Richardson
2021-12-21 | It's now possible to customize the way output is indented by providing a value for the 'indent' argument to the Formatter constructor. The 'indent' argument works very similarly to the argument of the same name in the Python standard library's json.dump() method. [bug=1955497] | Leonard Richardson
2021-12-19 | Remove a huge list of HTML entities that was only necessary under Python 2. | Leonard Richardson
2021-12-19 | Removed support for the iconv_codec library, which doesn't seem to exist anymore and was never put up on PyPI. (The closest replacement on PyPI, iconv_codecs, is GPL-licensed, so we can't use it.) | Leonard Richardson
2021-12-19 | If the charset-normalizer Python module (https://pypi.org/project/charset-normalizer/) is installed, Beautiful Soup will use it to detect the character sets of incoming documents. This is also the module used by newer versions of the Requests library. For the sake of backwards compatibility, chardet and cchardet both take precedence if installed. [bug=1955346] | Leonard Richardson
2021-12-17 | Fix a crash when pickling a BeautifulSoup object that has no tree builder. [bug=1934003] | Leonard Richardson
2021-11-29 | Do a better job of keeping track of namespaces as an XML document is parsed, so that CSS selectors that use namespaces will do the right thing more often. [bug=1946243] | Leonard Richardson
2021-10-24 | Added test of warn_if_markup_looks_like_xml. | Leonard Richardson
2021-10-24 | Issue a warning when an HTML parser is used to parse a document that looks like XML but not XHTML. [bug=1939121] | Leonard Richardson
2021-10-24 | Used a warning to formally deprecate the 'text' argument in favor of 'string'. | Leonard Richardson
2021-10-23 | Changing find* tests to use string instead of text, except for one test that specifically checks that text is an alias for string. | Leonard Richardson
2021-10-23 | Renamed the 'text' field to 'string' for real. Tests are not changed in this commit to demonstrate that the renaming doesn't break anything. [bug=1947038] | Leonard Richardson
2021-10-23 | Added a workaround for an lxml bug (https://bugs.launchpad.net/lxml/+bug/1948551) that caused problems when parsing a Unicode string beginning with BYTE ORDER MARK. [bug=1947768] | Leonard Richardson
2021-10-23 | Fixed a crash when overriding multi_valued_attributes and using the html5lib parser. [bug=1948488] | Leonard Richardson
2021-10-23 | Fix a Python 3-specific problem in diagnose.lxml_trace. | Leonard Richardson
2021-10-11 | Removed redundant and nonworking argument from example code. [bug=1946243] | Leonard Richardson
2021-10-11 | Added special string classes, RubyParenthesisString and RubyTextString, to make it possible to treat ruby text specially in get_text() calls. [bug=1941980] | Leonard Richardson
2021-10-11 | More test refactoring. | Leonard Richardson
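
Several entries in this log concern deprecating the 'text' argument in favor of 'string' and giving the resulting warning an appropriate stacklevel. The sketch below illustrates the general idea using only the standard library's warnings module. It is not Beautiful Soup's code: the find_all() function, its signature, the warning text, and the caller() helper are all hypothetical stand-ins chosen for illustration.

```python
import warnings


def find_all(name=None, string=None, **kwargs):
    """Hypothetical API that still accepts 'text' as a deprecated alias for 'string'."""
    if "text" in kwargs:
        # stacklevel=2 attributes the warning to the code that called
        # find_all(), rather than to this line inside the "library".
        warnings.warn(
            "The 'text' argument is deprecated. Use 'string' instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        string = kwargs.pop("text")
    return name, string


def caller():
    return find_all("a", text="hello")  # the warning should point at this line


# Record warnings instead of printing them, so they can be inspected.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = caller()

for w in caught:
    print(w.category.__name__, w.lineno, w.message)
```

With stacklevel=1 (the default) the recorded line number would be the warnings.warn() call itself, which tells the user nothing about where their own code needs to change. Checking the recorded filename and line number, as the tests mentioned in this log do indirectly, is one way to confirm that the stacklevel is roughly right.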