Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-04-07 | Fixed an unhandled exception in BeautifulSoup.decode_contents | Leonard Richardson | |
and methods that call it. [bug=2015545] | |||
2023-04-05 | Prepare for 4.12.1 release. | Leonard Richardson | |
2023-04-05 | The demonstrate_parser_differences.py script was still written in | Leonard Richardson | |
Python 2. I've converted it to Python 3, but since no one noticed this problem, it's a sign that no one uses this script and it's not serving its purpose. I may rework or remove it in a later version. | |||
2023-04-05 | Remove setup.py, completing the migration to hatch. Add a .gitignore file ↵ | Leonard Richardson | |
based on the soupsieve .gitignore file. | |||
2023-04-05 | Move the Python 2 notice to the last version to support Python 2, and add a ↵ | Leonard Richardson | |
similar notice for 3.6 | |||
2023-03-30 | Added to README. | Leonard Richardson | |
2023-03-28 | Reworded changelog. | Leonard Richardson | |
2023-03-27 | Make it possible to pickle a deeply nested BeautifulSoup object. | Leonard Richardson | |
2023-03-26 | Implement a proper BeautifulSoup.deepcopy rather than parsing the document ↵ | Leonard Richardson | |
again. | |||
2023-03-24 | Added a test just to verify that you can encode a document more deeply ↵ | Leonard Richardson | |
nested than the Python recursion limit. | |||
2023-03-20 | Added unit tests for fuzz test cases created by third | Leonard Richardson | |
parties. Most of these tests are skipped since they either point out problems in code outside of Beautiful Soup, or problems with Beautiful Soup that haven't been resolved yet, but this puts them all in one convenient place. | |||
2023-03-20 | Prepare for 4.12.0 release. | Leonard Richardson | |
2023-02-15 | Removed some error checking code from diagnose(), which is redundant with | Leonard Richardson | |
similar (but more Pythonic) code in the BeautifulSoup constructor. [bug=2007344] | |||
2023-02-15 | When the html.parser parser decides it can't parse a document, Beautiful | Leonard Richardson | |
Soup now consistently propagates this fact by raising a ParserRejectedMarkup error. [bug=2007343] | |||
2023-02-12 | Improved documentation of new method. | Leonard Richardson | |
2023-02-08 | Fixed syntax errors in documentation. | Leonard Richardson | |
2023-02-07 | Removed Soup Sieve fallback method, added documentation. | Leonard Richardson | |
2023-01-31 | Fixed missing import that caused test failures when Soup Sieve is installed. | Leonard Richardson | |
2023-01-31 | Consistently use pytest.mark.skipif to skip tests when the corresponding ↵ | Leonard Richardson | |
libraries are not installed. | |||
2023-01-28 | Incremented version number. | Leonard Richardson | |
2023-01-27 | Implemented the more complicated case of providing an appropriate stacklevel ↵ | Leonard Richardson | |
for the warning issued when the deprecated 'text' argument is passed in. | |||
2023-01-27 | Warnings now do their best to provide an appropriate stacklevel, | Leonard Richardson | |
improving the usefulness of the message. [bug=1978744] | |||
2023-01-25 | Tag.interesting_string_types is now propagated when a tag is | Leonard Richardson | |
copied. [bug=1990400] | |||
2023-01-25 | The HTMLFormatter and XMLFormatter constructors no longer return a | Leonard Richardson | |
value. [bug=1992693] | |||
2023-01-25 | Passing a Tag's .contents into PageElement.extend() now works the | Leonard Richardson | |
same way as passing the Tag itself. | |||
2022-05-15 | Fixed a test failure when cchardet is not installed but | Leonard Richardson | |
charset_normalizer is. [bug=1973072] | |||
2022-04-10 | Fixed another crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] | |||
2022-04-08 | Some cleanup work to get more consistent and complete about what gets ↵ | Leonard Richardson | |
packaged with the Beautiful Soup release. | |||
2022-04-07 | Omit untrusted input when issuing warnings. | Leonard Richardson | |
2021-12-21 | Corrected typo. | Leonard Richardson | |
2021-12-21 | Standardized the wording of the MarkupResemblesLocatorWarning | Leonard Richardson | |
warnings to to make them less judgemental about what you ought to be doing. [bug=1955450] | |||
2021-12-21 | I guess that's not a method. | Leonard Richardson | |
2021-12-21 | It's now possible to customize the way output is indented by | Leonard Richardson | |
providing a value for the 'indent' argument to the Formatter constructor. The 'indent' argument works very similarly to the argument of the same name in the Python standard library's json.dump() method. [bug=1955497] | |||
2021-12-19 | Remove a huge list of HTML entities that was only necessary under Python 2. | Leonard Richardson | |
2021-12-19 | Removed support for the iconv_codec library, which doesn't seem | Leonard Richardson | |
to exist anymore and was never put up on PyPI. (The closest replacement on PyPI, iconv_codecs, is GPL-licensed, so we can't use it.) | |||
2021-12-19 | If the charset-normalizer Python module | Leonard Richardson | |
(https://pypi.org/project/charset-normalizer/) is installed, Beautiful Soup will use it to detect the character sets of incoming documents. This is also the module used by newer versions of the Requests library. For the sake of backwards compatibility, chardet and cchardet both take precedence if installed. [bug=1955346] | |||
2021-12-17 | Fix a crash when pickling a BeautifulSoup object that has no | Leonard Richardson | |
tree builder. [bug=1934003] | |||
2021-11-29 | Do a better job of keeping track of namespaces as an XML document is | Leonard Richardson | |
parsed, so that CSS selectors that use namespaces will do the right thing more often. [bug=1946243] | |||
2021-10-24 | Issue a warning when an HTML parser is used to parse a document that | Leonard Richardson | |
looks like XML but not XHTML. [bug=1939121] | |||
2021-10-24 | Used a warning to formally deprecate the 'text' argument in favor of 'string'. | Leonard Richardson | |
2021-10-23 | Renamed the 'text' field to 'string' for real. Tests are not changed in this ↵ | Leonard Richardson | |
commit to demonstrate that the renaming doesn't break anything. [bug=1947038] | |||
2021-10-23 | Added a workaround for an lxml bug ↵ | Leonard Richardson | |
(https://bugs.launchpad.net/lxml/+bug/1948551) that caused problems when parsing a Unicode string beginning with BYTE ORDER MARK. [bug=1947768] | |||
2021-10-23 | Fixed a crash when overriding multi_valued_attributes and using the | Leonard Richardson | |
html5lib parser. [bug=1948488] | |||
2021-10-11 | Added special string classes, RubyParenthesisString and RubyTextString, | Leonard Richardson | |
to make it possible to treat ruby text specially in get_text() calls. [bug=1941980] | |||
2021-09-12 | Ported unit tests to use pytest. | Leonard Richardson | |
2021-09-07 | Goodbye, Python 2. [bug=1942919] | Leonard Richardson | |
2021-06-01 | The 'replace_with()' method now takes a variable number of arguments, | Leonard Richardson | |
and can be used to replace a single element with a sequence of elements. Patch by Bill Chandos. | |||
2021-05-31 | The html.parser tree builder can now handles named entities | Leonard Richardson | |
found in the HTML5 spec in much the same way that the html5lib tree builder does. Note that the lxml tree builder still handles named entities differently. [bug=1924908] | |||
2021-02-14 | NavigableString and its subclasses now implement the get_text() | Leonard Richardson | |
method, as well as the properties .strings and .stripped_strings. These methods will either return the string itself, or nothing, so the only reason to use this is when iterating over a list of mixed Tag and NavigableString objects. [bug=1904309] | |||
2021-02-14 | The 'html5' formatter now treats attributes whose values are the | Leonard Richardson | |
empty string as HTML boolean attributes. Previously (and in other formatters), an attribute value must be set as None to be treated as a boolean attribute. In a future release, I plan to also give this behavior to the 'html' formatter. Patch by Isaac Muse. [bug=1915424] |