Age | Commit message (Collapse) | Author | |
---|---|---|---|
2020-04-21 | Added two distinct UserWarning subclasses for warnings issued from the ↵ | Leonard Richardson | |
BeautifulSoup constructor which a caller may want to filter out. [bug=1873787] | |||
2020-04-12 | Fixed test failures when run against soupselect 2.0. Patch by Tomáš | Leonard Richardson | |
Chvátal. [bug=1872279] | |||
2020-04-07 | Add Script, Stylesheet, and TemplateString to the 'bs4' namespace. | Leonard Richardson | |
2020-04-05 | Embedded CSS and Javascript is now stored in distinct Stylesheet and | Leonard Richardson | |
Script tags, which are ignored by methods like get_text(). This feature is not supported by the html5lib treebuilder. [bug=1868861] | |||
2020-04-04 | Use an :rtype: reported to work in pycharm. | Leonard Richardson | |
2020-04-04 | select() always returns a Tag, so be more specific about its return type. | Leonard Richardson | |
2020-03-10 | Fixed a bug that happened when passing a Unicode filename containing | Leonard Richardson | |
non-ASCII characters as markup into Beautiful Soup, on a system that allows Unicode filenames. [bug=1866717] | |||
2020-03-09 | Make find() methods return a union type of the two most common PageElements, ↵ | Leonard Richardson | |
rather than PageElement itself. | |||
2020-03-05 | Added a performance optimization to PageElement.extract(). Patch by Arthur ↵ | Leonard Richardson | |
Darcet. | |||
2020-01-01 | API CHANGE - Added PageElement.decomposed, a new property which lets you | Leonard Richardson | |
check whether you've already called decompose() on a Tag or NavigableString. | |||
2019-12-29 | Fixed an unhandled exception when formatting a Tag that had been ↵ | Leonard Richardson | |
decomposed.[bug=1857767] | |||
2019-12-24 | Bumped version number. | Leonard Richardson | |
2019-12-24 | Minor changes to docstrings. | Leonard Richardson | |
2019-12-24 | Added :rtype: to the find method docstrings. | Leonard Richardson | |
2019-12-24 | Added docstrings for some but not all tree buidlers. | Leonard Richardson | |
2019-12-24 | Added docstrings to diagnose.py. | Leonard Richardson | |
2019-12-24 | Wrote docstrings for formatter.py. | Leonard Richardson | |
2019-12-24 | Fixed deprecation warning. [bug=1855301] | Leonard Richardson | |
2019-12-24 | Added docstrings to all public methods in dammit.py. | Leonard Richardson | |
2019-12-20 | Added docstrings to all methods in __init__.py | Leonard Richardson | |
2019-12-18 | Added Python docstrings to all public methods in element.py. | Leonard Richardson | |
2019-11-11 | Simplified code. | Leonard Richardson | |
2019-11-11 | The html.parser tree builder now correctly handles DOCTYPEs that are | Leonard Richardson | |
not uppercase. [bug=1848401] | |||
2019-11-11 | Fixed a deprecation warning on Python 3.7. Patch by Colin | Leonard Richardson | |
Watson. [bug=1847592] | |||
2019-11-11 | Added a Brazilian Portuguese translation by Cezar Peixeiro. | Leonard Richardson | |
2019-11-10 | Fix deprecation warning with Python >= 3.7. | Colin Watson | |
Python >= 3.7 issues a deprecation warning when using collections.Callable rather than collections.abc.Callable. Most of Beautiful Soup deals with this by using a conditional import, but the automatic Python 3 conversion apparently translates `callable(obj)` to `isinstance(obj, collections.Callable)` which trips this deprecation warning. `isinstance(obj, Callable)` works fine in Python 2 as well as 3, so just use it directly. | |||
2019-10-06 | Added section on Python 2 sunsetting. | Leonard Richardson | |
2019-10-05 | Avoid a crash when unpickling certain parse trees generated using html5lib ↵ | Leonard Richardson | |
on Python 3. [bug=1843545] | |||
2019-09-02 | Avoid a crash when trying to detect the declared encoding of a | Leonard Richardson | |
Unicode document. Raise an explanatory exception when the underlying parser completely rejects the incoming markup. [bug=1838877] | |||
2019-08-26 | It's now possible to override any of the element classes. | Leonard Richardson | |
2019-08-26 | Fixed the definition of the default XML namespace when using | Leonard Richardson | |
lxml 4.4. Patch by Isaac Muse. [bug=1840141] | |||
2019-08-22 | Test the ability to build a tree using objects other than Tag and ↵ | Leonard Richardson | |
NavigableString. | |||
2019-08-21 | When instantiating a BeautifulSoup object, it's now possible to | Leonard Richardson | |
provide replacement classes to be instantiated for every tag ('tag_class') or string ('string_class') encountered during parsing, rather than using the default Tag and NavigableString objects. | |||
2019-08-21 | Copying a Tag preserves information that was originally obtained from | Leonard Richardson | |
the TreeBuilder used to build the original Tag. [bug=1838903] | |||
2019-08-21 | Explicitly set preserve_whitespace_tags to None if there is no TreeBuilder. | Leonard Richardson | |
2019-08-21 | Fixed a crash when pretty-printing tags that were not created | Leonard Richardson | |
during initial parsing. [bug=1838903] | |||
2019-07-22 | Added a section about project support to the README. | Leonard Richardson | |
2019-07-21 | Implemented line number tracking for html5lib. | Leonard Richardson | |
2019-07-21 | Adapt Chris Mayo's code to track line number and position when using ↵ | Leonard Richardson | |
html.parser. | |||
2019-07-16 | Prep for release. | Leonard Richardson | |
2019-07-16 | Suppressed warnings during tests that aren't about the warnings. | Leonard Richardson | |
2019-07-15 | Implemented Tag.smooth. | Leonard Richardson | |
2019-07-15 | Moved the formatter to its own class and updated its documentation. | Leonard Richardson | |
2019-07-15 | Improved comments in tests. | Leonard Richardson | |
2019-07-14 | Give the Formatter class more control over formatting decisions. | Leonard Richardson | |
2019-07-07 | A Formatter can now decide how (or whether) to order the attributes | Leonard Richardson | |
inside a tag. [bug=1812422] | |||
2019-07-07 | ' (which is valid in XML and XHTML, but not HTML 4) is now | Leonard Richardson | |
recognized as a named entity and converted to a single quote. [bug=1818721] | |||
2019-07-07 | Renamed the cdata_list_attributes argument to multi_valued_attributes since ↵ | Leonard Richardson | |
it's facing the end-user and that's a more easily understandable name. | |||
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes ↵ | Leonard Richardson | |
dictionary by passing in a replacement. None will disable the feature altogether. [bug=1832978] | |||
2019-07-07 | It's now possible to customize the TreeBuilder object by passing | Leonard Richardson | |
keyword arguments into the BeautifulSoup constructor. The main reason to do this right now is to change how multi-valued attributes are treated. [bug=1832978] |