summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2020-06-11Converted the sample code in README.md to Python 3.Leonard Richardson
2020-05-31Make the doc a little less defensive.Leonard Richardson
2020-05-31Added to the troubleshooting section a bit to catch searches for the ↵Leonard Richardson
AttributeError that happens if you treat a string like a tag.
2020-05-30Fixed a bug that caused too many tags to be popped from the tagLeonard Richardson
stack during tree building, when encountering a closing tag that had no matching opening tag. [bug=1880420]
2020-05-30Remove explicit reference to the module name within the module, replacing it ↵Leonard Richardson
with __name__.
2020-05-17Prep for release.Leonard Richardson
2020-05-17Switch entirely to Python 3-style print statements, even in Python 2.Leonard Richardson
2020-05-17Documented some recently added customization features.Leonard Richardson
2020-05-17Added docstring for BeautifulSoup.new_tag.Leonard Richardson
2020-05-17Added a keyword argument on_duplicate_attribute to theLeonard Richardson
BeautifulSoupHTMLParser constructor (used by the html.parser tree builder) which lets you customize the handling of markup that contains the same attribute more than once, as in: <a href="url1" href="url2"> [bug=1878209]
2020-04-25Try to clarify the docs further that get_text now returns human-readable text.Leonard Richardson
2020-04-24If you encode a document with a Python-specific encoding likeLeonard Richardson
'unicode_escape', that encoding is no longer mentioned in the final XML or HTML document. Instead, encoding information is omitted or left blank. [bug=1874955]
2020-04-21Fixed typo.Leonard Richardson
2020-04-21Added two distinct UserWarning subclasses for warnings issued from the ↵Leonard Richardson
BeautifulSoup constructor which a caller may want to filter out. [bug=1873787]
2020-04-12Fixed test failures when run against soupselect 2.0. Patch by TomášLeonard Richardson
Chvátal. [bug=1872279]
2020-04-07Add Script, Stylesheet, and TemplateString to the 'bs4' namespace.Leonard Richardson
2020-04-07Added a notice about the new behavior of .text to the documentation.Leonard Richardson
2020-04-05Set up a different soupsieve dependency for Python 2.Leonard Richardson
2020-04-05Embedded CSS and Javascript is now stored in distinct Stylesheet andLeonard Richardson
Script tags, which are ignored by methods like get_text(). This feature is not supported by the html5lib treebuilder. [bug=1868861]
2020-04-04Use an :rtype: reported to work in pycharm.Leonard Richardson
2020-04-04select() always returns a Tag, so be more specific about its return type.Leonard Richardson
2020-04-04Added a Russian translation by 'authoress' to the repository.Leonard Richardson
2020-04-04Corrected error in Chinese translation, found by "One J".Leonard Richardson
2020-03-10Fixed a bug that happened when passing a Unicode filename containingLeonard Richardson
non-ASCII characters as markup into Beautiful Soup, on a system that allows Unicode filenames. [bug=1866717]
2020-03-09Make find() methods return a union type of the two most common PageElements, ↵Leonard Richardson
rather than PageElement itself.
2020-03-06Added a paragraph about the fact that prettify() adds whitespace to a document.Leonard Richardson
2020-03-05Added a performance optimization to PageElement.extract(). Patch by Arthur ↵Leonard Richardson
Darcet.
2020-01-22Merging in request 377978Leonard Richardson
2020-01-23Fix a confusing typo in the description of formatter="html5".Colin Watson
2020-01-01API CHANGE - Added PageElement.decomposed, a new property which lets youLeonard Richardson
check whether you've already called decompose() on a Tag or NavigableString.
2019-12-29Fixed an unhandled exception when formatting a Tag that had been ↵Leonard Richardson
decomposed.[bug=1857767]
2019-12-24Clarify the expected output to hopefully make it clear that part of a long ↵Leonard Richardson
paragraph is being omitted.
2019-12-24How about getting the year right.Leonard Richardson
2019-12-24Corrected sunsetting date.Leonard Richardson
2019-12-24Bumped version number.Leonard Richardson
2019-12-24Minor changes to docstrings.Leonard Richardson
2019-12-24Added :rtype: to the find method docstrings.Leonard Richardson
2019-12-24Added docstrings for some but not all tree buidlers.Leonard Richardson
2019-12-24Added docstrings to diagnose.py.Leonard Richardson
2019-12-24Wrote docstrings for formatter.py.Leonard Richardson
2019-12-24Fixed deprecation warning. [bug=1855301]Leonard Richardson
2019-12-24Added docstrings to all public methods in dammit.py.Leonard Richardson
2019-12-20Added docstrings to all methods in __init__.pyLeonard Richardson
2019-12-18Added Python docstrings to all public methods in element.py.Leonard Richardson
2019-11-11Simplified code.Leonard Richardson
2019-11-11The html.parser tree builder now correctly handles DOCTYPEs that areLeonard Richardson
not uppercase. [bug=1848401]
2019-11-11Added a version sentence to the English documentation.Leonard Richardson
2019-11-11Improved the presentation of the Korean translation.Leonard Richardson
2019-11-11Got the Brazilian Portuguese doc to build without warnings.Leonard Richardson
2019-11-11Fixed a deprecation warning on Python 3.7. Patch by ColinLeonard Richardson
Watson. [bug=1847592]