Age  Commit message  Author
2018-07-28  When markup contains duplicate elements, a select() call that includes multiple match clauses will match all relevant elements. [bug=1770596]  Leonard Richardson
2018-07-28  Correctly handle invalid HTML numeric character entities like &#147; which reference code points that are not Unicode code points. Note that this is only fixed when Beautiful Soup is used with the html.parser parser -- html5lib already worked and I couldn't fix it with lxml. [bug=1782933]  Leonard Richardson
2018-07-21  Clarified the deprecation warning when accessing tag.fooTag, to cover the possibility that you might really have been looking for a tag called 'fooTag'.  Leonard Richardson
2018-07-21  Fixed a problem where the html.parser tree builder interpreted a string like '&foo ' as the character entity '&foo;' [bug=1728706]  Leonard Richardson
2018-07-21  Include LICENSE in the manifest. [bug=1736563]  Leonard Richardson
2018-07-19  Clarified phrasing.  Leonard Richardson
2018-07-18  Fixed a bug where find_all() was not working when asked to find a tag with a namespaced name in an XML document that was parsed as HTML. [bug=1723783]  Leonard Richardson
2018-07-18  Preserve XML namespaces when they are introduced inside an XML document, not just the ones introduced at the top level. [bug=1718787]  Leonard Richardson
2018-07-15  You can pass a dictionary of attrs into BeautifulSoup.new_tag. This makes it possible to create a tag with an attribute like 'name' that would otherwise be masked by another argument of new_tag. [bug=1779276]  Leonard Richardson
2018-07-15  Corrected some typos in the documentation.  Leonard Richardson
2018-07-15  Introduced the Formatter system. [bug=1716272].  Leonard Richardson
2018-07-15  It's possible for a TreeBuilder subclass to specify that void elements should be represented as <element> rather than <element/>, by setting TreeBuilder.void_element_close_prefix to the empty string. [bug=1716272]  Leonard Richardson
2018-07-15  Improved the 'no parser specified' warning so it doesn't show up in a REPL.  Leonard Richardson
2018-07-15  Stop data loss when encountering an empty numeric entity, and possibly in other cases. Thanks to tos.kamiya for the fix. [bug=1698503]  Leonard Richardson
2018-07-14  Fixed a disconnected parse tree when one BeautifulSoup object was inserted into another. [bug=1105148]  Leonard Richardson
2018-07-14  Fix an error in the warning when run from REPL.  Leonard Richardson
2018-07-14  Bring in some more code from warnings.py.  Leonard Richardson
2018-07-14  Improve the technique for finding the line number with the problematic method call.  Leonard Richardson
2018-07-14  Stopped HTMLParser from raising an exception in very rare cases of bad markup. [bug=1708831]  Leonard Richardson
2018-07-14  Fixed a Windows crash in diagnose() when checking whether a long markup string is a filename. [bug=1737121]  Leonard Richardson
2018-07-14  Fixed code that was causing deprecation warnings in recent Python 3 versions. Includes a patch from Ville Skyttä. [bug=1778909] [bug=1689496]  Leonard Richardson
2018-07-14  Improve the warning given when no parser is specified. [bug=1780571]  Leonard Richardson
2017-10-01  Fix two typos in docstt.
2017-05-07  Prep for 4.6.0 release.  Leonard Richardson
2017-05-07  Namespace prefix is preserved when an XML tag is copied. Thanks to Vikas for a patch and test. [bug=1685172]  Leonard Richardson
2017-05-07  Corrected formatting of warning.  Leonard Richardson
2017-05-06  Replace get_attribute_text with get_attribute_list.  Leonard Richardson
2017-05-06  Improved the handling of empty-element tags like <br> when using the html.parser parser. [bug=1676935]  Leonard Richardson
2017-05-06  Renamed convenience method to get_attribute_text.  Leonard Richardson
2017-05-06  Added the method, which acts like for getting the value of an attribute, but which joins attribute multi-values into a single string value. [bug=1678589]  Leonard Richardson
2017-05-06  HTML parsers treat all HTML4 and HTML5 empty element tags (aka void element tags) correctly. [bug=1656909]  Leonard Richardson
2017-05-06  It's now possible to use a tag's namespace prefix when searching, e.g. soup.find('namespace:tag') [bug=1655332]  Leonard Richardson
2017-05-06  Implement ResultSet.__getattr__ to give a helpful message in a common error scenario.  Leonard Richardson
2017-05-06  Change no-parser-specified warning to avoid the implication that you should put your markup into square brackets.  Leonard Richardson
2017-01-02  I need to do another release because of an error while running the release script.  Leonard Richardson
2017-01-02  Prep for 4.5.2 release.  Leonard Richardson
2016-12-19  Indentation change contributed by Pranav Salunke.  Leonard Richardson
2016-12-19  Fixed foster parenting when html5lib is the tree builder. Thanks to Geoffrey Sneddon for a patch and test.  Leonard Richardson
2016-12-19  Remove breakpoint.  Leonard Richardson
2016-12-19  Fixed yet another problem that caused the html5lib tree builder to  Leonard Richardson
2016-12-19  Documentation fixes. [bug=1651050]  Leonard Richardson
2016-12-11  Show how to use the attrs argument to search by the 'name' attribute. [bug=1639580]  Leonard Richardson
2016-12-10  Corrected documentation left over from when class was treated as a single-valued attribute. [bug=1631743]  Leonard Richardson
2016-08-02  Bump version number.  Leonard Richardson
2016-07-30  Explained why we test both unicode and bytestring processing instructions.  Leonard Richardson
2016-07-26  Fixed a reported (but not duplicated) bug involving processing instructions fed into the lxml HTML parser.  Leonard Richardson
2016-07-26  Clarify installation differences between Python 2 and Python 3. Contributed by James Lu.  Leonard Richardson
2016-07-26  Clarify that Beautiful Soup is no longer compatible with versions of Python pre-2.7. Contributed by Ville Skyttä.  Leonard Richardson
2016-07-26  Change the way open() is used. Code contributed by Ville Skyttä.  Leonard Richardson
2016-07-26  Renamed deprecated method. Contributed by Ville Skyttä.  Leonard Richardson
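
The 2018-07-28 entry on invalid HTML numeric character entities can be illustrated with a small sketch. A reference such as &#147; names code point 147, which Unicode reserves for a control character; pages that use it almost always mean the Windows-1252 character at byte 147, a left double quotation mark. The helper decode_numeric_reference below is a hypothetical name and is not taken from Beautiful Soup's source. It only shows one plausible way to resolve such references, assuming a Windows-1252 fallback for the range 0x80-0x9F. The standard library's html.unescape applies a similar mapping and is printed alongside for comparison.

```python
import html

REPLACEMENT = "\N{REPLACEMENT CHARACTER}"


def decode_numeric_reference(code_point: int) -> str:
    """Resolve the number from a reference like &#147; to a string.

    Hypothetical helper for illustration only; this is an assumption
    about one reasonable approach, not Beautiful Soup's implementation.
    """
    # Anything outside the Unicode range cannot be represented at all.
    if not 0 <= code_point <= 0x10FFFF:
        return REPLACEMENT
    # 0x80-0x9F are control characters in Unicode. Markup that references
    # them usually intends the Windows-1252 character at that byte value.
    if 0x80 <= code_point <= 0x9F:
        try:
            return bytes([code_point]).decode("windows-1252")
        except UnicodeDecodeError:
            # A few byte values (such as 0x81) are undefined in Windows-1252.
            return REPLACEMENT
    return chr(code_point)


if __name__ == "__main__":
    print(repr(decode_numeric_reference(147)))
    print(repr(html.unescape("&#147;quoted&#148;")))
```

The log entry notes that the fix applies only when Beautiful Soup is used with html.parser; how each parser resolves these references internally is not shown in this log, so the sketch should not be read as a description of any of them.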