summaryrefslogtreecommitdiff
path: root/bs4
AgeCommit message (Collapse)Author
2017-05-07Corrected formatting of warning.Leonard Richardson
2017-05-06Replace get_attribute_text with get_attribute_list.Leonard Richardson
2017-05-06 Improved the handling of empty-element tags like <br> when using theLeonard Richardson
html.parser parser. [bug=1676935]
2017-05-06Renamed convenience method to get_attribute_text.Leonard Richardson
2017-05-06Added the method, which acts like forLeonard Richardson
getting the value of an attribute, but which joins attribute multi-values into a single string value. [bug=1678589]
2017-05-06HTML parsers treat all HTML4 and HTML5 empty element tags (aka void element ↵Leonard Richardson
tags) correctly. [bug=1656909]
2017-05-06It's now possible to use a tag's namespace prefix when searching,Leonard Richardson
e.g. soup.find('namespace:tag') [bug=1655332]
2017-05-06Implement ResultSet.__getattr__ to give a helpful message in a common error ↵Leonard Richardson
scenario.
2017-05-06Change no-parser-specified warning to avoid the implication that you should ↵Leonard Richardson
put your markup into square brackets.
2017-01-02I need to do another release because of an error while running the release ↵Leonard Richardson
script.
2017-01-02Prep for 4.5.2 release.Leonard Richardson
2016-12-19Indentation change contributed by Pranav Salunke.Leonard Richardson
2016-12-19Fixed foster parenting when html5lib is the tree builder. Thanks to Geoffrey ↵Leonard Richardson
Sneddon for a patch and test.
2016-12-19Remove breakpoint.Leonard Richardson
2016-12-19Fixed yet another problem that caused the html5lib tree builder toLeonard Richardson
2016-08-02Bump version number.Leonard Richardson
2016-07-30Explained why we test both unicode and bytestring processing instructions.Leonard Richardson
2016-07-26Fixed a reported (but not duplicated) bug involving processing instructions ↵Leonard Richardson
fed into the lxml HTML parser.
2016-07-26Clarify that Beautiful Soup is no longer compatible with versions of Python ↵Leonard Richardson
pre-2.7. Contributed by Ville Skyttä.
2016-07-26Change the way open() is used. Code contributed by Ville Skyttä.Leonard Richardson
2016-07-26Renamed deprecated method. Contributed by Ville Skyttä.Leonard Richardson
2016-07-26Spelling fixesVille Skyttä
2016-07-26Use assertEqual instead of deprecated assertEqualsVille Skyttä
2016-07-26Open/close files with "with"Ville Skyttä
2016-07-26Clarify Python 2(.7) support statusVille Skyttä
2016-07-19Fixed test that fails in Python 3.5.Leonard Richardson
2016-07-18Pass in bytes so that the BeautifulSoup object always has an original_encoding.Leonard Richardson
2016-07-18If a search against each individual value of a multi-valuedLeonard Richardson
attribute fails, the search will be run one final time against the complete attribute value considered as a single string. [bug=1476868]
2016-07-18Corrected an encoding error that happened when a BeautifulSoupLeonard Richardson
object was copied. [bug=1554439]
2016-07-18Added support for CSS selector values that contain quoted spaces,Leonard Richardson
such as tag[style="display: foo"]. [bug=1540588]
2016-07-18The argument to now works correctly, though it'sLeonard Richardson
not implemented very efficiently. [bug=1520530]
2016-07-17Use a dedicated logger instead of the root logger. [bug=1511661]Leonard Richardson
2016-07-17Use a dedicated logger instead of the root logger. [bug=1511661]Leonard Richardson
2016-07-17 When a BeautifulSoup object is pickled but its tree builder cannotLeonard Richardson
be pickled, its .builder attribute is set to None instead of being destroyed. This avoids a performance problem once the object is unpickled. [bug=1523629]
2016-07-17Although the previously fixed problem only occurs when using the html5lib ↵Leonard Richardson
tree builder, it's not actually a problem with the tree builder itself.
2016-07-17Fixed a bug in the html5lib treebuilder that deranged the treeLeonard Richardson
when a whitespace element was reparented into a tag that contained an identical whitespace element. [bug=1505351]
2016-07-17Use known_xml instead of continually adding underscores to is_xml.Leonard Richardson
2016-07-17Whenever possible, keep track ahead of time whether a PageElement is HTML or ↵Leonard Richardson
XML.
2016-07-16Beautiful Soup will now work with versions of html5lib greater thanLeonard Richardson
0.99999999. [bug=1603299]
2016-07-16We don't run the check for a filename passed in as markup if theLeonard Richardson
'filename' contains a less-than character; the less-than character indicates it's most likely a very small document. [bug=1577864]
2016-07-16Removed imports to pdb, since pdb is not available in some environments. ↵Leonard Richardson
[bug=1491700]
2016-07-16corrected typo. [bug=1561510]Leonard Richardson
2016-07-16Specify the file and line number when warning about aLeonard Richardson
BeautifulSoup object being instantiated without a parser being specified. [bug=1574647]
2016-07-16The contents of <textarea> tags will no longer be modified when theLeonard Richardson
tree is prettified. [bug=1555829]
2016-07-16Fixed a Python 3 ByteWarning when a URL was passed in as though itLeonard Richardson
were markup. Thanks to James Salter for a patch and test. [bug=1533762]
2016-07-16Added a separate class for XML processing instructions, which have a ↵Leonard Richardson
slightly different format from SGML processing instructions. [bug=1504383]
2016-07-16Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file.Leonard Richardson
2016-04-06Minor change. Extra indent for character so it looks nicer.Pranav Salunke
2015-12-08Fix foster parenting with html5lib.Geoffrey Sneddon
This makes all of the html5lib tests pass. Yay!
2015-12-08Make TreeBuilderForHtml5lib strictly follow the html5lib API.Geoffrey Sneddon
This slightly changes the constructor (to make soup optional), and adds a testSerializer method so the tests can be run against it.