path: root/bs4/__init__.py
Age | Commit message | Author
2016-07-18 | Corrected an encoding error that happened when a BeautifulSoup object was copied. [bug=1554439] | Leonard Richardson
2016-07-17 | When a BeautifulSoup object is pickled but its tree builder cannot be pickled, its .builder attribute is set to None instead of being destroyed. This avoids a performance problem once the object is unpickled. [bug=1523629] | Leonard Richardson
2016-07-17 | Although the previously fixed problem only occurs when using the html5lib tree builder, it's not actually a problem with the tree builder itself. | Leonard Richardson
2016-07-17 | Fixed a bug in the html5lib treebuilder that deranged the tree when a whitespace element was reparented into a tag that contained an identical whitespace element. [bug=1505351] | Leonard Richardson
2016-07-17 | Use known_xml instead of continually adding underscores to is_xml. | Leonard Richardson
2016-07-16 | Beautiful Soup will now work with versions of html5lib greater than 0.99999999. [bug=1603299] | Leonard Richardson
2016-07-16 | We don't run the check for a filename passed in as markup if the 'filename' contains a less-than character; the less-than character indicates it's most likely a very small document. [bug=1577864] | Leonard Richardson
2016-07-16 | corrected typo. [bug=1561510] | Leonard Richardson
2016-07-16 | Specify the file and line number when warning about a BeautifulSoup object being instantiated without a parser being specified. [bug=1574647] | Leonard Richardson
2016-07-16 | Fixed a Python 3 ByteWarning when a URL was passed in as though it were markup. Thanks to James Salter for a patch and test. [bug=1533762] | Leonard Richardson
2016-07-16 | Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file. | Leonard Richardson
2015-07-03 | Change setup.py to focus on creating wheels. | Leonard Richardson
2015-06-28 | It's now possible to pickle a BeautifulSoup object no matter which tree builder was used to create it. However, the only tree builder that survives the pickling process is the HTMLParserTreeBuilder ('html.parser'). If you unpickle a BeautifulSoup object created with some other tree builder, soup.builder will be None. [bug=1231545] | Leonard Richardson
2015-06-28 | Changed the way soup objects work under copy.copy(). Copying a NavigableString or a Tag will give you a new NavigableString that's equal to the old one but not connected to the parse tree. Patch by Martijn Peters. [bug=1307490] | Leonard Richardson
2015-06-28 | Reorganized changelog. | Leonard Richardson
2015-06-28 | Accept 'xml' as an unambiguous identifier for the lxml XML parser, since it's the only XML parser supported at the moment. | Leonard Richardson
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408] | Leonard Richardson
2015-06-25 | You can now create a NavigableString or a subclass just by invoking the constructor. [bug=1294315] | Leonard Richardson
2015-06-23 | Made the previous fix nicer by adding arguments to setup() that let us preserve a tag's existing place in the tree. | Leonard Richardson
2015-06-23 | Got a hacky fix for the latest html5lib problem. | Leonard Richardson
2015-06-23 | Force object_was_parsed() to keep the tree intact even when an element from later in the document is moved into place. [bug=1430633] | Leonard Richardson
2014-12-10 | The warning when you pass in a filename or URL as markup will now be displayed correctly even if the filename or URL is a Unicode string. [bug=1268888] | Leonard Richardson
2014-12-07 | Tweaked the parser warning. | Leonard Richardson
2014-12-07 | Issue a warning if the BeautifulSoup constructor arguments do not explicitly name a parser. | Leonard Richardson
2013-10-02 | Restored the helpful syntax error that happens when you try to import the Python 2 edition of Beautiful Soup under Python 3. [bug=1213387] | Leonard Richardson
2013-10-02 | Prep for release. | Leonard Richardson
2013-10-01 | Fixed a crash when a short input contains data not valid in filenames. [bug=1232604] | Leonard Richardson
2013-10-01 | Fixed a bug in which short Unicode input was improperly encoded to ASCII when checking whether or not it was a file on disk. [bug=1227016] | Leonard Richardson
2013-08-15 | Bumped version number. | Leonard Richardson
2013-08-13 | * Fixed yet another problem with the html5lib tree builder, caused by html5lib's tendency to rearrange the tree during parsing. [bug=1189267] | Leonard Richardson
2013-08-12 | A little cleanup. | Leonard Richardson
2013-06-03 | Changed _popToTag to run through a single range instead of two. | Leonard Richardson
2013-06-03 | Improved _popToTag a tiny bit. | Leonard Richardson
2013-06-03 | Inlined some commonly called code to save a function call. | Leonard Richardson
2013-06-03 | Made it a lot faster to check whether whitespace is being preserved. | Leonard Richardson
2013-06-03 | Put the more frequently-used ASCII spaces in front. | Leonard Richardson
2013-06-03 | Wrote a more efficient replacement for string.translate() when checking whether a string is nothing but ASCII spaces. | Leonard Richardson
2013-06-03 | Let's get some profiling going. | Leonard Richardson
2013-06-03 | Beautiful Soup will issue a warning if instead of markup you pass it a URL or the name of a file on disk (a common beginner mistake). | Leonard Richardson
2013-06-02 | Merged in big encoding-detection refactoring branch. | Leonard Richardson
2013-05-31 | Reverted the patch that gives NavigableString a .name property, because that's too big an API change for a bugfix release. | Leonard Richardson
2013-05-31 | Create a new lxml parser object for every new parsing strategy. | Leonard Richardson
2013-05-20 | A NavigableString object now has an immutable '.name' property whose value is always None. This makes it easier to iterate over a mixed list of tags and strings without having to check whether each element is a tag or a string. | Leonard Richardson
2013-05-20 | The .next_element attribute used during parsing was confusingly similar to the .next_element navigation attribute. Renamed the former to _most_recent_element. | Leonard Richardson
2013-05-20 | Gave new_string() the ability to create subclasses of NavigableString. [bug=1181986] | Leonard Richardson
2013-05-07 | Fixed up diagnose() and added it to the docs. | Leonard Richardson
2013-05-07 | Aliased the BeautifulSoup class to the easier-to-type "_s" and "_soup". | Leonard Richardson
2012-08-21 | Fixed a problem with the html5lib builder not handling comments correctly. | Leonard Richardson
2012-08-20 | Prep for release. | Leonard Richardson
2012-08-20 | Raise a more specific error (FeatureNotFound) when a requested parser or parser feature is not installed. Raise NotImplementedError instead of ValueError when the user calls insert_before() or insert_after() on the BeautifulSoup object itself. Patch by Aaron Devore. [bug=1038301] | Leonard Richardson
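
Several entries above (2013-06-03, 2013-10-01, 2014-12-10, 2016-07-16) concern the check that warns when a URL or file name is passed in place of markup. The following is a rough, standard-library-only sketch of how such a check could be structured. It is not the code in bs4/__init__.py: the function name `describe_suspicious_markup`, the 256-character cutoff for "short" input, and the message strings are all assumptions made for illustration.

```python
import os


def describe_suspicious_markup(markup, max_length=256):
    """Return a short message if `markup` looks like a URL or an existing
    file name, otherwise None.

    Hypothetical helper for illustration only; the name, the length
    cutoff, and the messages are assumptions, not Beautiful Soup's API.
    """
    # Only short text strings are plausible file names or URLs (assumed cutoff).
    if not isinstance(markup, str) or len(markup) > max_length:
        return None

    # A string with a URL scheme and no spaces is probably a URL, not markup.
    if markup.startswith(("http://", "https://")) and " " not in markup:
        return "looks like a URL"

    # A less-than character suggests a very small document, so the
    # file-on-disk check is skipped (compare the 2016-07-16 entry).
    if "<" in markup:
        return None

    try:
        is_file = os.path.exists(markup)
    except (ValueError, OSError):
        # Input containing data that is not valid in file names (such as
        # a NUL character) must not crash the check.
        is_file = False

    return "looks like a file name" if is_file else None
```

A caller could pass any non-None result to `warnings.warn()` before parsing. Returning a message rather than warning directly is a design choice made here only to keep the sketch easy to test.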