summaryrefslogtreecommitdiff
path: root/bs4/__init__.py
AgeCommit message (Collapse)Author
2016-07-16Specify the file and line number when warning about aLeonard Richardson
BeautifulSoup object being instantiated without a parser being specified. [bug=1574647]
2016-07-16Fixed a Python 3 ByteWarning when a URL was passed in as though itLeonard Richardson
were markup. Thanks to James Salter for a patch and test. [bug=1533762]
2016-07-16Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file.Leonard Richardson
2015-07-03Change setup.py to focus on creating wheels.Leonard Richardson
2015-06-28 It's now possible to pickle a BeautifulSoup object no matter whichLeonard Richardson
tree builder was used to create it. However, the only tree builder that survives the pickling process is the HTMLParserTreeBuilder ('html.parser'). If you unpickle a BeautifulSoup object created with some other tree builder, soup.builder will be None. [bug=1231545]
2015-06-28Changed the way soup objects work under copy.copy(). Copying aLeonard Richardson
NavigableString or a Tag will give you a new NavigableString that's equal to the old one but not connected to the parse tree. Patch by Martijn Peters. [bug=1307490]
2015-06-28Reorganized changelog.Leonard Richardson
2015-06-28Accept 'xml' as an unambiguous identifier for the lxml XML parser, since ↵Leonard Richardson
it's the only XML parser supported at the moment.
2015-06-27Added an exclude_encodings argument to UnicodeDammit and to theLeonard Richardson
Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408]
2015-06-25You can now create a NavigableString or a subclass just by invokingLeonard Richardson
the constructor. [bug=1294315]
2015-06-23Made the previous fix nicer by adding arguments to setup() that let us ↵Leonard Richardson
preserve a tag's existing place in the tree.
2015-06-23Got a hacky fix for the latest html5lib problem.Leonard Richardson
2015-06-23Force object_was_parsed() to keep the tree intact even when an elementLeonard Richardson
from later in the document is moved into place. [bug=1430633]
2014-12-10The warning when you pass in a filename or URL as markup will now beLeonard Richardson
displayed correctly even if the filename or URL is a Unicode string. [bug=1268888]
2014-12-07Tweaked the parser warning.Leonard Richardson
2014-12-07Issue a warning if the BeautifulSoup constructor arguments do not explicitly ↵Leonard Richardson
name a parser.
2013-10-02Restored the helpful syntax error that happens when you try toLeonard Richardson
import the Python 2 edition of Beautiful Soup under Python 3. [bug=1213387]
2013-10-02Prep for release.Leonard Richardson
2013-10-01 Fixed a crash when a short input contains data not valid inLeonard Richardson
filenames. [bug=1232604]
2013-10-01Fixed a bug in which short Unicode input was improperly encoded to ASCII ↵Leonard Richardson
when checking whether or not it was a file on disk. [bug=1227016]
2013-08-15Bumped version number.Leonard Richardson
2013-08-13* Fixed yet another problem with the html5lib tree builder, caused byLeonard Richardson
html5lib's tendency to rearrange the tree during parsing. [bug=1189267]
2013-08-12A little cleanup.Leonard Richardson
2013-06-03Changed _popToTag to run through a single range instead of two.Leonard Richardson
2013-06-03Improved _popToTag a tiny bit.Leonard Richardson
2013-06-03Inlined some commonly called code to save a function call.Leonard Richardson
2013-06-03Made it a lot faster to check whether whitespace is being preserved.Leonard Richardson
2013-06-03Put the more frequently-used ASCII spaces in front.Leonard Richardson
2013-06-03Wrote a more efficient replacement for string.translate() when checking ↵Leonard Richardson
whether a string is nothing but ASCII spaces.
2013-06-03Let's get some profiling going.Leonard Richardson
2013-06-03Beautiful Soup will issue a warning if instead of markup you pass itLeonard Richardson
a URL or the name of a file on disk (a common beginner mistake).
2013-06-02Merged in big encoding-detection refactoring branch.Leonard Richardson
2013-05-31Reverted the patch that gives NavigableString a .name property, because ↵Leonard Richardson
that's too big an API change for a bugfix release.
2013-05-31Create a new lxml parser object for every new parsing strategy.Leonard Richardson
2013-05-20A NavigableString object now has an immutable '.name' property whoseLeonard Richardson
value is always None. This makes it easier to iterate over a mixed list of tags and strings without having to check whether each element is a tag or a string.
2013-05-20The .next_element attribute used during parsing was confusingly similar to ↵Leonard Richardson
the .next_element navigation attribute. Renamed the former to _most_recent_element.
2013-05-20Gave new_string() the ability to create subclasses ofLeonard Richardson
NavigableString. [bug=1181986]
2013-05-07Fixed up diagnose() and added it to the docs.Leonard Richardson
2013-05-07Aliased the BeautifulSoup class to the easier-to-type "_s" and "_soup".Leonard Richardson
2012-08-21Fixed a problem with the html5lib builder not handling comments correctly.Leonard Richardson
2012-08-20Prep for release.Leonard Richardson
2012-08-20Raise a more specific error (FeatureNotFound) when a requestedLeonard Richardson
parser or parser feature is not installed. Raise NotImplementedError instead of ValueError when the user calls insert_before() or insert_after() on the BeautifulSoup object itself. Patch by Aaron Devore. [bug=1038301]
2012-08-17Prep for release.Leonard Richardson
2012-07-03Prep for release.Leonard Richardson
2012-07-02Fixed a typo that made parsing much slower than it should have been. ↵Leonard Richardson
[bug=1020268]
2012-05-29Prep for release.Leonard Richardson
2012-04-27Prep for release.Leonard Richardson
2012-04-16Prep for release.Leonard Richardson
2012-04-16Give a more useful error when the user tries to run the Python 2 version of ↵Leonard Richardson
BS under Python 3.
2012-04-03Prep for release.Leonard Richardson