Age | Commit message (Collapse) | Author | |
---|---|---|---|
2015-09-28 | Add a __license__ statement to all source files. | Leonard Richardson | |
2015-09-28 | Fixed a parse bug with the html5lib tree-builder. Thanks to Roel | Leonard Richardson | |
Kramer for the patch. [bug=1483781] | |||
2015-09-28 | Improved the implementation of CSS selector grouping. Thanks to Orangain for ↵ | Leonard Richardson | |
the patch. [bug=1484543] | |||
2015-09-28 | Corrected the output of Declaration objects. [bug=1477847] | Leonard Richardson | |
2015-09-28 | Fixed a bug that deranged the tree when part of it was | Leonard Richardson | |
removed. Thanks to Eric Weiser for the patch and John Wiseman for a test. [bug=1481520] | |||
2015-09-28 | Don't allow inserting None into a tag. | Leonard Richardson | |
2015-08-06 | Use identity comparisons for tree traversal | Eric Wieser | |
Otherwise, different NavigableStrings compare equal. Fixes Bug #1481520 | |||
2015-07-05 | Fixed the test_detect_utf8 test so that it works when chardet is | Leonard Richardson | |
installed. [bug=1471359] | |||
2015-07-03 | Change setup.py to focus on creating wheels. | Leonard Richardson | |
2015-07-03 | Unicode data cannot have a byte-order mark. Returning early stops a warning ↵ | Leonard Richardson | |
from happening. | |||
2015-06-28 | It's now possible to pickle a BeautifulSoup object no matter which | Leonard Richardson | |
tree builder was used to create it. However, the only tree builder that survives the pickling process is the HTMLParserTreeBuilder ('html.parser'). If you unpickle a BeautifulSoup object created with some other tree builder, soup.builder will be None. [bug=1231545] | |||
2015-06-28 | Changed the way soup objects work under copy.copy(). Copying a | Leonard Richardson | |
NavigableString or a Tag will give you a new NavigableString that's equal to the old one but not connected to the parse tree. Patch by Martijn Peters. [bug=1307490] | |||
2015-06-28 | Copying a NavigableString will give you a new NavigableString that is not ↵ | Leonard Richardson | |
connected to the parse tree. | |||
2015-06-28 | Reorganized changelog. | Leonard Richardson | |
2015-06-28 | Fixed a bug where Element.extract() could create an infinite loop in | Leonard Richardson | |
the remaining tree. | |||
2015-06-28 | Accept 'xml' as an unambiguous identifier for the lxml XML parser, since ↵ | Leonard Richardson | |
it's the only XML parser supported at the moment. | |||
2015-06-28 | Raise a NotImplementedError whenever an unsupported CSS pseudoclass | Leonard Richardson | |
is used in select(). Previously some cases did not result in a NotImplementedError. | |||
2015-06-27 | Added another layer of security to catch cases where lxml and html5lib are ↵ | Leonard Richardson | |
not installed. | |||
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the | Leonard Richardson | |
Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408] | |||
2015-06-26 | Added a sanity check helper method that makes sure all the elements of a ↵ | Leonard Richardson | |
tree are properly connected via .next_element and .previous_element. | |||
2015-06-25 | Introduced the select_one() method, which uses a CSS selector but | Leonard Richardson | |
only returns the first match, instead of a list of matches. [bug=1349367] | |||
2015-06-25 | The text argument to the find_* methods is now called string, | Leonard Richardson | |
which is more accurate. text still works, but is the argument described in the documentation. text may eventually change its meaning, but not for a very long time. [bug=1366856] | |||
2015-06-25 | Make it possible to invoke the Tag() constructor without providing a ↵ | Leonard Richardson | |
builder. [bug=1307471] | |||
2015-06-25 | You can now create a NavigableString or a subclass just by invoking | Leonard Richardson | |
the constructor. [bug=1294315] | |||
2015-06-25 | Improved the exception raised when you call .unwrap() or | Leonard Richardson | |
.replace_with() on an element that's not attached to a tree. | |||
2015-06-25 | __repr__ now returns an ASCII bytestring in Python 2, and a Unicode string ↵ | Leonard Richardson | |
in Python 3, instead of a UTF8-encoded bytestring in both versions. [bug=1420131] | |||
2015-06-25 | Fixed a crash in Unicode, Dammit's encoding detector when the name | Leonard Richardson | |
of the encoding itself contained invalid bytes. [bug=1360913] | |||
2015-06-24 | Fixed an import error in Python 3.5 caused by the removal of the | Leonard Richardson | |
2015-06-24 | Made double sure that we don't use the 'strict' constructor argument when ↵ | Leonard Richardson | |
it's deprecated. [bug=1341055] | |||
2015-06-24 | If the initial <html> tag contains a CDATA list attribute such as | Leonard Richardson | |
'class', the html5lib tree builder will now turn its value into a list, as it would with any other tag. [bug=1296481] | |||
2015-06-24 | The select() method can now find tags with attributes whose names | Leonard Richardson | |
contain dashes. Patch by Marek Kapolka. [bug=1304007] | |||
2015-06-24 | Improved docstring for encode_contents() and decode_contents(). [bug=1441543] | Leonard Richardson | |
2015-06-23 | Made the previous fix nicer by adding arguments to setup() that let us ↵ | Leonard Richardson | |
preserve a tag's existing place in the tree. | |||
2015-06-23 | Got a hacky fix for the latest html5lib problem. | Leonard Richardson | |
2015-06-23 | Force object_was_parsed() to keep the tree intact even when an element | Leonard Richardson | |
from later in the document is moved into place. [bug=1430633] | |||
2014-12-11 | Improved the lxml tree builder's handling of processing | Leonard Richardson | |
instructions. [bug=1294645] | |||
2014-12-11 | The select() method can now find tags whose names contain | Leonard Richardson | |
dashes. Patch by Francisco Canas [bug=1276211] | |||
2014-12-10 | The warning when you pass in a filename or URL as markup will now be | Leonard Richardson | |
displayed correctly even if the filename or URL is a Unicode string. [bug=1268888] | |||
2014-12-10 | The select() method now supports selector grouping. Patch by | Leonard Richardson | |
Francisco Canas [bug=1191917] | |||
2014-12-07 | In Python 3.4 and above, set the new convert_charrefs argument to | Leonard Richardson | |
the html.parser constructor to avoid a warning and future failures. Patch by Stefano Revera. [bug=1375721] | |||
2014-12-07 | Tweaked the parser warning. | Leonard Richardson | |
2014-12-07 | Issue a warning if the BeautifulSoup constructor arguments do not explicitly ↵ | Leonard Richardson | |
name a parser. | |||
2013-10-18 | Fixed yet another problem that caused the html5lib tree builder to | Leonard Richardson | |
create a disconnected parse tree. [bug=1237763] | |||
2013-10-02 | Restored the helpful syntax error that happens when you try to | Leonard Richardson | |
import the Python 2 edition of Beautiful Soup under Python 3. [bug=1213387] | |||
2013-10-02 | Prep for release. | Leonard Richardson | |
2013-10-02 | Fixed a bug that caused Unicode data put into UnicodeDammit to | Leonard Richardson | |
return None instead of the original data. [bug=1214983] | |||
2013-10-01 | Fixed a crash when a short input contains data not valid in | Leonard Richardson | |
filenames. [bug=1232604] | |||
2013-10-01 | Fixed a bug in which short Unicode input was improperly encoded to ASCII ↵ | Leonard Richardson | |
when checking whether or not it was a file on disk. [bug=1227016] | |||
2013-08-19 | Combined two tests to stop a spurious test failure when tests are | Leonard Richardson | |
run by nodetests. [bug=1212445] | |||
2013-08-15 | Bumped version number. | Leonard Richardson | |