Age | Commit message (Collapse) | Author | |
---|---|---|---|
2016-07-30 | Explained why we test both unicode and bytestring processing instructions. | Leonard Richardson | |
2016-07-26 | Fixed a reported (but not duplicated) bug involving processing instructions ↵ | Leonard Richardson | |
fed into the lxml HTML parser. | |||
2016-07-26 | Spelling fixes | Ville Skyttä | |
2016-07-19 | Updated NEWS in preparation for release. | Leonard Richardson | |
2016-07-18 | If a search against each individual value of a multi-valued | Leonard Richardson | |
attribute fails, the search will be run one final time against the complete attribute value considered as a single string. [bug=1476868] | |||
2016-07-18 | Corrected an encoding error that happened when a BeautifulSoup | Leonard Richardson | |
object was copied. [bug=1554439] | |||
2016-07-18 | Added support for CSS selector values that contain quoted spaces, | Leonard Richardson | |
such as tag[style="display: foo"]. [bug=1540588] | |||
2016-07-18 | The argument to now works correctly, though it's | Leonard Richardson | |
not implemented very efficiently. [bug=1520530] | |||
2016-07-17 | When a BeautifulSoup object is pickled but its tree builder cannot | Leonard Richardson | |
be pickled, its .builder attribute is set to None instead of being destroyed. This avoids a performance problem once the object is unpickled. [bug=1523629] | |||
2016-07-17 | Although the previously fixed problem only occurs when using the html5lib ↵ | Leonard Richardson | |
tree builder, it's not actually a problem with the tree builder itself. | |||
2016-07-17 | Fixed a bug in the html5lib treebuilder that deranged the tree | Leonard Richardson | |
when a whitespace element was reparented into a tag that contained an identical whitespace element. [bug=1505351] | |||
2016-07-16 | Beautiful Soup will now work with versions of html5lib greater than | Leonard Richardson | |
0.99999999. [bug=1603299] | |||
2016-07-16 | We don't run the check for a filename passed in as markup if the | Leonard Richardson | |
'filename' contains a less-than character; the less-than character indicates it's most likely a very small document. [bug=1577864] | |||
2016-07-16 | Specify the file and line number when warning about a | Leonard Richardson | |
BeautifulSoup object being instantiated without a parser being specified. [bug=1574647] | |||
2016-07-16 | The contents of <textarea> tags will no longer be modified when the | Leonard Richardson | |
tree is prettified. [bug=1555829] | |||
2016-07-16 | Fixed a Python 3 ByteWarning when a URL was passed in as though it | Leonard Richardson | |
were markup. Thanks to James Salter for a patch and test. [bug=1533762] | |||
2015-09-28 | Fixed a parse bug with the html5lib tree-builder. Thanks to Roel | Leonard Richardson | |
Kramer for the patch. [bug=1483781] | |||
2015-09-28 | Improved the implementation of CSS selector grouping. Thanks to Orangain for ↵ | Leonard Richardson | |
the patch. [bug=1484543] | |||
2015-09-28 | Corrected the output of Declaration objects. [bug=1477847] | Leonard Richardson | |
2015-09-28 | Fixed a bug that deranged the tree when part of it was | Leonard Richardson | |
removed. Thanks to Eric Weiser for the patch and John Wiseman for a test. [bug=1481520] | |||
2015-07-05 | Fixed the test_detect_utf8 test so that it works when chardet is | Leonard Richardson | |
installed. [bug=1471359] | |||
2015-07-03 | Turns out setup.py requiring lxml was never in a released version which is a ↵ | Leonard Richardson | |
big relief as we don't need that anymore. | |||
2015-07-03 | Change setup.py to focus on creating wheels. | Leonard Richardson | |
2015-06-28 | It's now possible to pickle a BeautifulSoup object no matter which | Leonard Richardson | |
tree builder was used to create it. However, the only tree builder that survives the pickling process is the HTMLParserTreeBuilder ('html.parser'). If you unpickle a BeautifulSoup object created with some other tree builder, soup.builder will be None. [bug=1231545] | |||
2015-06-28 | Changed the way soup objects work under copy.copy(). Copying a | Leonard Richardson | |
NavigableString or a Tag will give you a new NavigableString that's equal to the old one but not connected to the parse tree. Patch by Martijn Peters. [bug=1307490] | |||
2015-06-28 | Copying a NavigableString will give you a new NavigableString that is not ↵ | Leonard Richardson | |
connected to the parse tree. | |||
2015-06-28 | Reorganized changelog. | Leonard Richardson | |
2015-06-28 | Fixed a bug where Element.extract() could create an infinite loop in | Leonard Richardson | |
the remaining tree. | |||
2015-06-28 | Raise a NotImplementedError whenever an unsupported CSS pseudoclass | Leonard Richardson | |
is used in select(). Previously some cases did not result in a NotImplementedError. | |||
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the | Leonard Richardson | |
Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408] | |||
2015-06-25 | Introduced the select_one() method, which uses a CSS selector but | Leonard Richardson | |
only returns the first match, instead of a list of matches. [bug=1349367] | |||
2015-06-25 | The text argument to the find_* methods is now called string, | Leonard Richardson | |
which is more accurate. text still works, but is the argument described in the documentation. text may eventually change its meaning, but not for a very long time. [bug=1366856] | |||
2015-06-25 | You can now create a NavigableString or a subclass just by invoking | Leonard Richardson | |
the constructor. [bug=1294315] | |||
2015-06-25 | Improved the exception raised when you call .unwrap() or | Leonard Richardson | |
.replace_with() on an element that's not attached to a tree. | |||
2015-06-25 | In Python 3, __str__ now returns a Unicode string instead | Leonard Richardson | |
of a bytestring. [bug=1420131] | |||
2015-06-25 | __repr__ now returns an ASCII bytestring in Python 2, and a Unicode string ↵ | Leonard Richardson | |
in Python 3, instead of a UTF8-encoded bytestring in both versions. [bug=1420131] | |||
2015-06-25 | Fixed a crash in Unicode, Dammit's encoding detector when the name | Leonard Richardson | |
of the encoding itself contained invalid bytes. [bug=1360913] | |||
2015-06-24 | Fixed an import error in Python 3.5 caused by the removal of the | Leonard Richardson | |
2015-06-24 | If the initial <html> tag contains a CDATA list attribute such as | Leonard Richardson | |
'class', the html5lib tree builder will now turn its value into a list, as it would with any other tag. [bug=1296481] | |||
2015-06-24 | The select() method can now find tags with attributes whose names | Leonard Richardson | |
contain dashes. Patch by Marek Kapolka. [bug=1304007] | |||
2015-06-24 | Improved docstring for encode_contents() and decode_contents(). [bug=1441543] | Leonard Richardson | |
2015-06-24 | Started using a standard MIT license. [bug=1294662] | Leonard Richardson | |
2015-06-23 | Fixed yet another bug that caused a disconnected tree when html5lib | Leonard Richardson | |
copied an element from one part of the tree to another. [bug=1270611] | |||
2015-06-23 | Force object_was_parsed() to keep the tree intact even when an element | Leonard Richardson | |
from later in the document is moved into place. [bug=1430633] | |||
2014-12-11 | Improved the lxml tree builder's handling of processing | Leonard Richardson | |
instructions. [bug=1294645] | |||
2014-12-11 | The select() method can now find tags whose names contain | Leonard Richardson | |
dashes. Patch by Francisco Canas [bug=1276211] | |||
2014-12-10 | The warning when you pass in a filename or URL as markup will now be | Leonard Richardson | |
displayed correctly even if the filename or URL is a Unicode string. [bug=1268888] | |||
2014-12-10 | The select() method now supports selector grouping. Patch by | Leonard Richardson | |
Francisco Canas [bug=1191917] | |||
2014-12-07 | In Python 3.4 and above, set the new convert_charrefs argument to | Leonard Richardson | |
the html.parser constructor to avoid a warning and future failures. Patch by Stefano Revera. [bug=1375721] | |||
2014-12-07 | Added a warning when you instantiate a BeautifulSoup object without | Leonard Richardson | |
explicitly naming a parser. [bug=1398866] |