Age | Commit message | Author
2015-11-23 | Fixing typo in example of nth-of-type css selector | Gene Wood
2015-09-28 | Add a __license__ statement to all source files. | Leonard Richardson
2015-09-28 | Fixed a parse bug with the html5lib tree-builder. Thanks to Roel Kramer for the patch. [bug=1483781] | Leonard Richardson
2015-09-28 | Improved the implementation of CSS selector grouping. Thanks to Orangain for the patch. [bug=1484543] | Leonard Richardson
2015-09-28 | Corrected the output of Declaration objects. [bug=1477847] | Leonard Richardson
2015-09-28 | Fixed a bug that deranged the tree when part of it was removed. Thanks to Eric Wieser for the patch and John Wiseman for a test. [bug=1481520] | Leonard Richardson
2015-09-28 | Don't allow inserting None into a tag. | Leonard Richardson
2015-08-06 | Use identity comparisons for tree traversal. Otherwise, different NavigableStrings compare equal. Fixes Bug #1481520 | Eric Wieser
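The identity-versus-equality problem behind this commit can be sketched in plain Python. This is only an illustration, not Beautiful Soup's code: `Text`, `index_by_equality`, and `index_by_identity` are hypothetical names, and `Text` is a bare `str` subclass standing in for a text node.

```python
class Text(str):
    """Hypothetical stand-in for a text node; not a Beautiful Soup class."""


def index_by_equality(siblings, node):
    # list.index() compares with ==, so it stops at the first equal string.
    return siblings.index(node)


def index_by_identity(siblings, node):
    # Compare with `is` so that only the exact object matches.
    for i, candidate in enumerate(siblings):
        if candidate is node:
            return i
    raise ValueError("node is not among the siblings")


first, second = Text("foo"), Text("foo")
siblings = [first, Text("bar"), second]

print(index_by_equality(siblings, second))  # 0: lands on `first`, the wrong node
print(index_by_identity(siblings, second))  # 2: lands on `second` itself
```

Two text nodes with the same content are equal as strings, so any traversal that locates a node with `==` can pick up the wrong sibling.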
2015-07-05 | Fixed the test_detect_utf8 test so that it works when chardet is installed. [bug=1471359] | Leonard Richardson
2015-07-05 | Added reference to old 'text' name to documentation. | Leonard Richardson
2015-07-03 | Added instructions for final post-release test. | Leonard Richardson
2015-07-03 | Turns out setup.py requiring lxml was never in a released version which is a big relief as we don't need that anymore. | Leonard Richardson
2015-07-03 | Change setup.py to focus on creating wheels. | Leonard Richardson
2015-07-03 | Unicode data cannot have a byte-order mark. Returning early stops a warning from happening. | Leonard Richardson
2015-06-28 | Also include convert-py3k in source distributions. [bug=1304006] | Leonard Richardson
2015-06-28 | Added test-all-versions and the Chinese docs to the manifest. | Leonard Richardson
2015-06-28 | It's now possible to pickle a BeautifulSoup object no matter which tree builder was used to create it. However, the only tree builder that survives the pickling process is the HTMLParserTreeBuilder ('html.parser'). If you unpickle a BeautifulSoup object created with some other tree builder, soup.builder will be None. [bug=1231545] | Leonard Richardson
2015-06-28 | Changed the way soup objects work under copy.copy(). Copying a NavigableString or a Tag will give you a new NavigableString that's equal to the old one but not connected to the parse tree. Patch by Martijn Peters. [bug=1307490] | Leonard Richardson
2015-06-28 | Copying a NavigableString will give you a new NavigableString that is not connected to the parse tree. | Leonard Richardson
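The "equal but detached" copy behaviour described in the two entries above can be sketched with the standard `copy` module. This is a minimal illustration under stated assumptions, not the library's implementation: `Node` and its `parent` attribute are hypothetical, and a plain list stands in for the parent tag.

```python
import copy


class Node(str):
    """Hypothetical text node: a str that remembers its parent container."""

    parent = None

    def __copy__(self):
        # Build a fresh node with the same text but do not carry over
        # `parent`, so the copy is detached from the tree.
        return type(self)(str(self))


tree = []                  # stand-in for a parent tag
original = Node("hello")
original.parent = tree
tree.append(original)

duplicate = copy.copy(original)

print(duplicate == original)  # True: same text
print(duplicate is original)  # False: a different object
print(duplicate.parent)       # None: not attached to `tree`
```

Without the custom `__copy__`, `copy.copy()` would also copy the instance attributes, so the duplicate would still point at the original parent.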
2015-06-28 | Reorganized changelog. | Leonard Richardson
2015-06-28 | Fixed a bug where Element.extract() could create an infinite loop in the remaining tree. | Leonard Richardson
2015-06-28 | Accept 'xml' as an unambiguous identifier for the lxml XML parser, since it's the only XML parser supported at the moment. | Leonard Richardson
2015-06-28 | Raise a NotImplementedError whenever an unsupported CSS pseudoclass is used in select(). Previously some cases did not result in a NotImplementedError. | Leonard Richardson
2015-06-27 | Added an example of using a function on an attribute value/using a function to invert a normal search. | Leonard Richardson
2015-06-27 | Added another layer of security to catch cases where lxml and html5lib are not installed. | Leonard Richardson
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408] | Leonard Richardson
2015-06-26 | Added a sanity check helper method that makes sure all the elements of a tree are properly connected via .next_element and .previous_element. | Leonard Richardson
2015-06-25 | Introduced the select_one() method, which uses a CSS selector but only returns the first match, instead of a list of matches. [bug=1349367] | Leonard Richardson
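A short sketch of how select_one() differs from select(), assuming a Beautiful Soup release that includes this commit is installed as `bs4`. The markup and variable names are made up for illustration.

```python
from bs4 import BeautifulSoup

html = '<ul><li class="item">one</li><li class="item">two</li></ul>'
soup = BeautifulSoup(html, "html.parser")

matches = soup.select("li.item")         # every matching tag, as a list
first = soup.select_one("li.item")       # only the first matching tag
missing = soup.select_one("li.absent")   # nothing matches this selector

print(len(matches))      # 2
print(first.get_text())  # one
print(missing)           # None
```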
2015-06-25 | The text argument to the find_* methods is now called string, which is more accurate. text still works, but string is the argument described in the documentation. text may eventually change its meaning, but not for a very long time. [bug=1366856] | Leonard Richardson
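A brief sketch of the renamed argument, again assuming a Beautiful Soup release that includes this commit is installed as `bs4`. The markup is invented for illustration, and only the `string` spelling is shown.

```python
from bs4 import BeautifulSoup

soup = BeautifulSoup("<p>alpha</p><p>beta</p>", "html.parser")

# With a tag name, `string` filters on the tag's text content.
tags = soup.find_all("p", string="beta")
print([tag.get_text() for tag in tags])  # ['beta']

# Without a tag name, the text node itself is returned.
node = soup.find(string="beta")
print(node)  # beta
```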
2015-06-25 | Make it possible to invoke the Tag() constructor without providing a builder. [bug=1307471] | Leonard Richardson
2015-06-25 | You can now create a NavigableString or a subclass just by invoking the constructor. [bug=1294315] | Leonard Richardson
2015-06-25 | Improved the exception raised when you call .unwrap() or .replace_with() on an element that's not attached to a tree. | Leonard Richardson
2015-06-25 | In Python 3, __str__ now returns a Unicode string instead of a bytestring. [bug=1420131] | Leonard Richardson
2015-06-25 | __repr__ now returns an ASCII bytestring in Python 2, and a Unicode string in Python 3, instead of a UTF8-encoded bytestring in both versions. [bug=1420131] | Leonard Richardson
2015-06-25 | Fixed a crash in Unicode, Dammit's encoding detector when the name of the encoding itself contained invalid bytes. [bug=1360913] | Leonard Richardson
2015-06-25 | Fixed syntax errors in documentation. | Leonard Richardson
2015-06-25 | Merge in HEAD. | Leonard Richardson
2015-06-25 | Put the Chinese version of the documentation into its own Sphinx build to stop identifiers from conflicting. [bug=1401862] | Leonard Richardson
2015-06-24 | Fixed an import error in Python 3.5 caused by the removal of the | Leonard Richardson
2015-06-24 | Made double sure that we don't use the 'strict' constructor argument when it's deprecated. [bug=1341055] | Leonard Richardson
2015-06-24 | If the initial <html> tag contains a CDATA list attribute such as 'class', the html5lib tree builder will now turn its value into a list, as it would with any other tag. [bug=1296481] | Leonard Richardson
2015-06-24 | The select() method can now find tags with attributes whose names contain dashes. Patch by Marek Kapolka. [bug=1304007] | Leonard Richardson
2015-06-24 | Improved docstring for encode_contents() and decode_contents(). [bug=1441543] | Leonard Richardson
2015-06-24 | Started using a standard MIT license. [bug=1294662] | Leonard Richardson
2015-06-23 | Fixed yet another bug that caused a disconnected tree when html5lib copied an element from one part of the tree to another. [bug=1270611] | Leonard Richardson
2015-06-23 | Made the previous fix nicer by adding arguments to setup() that let us preserve a tag's existing place in the tree. | Leonard Richardson
2015-06-23 | Got a hacky fix for the latest html5lib problem. | Leonard Richardson
2015-06-23 | Force object_was_parsed() to keep the tree intact even when an element from later in the document is moved into place. [bug=1430633] | Leonard Richardson
2014-12-11 | Improved the lxml tree builder's handling of processing instructions. [bug=1294645] | Leonard Richardson
2014-12-11 | The select() method can now find tags whose names contain dashes. Patch by Francisco Canas [bug=1276211] | Leonard Richardson