summaryrefslogtreecommitdiff
path: root/bs4/element.py
AgeCommit message (Collapse)Author
2013-05-07Since the string part of a NavigableString is immutable, gave it a simpler ↵Leonard Richardson
__copy__ implementation. [bug=682685]
2013-05-07Fixed an exception when an overspecified CSS selector didn't matchLeonard Richardson
anything. Code by Stefaan Lippens. [bug=1168167]
2013-05-07Added support for the "nth-of-type" CSS selector. The CSS selector ">" can ↵Leonard Richardson
now find a tag by means other than the tag name. Code by Sven Slootweg.
2013-05-07The prettify() method now leaves the contents of <pre> tagsLeonard Richardson
alone. [bug=1095654]
2013-05-07Merged.Leonard Richardson
2013-05-07Now that lxml's segfault on invalid doctype has been fixed, fix aLeonard Richardson
corresponding problem on the Beautiful Soup end that was previously invisible. [bug=984936]
2013-05-06Stop a crash when unwisely messing with a tag that's beenLeonard Richardson
decomposed. [bug=1097699]
2013-05-06Methods like get_text() and properties like .strings now only giveLeonard Richardson
you strings that are visible in the document--no comments or processing commands. [bug=1050164]
2013-05-06 Fix a bug by which keyword arguments to find_parent() were not being passed ↵Leonard Richardson
on. [bug=1126734]
2013-05-06In an HTML document, the contents of a <script> or <style> tag willLeonard Richardson
no longer undergo entity substitution by default. XML documents work the same way they did before. [bug=1085953]
2012-10-11Fix a bug in the lxml treebuilder which crashed when a tag includedLeonard Richardson
an attribute from the predefined xml: namespace. [bug=1065617]
2012-08-17Got rid of a UnicodeDecodeError when attribute names contain non-ASCII ↵Leonard Richardson
characters.
2012-08-16As per PEP-8, allow searching by CSS class using the 'class_'Leonard Richardson
keyword argument. [bug=1037624]
2012-07-03get_text() now returns an empty Unicode string if there is no text, rather ↵Leonard Richardson
than an empty bytestring. [bug=1020387]
2012-07-02Correctly handle closing tags with an XML namespace declared. Patch by ↵Leonard Richardson
Andreas Kostyrka. [bug=1019635]
2012-05-24 Fixed the inability to search for non-ASCII attributeLeonard Richardson
values. [bug=1003974] This caused a major refactoring of the search code. All the tests pass, but it's possible that some searches will behave differently.
2012-05-24Fixed the basic failure in [bug=1003974], but not more advanced cases.Leonard Richardson
2012-05-24 Fixed some edge-case bugs having to do with inserting an elementLeonard Richardson
into a tag it's already inside, and replacing one of a tag's children with another. [bug=997529]
2012-05-24Comments, processing instructions, document type declarations, and markup ↵Leonard Richardson
declarations are now treated as preformatted strings, the way CData blocks are. [bug=1001025] Also in this commit: renamed detwingle method to detwingle().
2012-04-26Added a new method, wrap().Leonard Richardson
2012-04-26Renamed replace_with_children() to the jQuery name, unwrap().Leonard Richardson
2012-04-26Upon document generation, CData objects are no longer run through the ↵Leonard Richardson
formatter. [bug=988905]
2012-04-26The test suite now passes when lxml is not installed, whether or not ↵Leonard Richardson
html5lib is installed. [bug=987004]
2012-04-18Got rid of contains_substitutions.Leonard Richardson
2012-04-18Made encoding substitution in <meta> tags completely transparent (no more ↵Leonard Richardson
%SOUP-ENCODING%).
2012-04-16Attribute values are now run through the provided output formatter. ↵Leonard Richardson
Previously they were always run through the 'minimal' formatter. [bug=980237]
2012-04-16 Fixed a bug with the string setter that moved a string around theLeonard Richardson
tree instead of copying it. [bug=983050]
2012-04-11Added renderContents back.Leonard Richardson
2012-04-07Have objects_was_parsed set the previous element's next_element if possible. ↵Leonard Richardson
[bug=975926]
2012-03-15Fixed a bug where specifying 'text' while searching for a tag only worked if ↵Leonard Richardson
'text' specified an exact string match. [bug=955942]
2012-03-02Brought the soupselect port up to date.Leonard Richardson
2012-03-02Fixed a bug that caused calling a tag to sometimes call find_all() with the ↵Leonard Richardson
wrong arguments. [bug=944426]
2012-03-01Test that CSS selectors work within the tree as well as at the top level.Leonard Richardson
2012-03-01Got tests to pass on Python 2 and Python 3.Leonard Richardson
2012-03-01Initial port of code and tests.Leonard Richardson
2012-02-27Make prettify() return Unicode by default, so it will look nice when passed ↵Leonard Richardson
into print() under Python 3.
2012-02-26Fixed DOCTYPE handling.Leonard Richardson
2012-02-24Renamed Tag.nsprefix to Tag.prefix, for consistency with NamespacedAttribute.Leonard Richardson
2012-02-23Fixed handling of the closing of namespaced tags.Leonard Richardson
2012-02-23Bumped version number.Leonard Richardson
2012-02-23Namespaced attributes are equal if they correspond to the same string.Leonard Richardson
2012-02-23Merge from trunk and added tests.Leonard Richardson
2012-02-21Have lxml invert namespace maps as they come in and set each tag's prefix ↵Leonard Richardson
appropriately.
2012-02-21Added nsprefix argument to the tag class.Leonard Richardson
2012-02-21Merged from trunk.Leonard Richardson
2012-02-20Added code from 2.7's standard library so that the tests will run on Python 2.6.Leonard Richardson
2012-02-16It's a start, at least.Leonard Richardson
2012-02-16By default, turn unrecognized characters into numeric XML entity refs.Leonard Richardson
2012-02-15Added a kind of hacky way to interpret the restriction class='foo bar'. Stop ↵Leonard Richardson
generating a space before the slash that closes an empty-element tag.
2012-02-15The value of multi-valued attributes like class are always turned into a ↵Leonard Richardson
list, even if there's only one value.