Age | Commit message | Author | |
---|---|---|---|
2013-08-15 | Make sure the optimized find_all() ResultSets actually contain the right data. | Leonard Richardson | |
2013-08-13 | Fixed yet another problem with the html5lib tree builder, caused by html5lib's tendency to rearrange the tree during parsing. [bug=1189267] | Leonard Richardson | |
2013-08-12 | Fixed incorrect superclass in super() Call. | Leonard Richardson | |
2013-08-12 | All find_all calls should now return a ResultSet object. Patch by Aaron DeVore. [bug=1194034] | Leonard Richardson | |
2013-06-03 | A NavigableString object now has an immutable '.name' property whose value is always None. This makes it easier to iterate over a mixed list of tags and strings without having to check whether each element is a tag or a string. | Leonard Richardson | |
2013-06-03 | _last_descendant can be optimized in some cases. | Leonard Richardson | |
2013-06-03 | Improved performance for html5lib. | Leonard Richardson | |
2013-06-03 | Improved performance of _replace_cdata_list_attribute_values, and greatly reduced the number of times it is called. | Leonard Richardson | |
2013-05-31 | Reverted the patch that gives NavigableString a .name property, because that's too big an API change for a bugfix release. | Leonard Richardson | |
2013-05-20 | A NavigableString object now has an immutable '.name' property whose value is always None. This makes it easier to iterate over a mixed list of tags and strings without having to check whether each element is a tag or a string. | Leonard Richardson | |
2013-05-14 | Added a deprecation warning to has_key(). | Leonard Richardson | |
2013-05-08 | A CSS selector should never match the same tag twice. | Leonard Richardson | |
2013-05-08 | Minor cleanup. | Leonard Richardson | |
2013-05-08 | Fixed terminology. | Leonard Richardson | |
2013-05-08 | Moved select() to Tag. It was always an error to call select() on a string, so there's no reason for it to be in PageElement. | Leonard Richardson | |
2013-05-08 | Give the checker the ability to stop the iteration over the generator by raising StopIteration. | Leonard Richardson | |
2013-05-08 | Aaand... it's now trivial to implement sibling selectors. | Leonard Richardson | |
2013-05-08 | Once again, we're back to the steady state. | Leonard Richardson | |
2013-05-08 | Got it all working again except for nth_child_of_type. | Leonard Richardson | |
2013-05-08 | Refactored again to use iterators instead of calling find_all(). | Leonard Richardson | |
2013-05-08 | OK, the tests pass. | Leonard Richardson | |
2013-05-08 | Almost there. | Leonard Richardson | |
2013-05-08 | We're getting there. | Leonard Richardson | |
2013-05-08 | Fixing test failures. | Leonard Richardson | |
2013-05-08 | Initial refactoring. | Leonard Richardson | |
2013-05-07 | Since the string part of a NavigableString is immutable, gave it a simpler __copy__ implementation. [bug=682685] | Leonard Richardson | |
2013-05-07 | Fixed an exception when an overspecified CSS selector didn't match anything. Code by Stefaan Lippens. [bug=1168167] | Leonard Richardson | |
2013-05-07 | Added support for the "nth-of-type" CSS selector. The CSS selector ">" can now find a tag by means other than the tag name. Code by Sven Slootweg. | Leonard Richardson | |
2013-05-07 | The prettify() method now leaves the contents of <pre> tags alone. [bug=1095654] | Leonard Richardson | |
2013-05-07 | Merged. | Leonard Richardson | |
2013-05-07 | Now that lxml's segfault on invalid doctype has been fixed, fix a corresponding problem on the Beautiful Soup end that was previously invisible. [bug=984936] | Leonard Richardson | |
2013-05-06 | Stop a crash when unwisely messing with a tag that's been decomposed. [bug=1097699] | Leonard Richardson | |
2013-05-06 | Methods like get_text() and properties like .strings now only give you strings that are visible in the document--no comments or processing commands. [bug=1050164] | Leonard Richardson | |
2013-05-06 | Fix a bug by which keyword arguments to find_parent() were not being passed on. [bug=1126734] | Leonard Richardson | |
2013-05-06 | In an HTML document, the contents of a <script> or <style> tag will no longer undergo entity substitution by default. XML documents work the same way they did before. [bug=1085953] | Leonard Richardson | |
2012-10-11 | Fix a bug in the lxml treebuilder which crashed when a tag included an attribute from the predefined xml: namespace. [bug=1065617] | Leonard Richardson | |
2012-08-17 | Got rid of a UnicodeDecodeError when attribute names contain non-ASCII characters. | Leonard Richardson | |
2012-08-16 | As per PEP-8, allow searching by CSS class using the 'class_' keyword argument. [bug=1037624] | Leonard Richardson | |
2012-07-03 | get_text() now returns an empty Unicode string if there is no text, rather than an empty bytestring. [bug=1020387] | Leonard Richardson | |
2012-07-02 | Correctly handle closing tags with an XML namespace declared. Patch by Andreas Kostyrka. [bug=1019635] | Leonard Richardson | |
2012-05-24 | Fixed the inability to search for non-ASCII attribute values. [bug=1003974] This caused a major refactoring of the search code. All the tests pass, but it's possible that some searches will behave differently. | Leonard Richardson | |
2012-05-24 | Fixed the basic failure in [bug=1003974], but not more advanced cases. | Leonard Richardson | |
2012-05-24 | Fixed some edge-case bugs having to do with inserting an element into a tag it's already inside, and replacing one of a tag's children with another. [bug=997529] | Leonard Richardson | |
2012-05-24 | Comments, processing instructions, document type declarations, and markup declarations are now treated as preformatted strings, the way CData blocks are. [bug=1001025] Also in this commit: renamed detwingle method to detwingle(). | Leonard Richardson | |
2012-04-26 | Added a new method, wrap(). | Leonard Richardson | |
2012-04-26 | Renamed replace_with_children() to the jQuery name, unwrap(). | Leonard Richardson | |
2012-04-26 | Upon document generation, CData objects are no longer run through the formatter. [bug=988905] | Leonard Richardson | |
2012-04-26 | The test suite now passes when lxml is not installed, whether or not html5lib is installed. [bug=987004] | Leonard Richardson | |
2012-04-18 | Got rid of contains_substitutions. | Leonard Richardson | |
2012-04-18 | Made encoding substitution in <meta> tags completely transparent (no more %SOUP-ENCODING%). | Leonard Richardson | |