Age | Commit message (Collapse) | Author | |
---|---|---|---|
2013-08-15 | Make sure the optimized find_all() ResultSets actually contain the right data. | Leonard Richardson | |
2013-08-13 | * Fixed yet another problem with the html5lib tree builder, caused by | Leonard Richardson | |
html5lib's tendency to rearrange the tree during parsing. [bug=1189267] | |||
2013-08-12 | All find_all calls should now return a ResultSet object. Patch by | Leonard Richardson | |
Aaron DeVore. [bug=1194034] | |||
2013-06-03 | A NavigableString object now has an immutable '.name' property whose | Leonard Richardson | |
+ value is always None. This makes it easier to iterate over a mixed + list of tags and strings without having to check whether each + element is a tag or a string. | |||
2013-06-03 | Let's get some profiling going. | Leonard Richardson | |
2013-06-03 | Test that the filename warning isn't given unless the file actually exists ↵ | Leonard Richardson | |
on disk. | |||
2013-06-03 | Beautiful Soup will issue a warning if instead of markup you pass it | Leonard Richardson | |
a URL or the name of a file on disk (a common beginner mistake). | |||
2013-06-02 | Merged in big encoding-detection refactoring branch. | Leonard Richardson | |
2013-06-02 | Turns out we had two bits of code to strip byte-order marks. | Leonard Richardson | |
2013-06-02 | It turns out most of the untested code wasn't doing anything useful. | Leonard Richardson | |
2013-05-31 | Reverted the patch that gives NavigableString a .name property, because ↵ | Leonard Richardson | |
that's too big an API change for a bugfix release. | |||
2013-05-31 | Create a new lxml parser object for every new parsing strategy. | Leonard Richardson | |
2013-05-30 | Split out the code that guesses at encodings from the code that tries to ↵ | Leonard Richardson | |
decode a bytestring based on those encodings. This is necessary because lxml wants to do the decoding itself. | |||
2013-05-20 | The default XML formatter will now replace ampersands even if they appear to ↵ | Leonard Richardson | |
be part of entities. That is, "<" will become "&lt;".[bug=1182183] | |||
2013-05-20 | A NavigableString object now has an immutable '.name' property whose | Leonard Richardson | |
value is always None. This makes it easier to iterate over a mixed list of tags and strings without having to check whether each element is a tag or a string. | |||
2013-05-20 | Gave new_string() the ability to create subclasses of | Leonard Richardson | |
NavigableString. [bug=1181986] | |||
2013-05-20 | html5lib now supports Python 3. Fixed some Python 2-specific | Leonard Richardson | |
code in the html5lib test suite. [bug=1181624] | |||
2013-05-20 | Fixed test failures when lxml is not installed. | Leonard Richardson | |
2013-05-15 | Merge. | Leonard Richardson | |
2013-05-14 | Added a deprecation warning to has_key(). | Leonard Richardson | |
2013-05-09 | Changed lxml.feed() to handle the eventuality that it may be given a bytestring. | Leonard Richardson | |
2013-05-08 | A CSS selector should never match the same tag twice. | Leonard Richardson | |
2013-05-08 | Added tests. | Leonard Richardson | |
2013-05-08 | Aaand... it's now trivial to implement sibling selectors. | Leonard Richardson | |
2013-05-08 | OK, the tests pass. | Leonard Richardson | |
2013-05-08 | We're getting there. | Leonard Richardson | |
2013-05-07 | Fixed an exception when an overspecified CSS selector didn't match | Leonard Richardson | |
anything. Code by Stefaan Lippens. [bug=1168167] | |||
2013-05-07 | Added support for the "nth-of-type" CSS selector. The CSS selector ">" can ↵ | Leonard Richardson | |
now find a tag by means other than the tag name. Code by Sven Slootweg. | |||
2013-05-07 | The prettify() method now leaves the contents of <pre> tags | Leonard Richardson | |
alone. [bug=1095654] | |||
2013-05-07 | Improved detection of lxml version number. | Leonard Richardson | |
2013-05-07 | Now that lxml's segfault on invalid doctype has been fixed, fix a | Leonard Richardson | |
corresponding problem on the Beautiful Soup end that was previously invisible. [bug=984936] | |||
2013-05-06 | Methods like get_text() and properties like .strings now only give | Leonard Richardson | |
you strings that are visible in the document--no comments or processing commands. [bug=1050164] | |||
2013-05-06 | Fix a bug by which keyword arguments to find_parent() were not being passed ↵ | Leonard Richardson | |
on. [bug=1126734] | |||
2013-05-06 | In an HTML document, the contents of a <script> or <style> tag will | Leonard Richardson | |
no longer undergo entity substitution by default. XML documents work the same way they did before. [bug=1085953] | |||
2013-05-06 | Added failing test. | Leonard Richardson | |
2012-08-21 | Fixed a problem with the html5lib builder not handling comments correctly. | Leonard Richardson | |
2012-08-20 | Python 3.1 also needs to skip the unicode attribute name test. | Leonard Richardson | |
2012-08-20 | Raise a more specific error (FeatureNotFound) when a requested | Leonard Richardson | |
parser or parser feature is not installed. Raise NotImplementedError instead of ValueError when the user calls insert_before() or insert_after() on the BeautifulSoup object itself. Patch by Aaron Devore. [bug=1038301] | |||
2012-08-20 | Skipped a test under Python 2.6 to avoid a spurious test failure. [bug=1038503] | Leonard Richardson | |
2012-08-17 | Okay, I'll use assertEqual instead. | Leonard Richardson | |
2012-08-17 | Fixed a crash on encoding when an attribute name contained | Leonard Richardson | |
non-ASCII characters. | |||
2012-08-16 | As per PEP-8, allow searching by CSS class using the 'class_' | Leonard Richardson | |
keyword argument. [bug=1037624] | |||
2012-07-03 | Mentioned cchardet in docs. | Leonard Richardson | |
2012-07-03 | When sniffing encodings, if the cchardet library is installed, use it ↵ | Leonard Richardson | |
instead of chardet. It's much faster. [bug=1020748] | |||
2012-07-03 | Use logging.warning() instead of warning.warn() to notify the user that ↵ | Leonard Richardson | |
characters were replaced with REPLACEMENT CHARACTER. [bug=1013862] | |||
2012-05-24 | Fixed the inability to search for non-ASCII attribute | Leonard Richardson | |
values. [bug=1003974] This caused a major refactoring of the search code. All the tests pass, but it's possible that some searches will behave differently. | |||
2012-05-24 | Fixed the basic failure in [bug=1003974], but not more advanced cases. | Leonard Richardson | |
2012-05-24 | Fixed some edge-case bugs having to do with inserting an element | Leonard Richardson | |
into a tag it's already inside, and replacing one of a tag's children with another. [bug=997529] | |||
2012-05-24 | Fixed a bug with the lxml treebuilder that prevented the user from adding ↵ | Leonard Richardson | |
attributes to a tag that didn't originally have any. [bug=1002378] Thanks to Oliver Beattie for the patch. | |||
2012-05-03 | Fixed the handling of " with the built-in parser. [bug=993871] | Leonard Richardson | |