Age | Commit message (Collapse) | Author | |
---|---|---|---|
2013-10-02 | Prep for release. | Leonard Richardson | |
2013-10-02 | Fixed a bug that caused Unicode data put into UnicodeDammit to | Leonard Richardson | |
return None instead of the original data. [bug=1214983] | |||
2013-10-01 | Fixed a crash when a short input contains data not valid in | Leonard Richardson | |
filenames. [bug=1232604] | |||
2013-10-01 | Fixed a bug in which short Unicode input was improperly encoded to ASCII ↵ | Leonard Richardson | |
when checking whether or not it was a file on disk. [bug=1227016] | |||
2013-08-19 | Fixed typo. | Leonard Richardson | |
2013-08-19 | Combined two tests to stop a spurious test failure when tests are | Leonard Richardson | |
run by nodetests. [bug=1212445] | |||
2013-08-15 | Bumped version number. | Leonard Richardson | |
2013-08-15 | Updated NEWS. | Leonard Richardson | |
2013-08-15 | Make sure the optimized find_all() ResultSets actually contain the right data. | Leonard Richardson | |
2013-08-13 | * Fixed yet another problem with the html5lib tree builder, caused by | Leonard Richardson | |
html5lib's tendency to rearrange the tree during parsing. [bug=1189267] | |||
2013-08-12 | Prep for release. | Leonard Richardson | |
2013-08-12 | Fixed incorrect superclass in super() Call. | Leonard Richardson | |
2013-08-12 | All find_all calls should now return a ResultSet object. Patch by | Leonard Richardson | |
Aaron DeVore. [bug=1194034] | |||
2013-08-12 | A little cleanup. | Leonard Richardson | |
2013-06-03 | Updated NEWS. | Leonard Richardson | |
2013-06-03 | A NavigableString object now has an immutable '.name' property whose | Leonard Richardson | |
+ value is always None. This makes it easier to iterate over a mixed + list of tags and strings without having to check whether each + element is a tag or a string. | |||
2013-06-03 | _last_descendant can be optimized in some cases. | Leonard Richardson | |
2013-06-03 | Save another Element creation. | Leonard Richardson | |
2013-06-03 | Improved performance for html5lib. | Leonard Richardson | |
2013-06-03 | Added raw html5lib to the list of parsers that get tested. | Leonard Richardson | |
2013-06-03 | Changed _popToTag to run through a single range instead of two. | Leonard Richardson | |
2013-06-03 | Improved _popToTag a tiny bit. | Leonard Richardson | |
2013-06-03 | Inlined some commonly called code to save a function call. | Leonard Richardson | |
2013-06-03 | Limit how much of the document is searched via regular expression for a ↵ | Leonard Richardson | |
declared encoding. | |||
2013-06-03 | Improved performance of _replace_cdata_list_attribute_values, and greatly ↵ | Leonard Richardson | |
reduced the number of times it is called. | |||
2013-06-03 | Made it a lot faster to check whether whitespace is being preserved. | Leonard Richardson | |
2013-06-03 | Put the more frequently-used ASCII spaces in front. | Leonard Richardson | |
2013-06-03 | Wrote a more efficient replacement for string.translate() when checking ↵ | Leonard Richardson | |
whether a string is nothing but ASCII spaces. | |||
2013-06-03 | Let's get some profiling going. | Leonard Richardson | |
2013-06-03 | Test that the filename warning isn't given unless the file actually exists ↵ | Leonard Richardson | |
on disk. | |||
2013-06-03 | Beautiful Soup will issue a warning if instead of markup you pass it | Leonard Richardson | |
a URL or the name of a file on disk (a common beginner mistake). | |||
2013-06-02 | Merged in big encoding-detection refactoring branch. | Leonard Richardson | |
2013-06-02 | Turns out we had two bits of code to strip byte-order marks. | Leonard Richardson | |
2013-06-02 | It turns out most of the untested code wasn't doing anything useful. | Leonard Richardson | |
2013-06-02 | Treat an lxml ParserError as a ParserRejectedMarkup. | Leonard Richardson | |
2013-05-31 | Prep for release. | Leonard Richardson | |
2013-05-31 | The html.parser treebuilder can now handle numeric attributes in | Leonard Richardson | |
text when the hexidecimal name of the attribute starts with a capital X. | |||
2013-05-31 | Reverted the patch that gives NavigableString a .name property, because ↵ | Leonard Richardson | |
that's too big an API change for a bugfix release. | |||
2013-05-31 | Create a new lxml parser object for every new parsing strategy. | Leonard Richardson | |
2013-05-30 | Refactored code a bit. | Leonard Richardson | |
2013-05-30 | Split out the code that guesses at encodings from the code that tries to ↵ | Leonard Richardson | |
decode a bytestring based on those encodings. This is necessary because lxml wants to do the decoding itself. | |||
2013-05-20 | The default XML formatter will now replace ampersands even if they appear to ↵ | Leonard Richardson | |
be part of entities. That is, "<" will become "&lt;".[bug=1182183] | |||
2013-05-20 | A NavigableString object now has an immutable '.name' property whose | Leonard Richardson | |
value is always None. This makes it easier to iterate over a mixed list of tags and strings without having to check whether each element is a tag or a string. | |||
2013-05-20 | The .previous_element of a BeautifulSoup object is now always None, | Leonard Richardson | |
2013-05-20 | The .next_element attribute used during parsing was confusingly similar to ↵ | Leonard Richardson | |
the .next_element navigation attribute. Renamed the former to _most_recent_element. | |||
2013-05-20 | Fixed another bug by which the html5lib tree builder could create a | Leonard Richardson | |
disconnected tree. [bug=1182089] | |||
2013-05-20 | Gave new_string() the ability to create subclasses of | Leonard Richardson | |
NavigableString. [bug=1181986] | |||
2013-05-20 | html5lib now supports Python 3. Fixed some Python 2-specific | Leonard Richardson | |
code in the html5lib test suite. [bug=1181624] | |||
2013-05-20 | Fixed test failures when lxml is not installed. | Leonard Richardson | |
2013-05-15 | How about actually parsing the same markup with different parsers. | Leonard Richardson | |