Age | Commit message | Author | |
---|---|---|---|
2013-10-18 | Fixed yet another problem that caused the html5lib tree builder to create a disconnected parse tree. [bug=1237763] | Leonard Richardson | |
2013-10-01 | Fixed a bug in which short Unicode input was improperly encoded to ASCII when checking whether or not it was a file on disk. [bug=1227016] | Leonard Richardson | |
2013-08-13 | Fixed yet another problem with the html5lib tree builder, caused by html5lib's tendency to rearrange the tree during parsing. [bug=1189267] | Leonard Richardson | |
2013-06-03 | Save another Element creation. | Leonard Richardson | |
2013-06-03 | Improved performance for html5lib. | Leonard Richardson | |
2013-06-03 | Improved performance of _replace_cdata_list_attribute_values, and greatly reduced the number of times it is called. | Leonard Richardson | |
2013-06-02 | Merged in big encoding-detection refactoring branch. | Leonard Richardson | |
2013-06-02 | Turns out we had two bits of code to strip byte-order marks. | Leonard Richardson | |
2013-06-02 | It turns out most of the untested code wasn't doing anything useful. | Leonard Richardson | |
2013-06-02 | Treat an lxml ParserError as a ParserRejectedMarkup. | Leonard Richardson | |
2013-05-31 | The html.parser treebuilder can now handle numeric attributes in text when the hexadecimal name of the attribute starts with a capital X. | Leonard Richardson | |
2013-05-31 | Create a new lxml parser object for every new parsing strategy. | Leonard Richardson | |
2013-05-20 | The default XML formatter will now replace ampersands even if they appear to be part of entities. That is, "&lt;" will become "&amp;lt;". [bug=1182183] | Leonard Richardson | |
2013-05-20 | The .next_element attribute used during parsing was confusingly similar to the .next_element navigation attribute. Renamed the former to _most_recent_element. | Leonard Richardson | |
2013-05-20 | Fixed another bug by which the html5lib tree builder could create a disconnected tree. [bug=1182089] | Leonard Richardson | |
2013-05-09 | Changed lxml.feed() to handle the eventuality that it may be given a bytestring. | Leonard Richardson | |
2013-05-09 | Added a diagnostic function for randomly generating a simple, invalid HTML document. | Leonard Richardson | |
2013-05-07 | Now that lxml's segfault on invalid doctype has been fixed, fix a corresponding problem on the Beautiful Soup end that was previously invisible. [bug=984936] | Leonard Richardson | |
2012-10-11 | Fix a bug in the lxml treebuilder which crashed when a tag included an attribute from the predefined xml: namespace. [bug=1065617] | Leonard Richardson | |
2012-09-28 | Fixed package name. | Leonard Richardson | |
2012-08-21 | We don't need a special insertComment method, we just need to make Element.appendChild call object_was_parsed. | Leonard Richardson | |
2012-08-21 | Fixed a problem with the html5lib builder not handling comments correctly. | Leonard Richardson | |
2012-08-16 | Use namespace prefixes for namespaced attribute names, instead of the fully-qualified names given by the lxml parser. [bug=1037597] | Leonard Richardson | |
2012-06-30 | Fixed an html5lib tree builder crash which happened when html5lib moved a tag with a multivalued attribute from one part of the tree to another. [bug=1019603] | Leonard Richardson | |
2012-05-29 | Removed breakpoints. | Leonard Richardson | |
2012-05-29 | Prep for release. | Leonard Richardson | |
2012-05-24 | Fixed a bug with the lxml treebuilder that prevented the user from adding attributes to a tag that didn't originally have any. [bug=1002378] Thanks to Oliver Beattie for the patch. | Leonard Richardson | |
2012-04-26 | The test suite now passes when lxml is not installed, whether or not html5lib is installed. [bug=987004] | Leonard Richardson | |
2012-04-18 | Got rid of contains_substitutions. | Leonard Richardson | |
2012-04-18 | Made encoding substitution in <meta> tags completely transparent (no more %SOUP-ENCODING%). | Leonard Richardson | |
2012-04-18 | Changed wording slightly. | Leonard Richardson | |
2012-04-18 | Print a warning on HTMLParseErrors to let people know they should install an external parser. | Leonard Richardson | |
2012-04-18 | Fixed a bug that made the HTMLParser treebuilder generate XML definitions ending with two question marks instead of one. [bug=984258] | Leonard Richardson | |
2012-04-03 | Got rid of the 4.0.2 workaround for HTML documents--it was unnecessary and the workaround was triggering a (possibly different, but related) bug in lxml. [bug=972466] | Leonard Richardson | |
2012-04-03 | Don't split up the markup into chunks when using the lxml HTML parser, which doesn't have the problems of the XML parser. | Leonard Richardson | |
2012-03-30 | Fixed a typo that caused some versions of Python 3 to convert the Beautiful Soup codebase incorrectly. | Leonard Richardson | |
2012-03-24 | Pass data into XMLParser.feed() in chunks. [bug=963880] | Leonard Richardson | |
2012-03-01 | In HTML5-style <meta charset="foo"> tags, the value of the "charset" attribute is now replaced with the appropriate encoding on output. [bug=942714] | Leonard Richardson | |
2012-03-01 | Added missing __len__ method that stopped html5lib tree builder from working on nested formatting elements. [bug=943246] | Leonard Richardson | |
2012-02-28 | Fixed the generated XML declaration. | Leonard Richardson | |
2012-02-24 | Warn when SoupStrainer is used with the html5lib tree builder. | Leonard Richardson | |
2012-02-23 | Fixed handling of the closing of namespaced tags. | Leonard Richardson | |
2012-02-23 | Bumped version number. | Leonard Richardson | |
2012-02-23 | Merge from trunk and added tests. | Leonard Richardson | |
2012-02-22 | Added comments. | Leonard Richardson | |
2012-02-22 | Treat a new namespace mapping as a set of attributes on the tag that defines it, so we don't lose the mappings. | Leonard Richardson | |
2012-02-21 | Have lxml invert namespace maps as they come in and set each tag's prefix appropriately. | Leonard Richardson | |
2012-02-21 | Added nsprefix argument to the tag class. | Leonard Richardson | |
2012-02-21 | Merged from trunk. | Leonard Richardson | |
2012-02-20 | It's now possible to copy a BeautifulSoup object created with the html.parser treebuilder. | Leonard Richardson | |