summaryrefslogtreecommitdiff
path: root/bs4
AgeCommit message (Expand)Author
2013-06-03Made it a lot faster to check whether whitespace is being preserved.Leonard Richardson
2013-06-03Put the more frequently-used ASCII spaces in front.Leonard Richardson
2013-06-03Wrote a more efficient replacement for string.translate() when checking wheth...Leonard Richardson
2013-06-03Let's get some profiling going.Leonard Richardson
2013-06-03Test that the filename warning isn't given unless the file actually exists on...Leonard Richardson
2013-06-03Beautiful Soup will issue a warning if instead of markup you pass itLeonard Richardson
2013-06-02Merged in big encoding-detection refactoring branch.Leonard Richardson
2013-06-02Turns out we had two bits of code to strip byte-order marks.Leonard Richardson
2013-06-02It turns out most of the untested code wasn't doing anything useful.Leonard Richardson
2013-06-02Treat an lxml ParserError as a ParserRejectedMarkup.Leonard Richardson
2013-05-31The html.parser treebuilder can now handle numeric attributes inLeonard Richardson
2013-05-31Reverted the patch that gives NavigableString a .name property, because that'...Leonard Richardson
2013-05-31Create a new lxml parser object for every new parsing strategy.Leonard Richardson
2013-05-30Refactored code a bit.Leonard Richardson
2013-05-30Split out the code that guesses at encodings from the code that tries to deco...Leonard Richardson
2013-05-20The default XML formatter will now replace ampersands even if they appear to ...Leonard Richardson
2013-05-20A NavigableString object now has an immutable '.name' property whoseLeonard Richardson
2013-05-20The .next_element attribute used during parsing was confusingly similar to th...Leonard Richardson
2013-05-20Fixed another bug by which the html5lib tree builder could create aLeonard Richardson
2013-05-20Gave new_string() the ability to create subclasses ofLeonard Richardson
2013-05-20html5lib now supports Python 3. Fixed some Python 2-specificLeonard Richardson
2013-05-20Fixed test failures when lxml is not installed.Leonard Richardson
2013-05-15How about actually parsing the same markup with different parsers.Leonard Richardson
2013-05-15Merge.Leonard Richardson
2013-05-14Added diagnostic case for attempting to parse a URL as HTML.Leonard Richardson
2013-05-14Added a deprecation warning to has_key().Leonard Richardson
2013-05-09Changed lxml.feed() to handle the eventuality that it may be given a bytestring.Leonard Richardson
2013-05-09Added a basic benchmark function to the diagnose module.Leonard Richardson
2013-05-09Added a diagnostic function for randomly generating a simple, invalid HTML do...Leonard Richardson
2013-05-08A CSS selector should never match the same tag twice.Leonard Richardson
2013-05-08Minor cleanup.Leonard Richardson
2013-05-08Added tests.Leonard Richardson
2013-05-08Fixed terminology.Leonard Richardson
2013-05-08Moved select() to Tag. It was always an error to call select() on a string, s...Leonard Richardson
2013-05-08Give the checker the ability to stop the iteration over the generator by rais...Leonard Richardson
2013-05-08Aaand... it's now trivial to implement sibling selectors.Leonard Richardson
2013-05-08Once again, we're back to the steady state.Leonard Richardson
2013-05-08Got it all working again except for nth_child_of_type.Leonard Richardson
2013-05-08Refactored again to use iterators instead of calling find_all().Leonard Richardson
2013-05-08OK, the tests pass.Leonard Richardson
2013-05-08Almost there.Leonard Richardson
2013-05-08We're getting there.Leonard Richardson
2013-05-08Fixing test failures.Leonard Richardson
2013-05-08Initial refactoring.Leonard Richardson
2013-05-07Fixed up diagnose() and added it to the docs.Leonard Richardson
2013-05-07Since the string part of a NavigableString is immutable, gave it a simpler __...Leonard Richardson
2013-05-07Fixed an exception when an overspecified CSS selector didn't matchLeonard Richardson
2013-05-07Added support for the "nth-of-type" CSS selector. The CSS selector ">" can no...Leonard Richardson
2013-05-07The prettify() method now leaves the contents of <pre> tagsLeonard Richardson
2013-05-07Merged.Leonard Richardson