summaryrefslogtreecommitdiff
path: root/bs4/tests/test_soup.py
AgeCommit message (Collapse)Author
2012-07-03When sniffing encodings, if the cchardet library is installed, use it ↵Leonard Richardson
instead of chardet. It's much faster. [bug=1020748]
2012-07-03Use logging.warning() instead of warning.warn() to notify the user that ↵Leonard Richardson
characters were replaced with REPLACEMENT CHARACTER. [bug=1013862]
2012-05-24Fixed a bug with the lxml treebuilder that prevented the user from adding ↵Leonard Richardson
attributes to a tag that didn't originally have any. [bug=1002378] Thanks to Oliver Beattie for the patch.
2012-04-27Added experimental support for fixing Windows-1252 characters embedded in ↵Leonard Richardson
UTF-8 documents.
2012-04-26Fixed a bug in decoding data that contained a byte-order mark, such as data ↵Leonard Richardson
encoded in UTF-16LE. [bug=988980]
2012-04-26Fixed test failure when lxml is not installed.Leonard Richardson
2012-04-18Made encoding substitution in <meta> tags completely transparent (no more ↵Leonard Richardson
%SOUP-ENCODING%).
2012-04-16Unicode, Dammit now has an option to turn MS smart quotes into ASCII characters.Leonard Richardson
2012-03-01For backwards compatibility, brought back the BeautifulStoneSoup class as a ↵Leonard Richardson
deprecated wrapper around BeautifulSoup.
2012-02-26Fixed DOCTYPE handling.Leonard Richardson
2012-02-24Fixed a test failure that occured on Python 3.x when chardet was installed.Leonard Richardson
2012-02-23Fixed handling of the closing of namespaced tags.Leonard Richardson
2012-02-23Bumped version number.Leonard Richardson
2012-02-23Namespaced attributes are equal if they correspond to the same string.Leonard Richardson
2012-02-22Removed tests that merely illustrated parser behavior, behavior that ↵Leonard Richardson
wouldn't break Beautiful Soup if it changed.
2012-02-20Changd the class structure so that the default parser test class uses ↵Leonard Richardson
html.parser.
2012-02-16Issue a warning if characters were replaced with REPLACEMENT CHARACTER ↵Leonard Richardson
during Unicode conversion.
2012-02-09As a last-ditch attempt to turn data into Unicode, use errors=replace ↵Leonard Richardson
instead of errors=strict.
2012-02-09Unicode, Dammit now detects the encoding in HTML 5-style <meta> tags like ↵Leonard Richardson
<meta charset="utf-8" />. [bug=837268]
2012-01-20Tests now work in both versions, and it's possible to test both versions by ↵Leonard Richardson
running one command.
2012-01-20Made it easier to convert BS3 code to BS4.Leonard Richardson
2012-01-20Replaced assertEquals with assertEqual to get rid of deprecation notice.Leonard Richardson
2011-06-29Various changes so most tests pass on Python 3.Thomas Kluyver