summaryrefslogtreecommitdiff
path: root/bs4/dammit.py
AgeCommit message (Collapse)Author
2012-04-27Added experimental support for fixing Windows-1252 characters embedded in ↵Leonard Richardson
UTF-8 documents.
2012-04-26Fixed a bug in decoding data that contained a byte-order mark, such as data ↵Leonard Richardson
encoded in UTF-16LE. [bug=988980]
2012-04-16Unicode, Dammit now has an option to turn MS smart quotes into ASCII characters.Leonard Richardson
2012-04-16Attribute values are now run through the provided output formatter. ↵Leonard Richardson
Previously they were always run through the 'minimal' formatter. [bug=980237]
2012-02-16Issue a warning if characters were replaced with REPLACEMENT CHARACTER ↵Leonard Richardson
during Unicode conversion.
2012-02-09As a last-ditch attempt to turn data into Unicode, use errors=replace ↵Leonard Richardson
instead of errors=strict.
2012-02-09Unicode, Dammit now detects the encoding in HTML 5-style <meta> tags like ↵Leonard Richardson
<meta charset="utf-8" />. [bug=837268]
2012-02-09Minor Unicode, Dammit cleanup.Leonard Richardson
2012-02-09Improved Unicode, Dammit's behavior when you give it Unicode to begin with.Leonard Richardson
2011-06-29Various changes so most tests pass on Python 3.Thomas Kluyver
2011-05-21OK, figured that out.Leonard Richardson
2011-05-21Changed dammit.py to require fewer changes to be Python 3 compatible.Leonard Richardson
2011-03-05PEP8ifyingAaron DeVore
2011-02-27Added a tree builder for the built-in HTMLParser, and tests.Leonard Richardson