Age | Commit message (Collapse) | Author | |
---|---|---|---|
2019-10-05 | Avoid a crash when unpickling certain parse trees generated using html5lib ↵ | Leonard Richardson | |
on Python 3. [bug=1843545] | |||
2019-09-02 | Avoid a crash when trying to detect the declared encoding of a | Leonard Richardson | |
Unicode document. Raise an explanatory exception when the underlying parser completely rejects the incoming markup. [bug=1838877] | |||
2019-08-26 | It's now possible to override any of the element classes. | Leonard Richardson | |
2019-08-22 | Test the ability to build a tree using objects other than Tag and ↵ | Leonard Richardson | |
NavigableString. | |||
2019-07-16 | Suppressed warnings during tests that aren't about the warnings. | Leonard Richardson | |
2019-07-07 | ' (which is valid in XML and XHTML, but not HTML 4) is now | Leonard Richardson | |
recognized as a named entity and converted to a single quote. [bug=1818721] | |||
2019-07-07 | Renamed the cdata_list_attributes argument to multi_valued_attributes since ↵ | Leonard Richardson | |
it's facing the end-user and that's a more easily understandable name. | |||
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes ↵ | Leonard Richardson | |
dictionary by passing in a replacement. None will disable the feature altogether. [bug=1832978] | |||
2019-07-07 | It's now possible to customize the TreeBuilder object by passing | Leonard Richardson | |
keyword arguments into the BeautifulSoup constructor. The main reason to do this right now is to change how multi-valued attributes are treated. [bug=1832978] | |||
2016-07-26 | Clarify that Beautiful Soup is no longer compatible with versions of Python ↵ | Leonard Richardson | |
pre-2.7. Contributed by Ville Skyttä. | |||
2016-07-26 | Use assertEqual instead of deprecated assertEquals | Ville Skyttä | |
2016-07-26 | Clarify Python 2(.7) support status | Ville Skyttä | |
2016-07-16 | Fixed a Python 3 ByteWarning when a URL was passed in as though it | Leonard Richardson | |
were markup. Thanks to James Salter for a patch and test. [bug=1533762] | |||
2015-07-05 | Fixed the test_detect_utf8 test so that it works when chardet is | Leonard Richardson | |
installed. [bug=1471359] | |||
2015-06-27 | Added an exclude_encodings argument to UnicodeDammit and to the | Leonard Richardson | |
Beautiful Soup constructor, which lets you prohibit the detection of an encoding that you know is wrong. [bug=1469408] | |||
2015-06-25 | Fixed a crash in Unicode, Dammit's encoding detector when the name | Leonard Richardson | |
of the encoding itself contained invalid bytes. [bug=1360913] | |||
2014-12-07 | Issue a warning if the BeautifulSoup constructor arguments do not explicitly ↵ | Leonard Richardson | |
name a parser. | |||
2013-10-02 | Fixed a bug that caused Unicode data put into UnicodeDammit to | Leonard Richardson | |
return None instead of the original data. [bug=1214983] | |||
2013-10-01 | Fixed a crash when a short input contains data not valid in | Leonard Richardson | |
filenames. [bug=1232604] | |||
2013-10-01 | Fixed a bug in which short Unicode input was improperly encoded to ASCII ↵ | Leonard Richardson | |
when checking whether or not it was a file on disk. [bug=1227016] | |||
2013-08-19 | Combined two tests to stop a spurious test failure when tests are | Leonard Richardson | |
run by nodetests. [bug=1212445] | |||
2013-06-03 | Let's get some profiling going. | Leonard Richardson | |
2013-06-03 | Test that the filename warning isn't given unless the file actually exists ↵ | Leonard Richardson | |
on disk. | |||
2013-06-03 | Beautiful Soup will issue a warning if instead of markup you pass it | Leonard Richardson | |
a URL or the name of a file on disk (a common beginner mistake). | |||
2013-06-02 | Turns out we had two bits of code to strip byte-order marks. | Leonard Richardson | |
2013-06-02 | It turns out most of the untested code wasn't doing anything useful. | Leonard Richardson | |
2013-05-30 | Split out the code that guesses at encodings from the code that tries to ↵ | Leonard Richardson | |
decode a bytestring based on those encodings. This is necessary because lxml wants to do the decoding itself. | |||
2013-05-20 | The default XML formatter will now replace ampersands even if they appear to ↵ | Leonard Richardson | |
be part of entities. That is, "<" will become "&lt;".[bug=1182183] | |||
2012-08-20 | Python 3.1 also needs to skip the unicode attribute name test. | Leonard Richardson | |
2012-08-20 | Skipped a test under Python 2.6 to avoid a spurious test failure. [bug=1038503] | Leonard Richardson | |
2012-08-17 | Okay, I'll use assertEqual instead. | Leonard Richardson | |
2012-08-17 | Fixed a crash on encoding when an attribute name contained | Leonard Richardson | |
non-ASCII characters. | |||
2012-07-03 | Mentioned cchardet in docs. | Leonard Richardson | |
2012-07-03 | When sniffing encodings, if the cchardet library is installed, use it ↵ | Leonard Richardson | |
instead of chardet. It's much faster. [bug=1020748] | |||
2012-07-03 | Use logging.warning() instead of warning.warn() to notify the user that ↵ | Leonard Richardson | |
characters were replaced with REPLACEMENT CHARACTER. [bug=1013862] | |||
2012-05-24 | Fixed a bug with the lxml treebuilder that prevented the user from adding ↵ | Leonard Richardson | |
attributes to a tag that didn't originally have any. [bug=1002378] Thanks to Oliver Beattie for the patch. | |||
2012-04-27 | Added experimental support for fixing Windows-1252 characters embedded in ↵ | Leonard Richardson | |
UTF-8 documents. | |||
2012-04-26 | Fixed a bug in decoding data that contained a byte-order mark, such as data ↵ | Leonard Richardson | |
encoded in UTF-16LE. [bug=988980] | |||
2012-04-26 | Fixed test failure when lxml is not installed. | Leonard Richardson | |
2012-04-18 | Made encoding substitution in <meta> tags completely transparent (no more ↵ | Leonard Richardson | |
%SOUP-ENCODING%). | |||
2012-04-16 | Unicode, Dammit now has an option to turn MS smart quotes into ASCII characters. | Leonard Richardson | |
2012-03-01 | For backwards compatibility, brought back the BeautifulStoneSoup class as a ↵ | Leonard Richardson | |
deprecated wrapper around BeautifulSoup. | |||
2012-02-26 | Fixed DOCTYPE handling. | Leonard Richardson | |
2012-02-24 | Fixed a test failure that occured on Python 3.x when chardet was installed. | Leonard Richardson | |
2012-02-23 | Fixed handling of the closing of namespaced tags. | Leonard Richardson | |
2012-02-23 | Bumped version number. | Leonard Richardson | |
2012-02-23 | Namespaced attributes are equal if they correspond to the same string. | Leonard Richardson | |
2012-02-22 | Removed tests that merely illustrated parser behavior, behavior that ↵ | Leonard Richardson | |
wouldn't break Beautiful Soup if it changed. | |||
2012-02-20 | Changd the class structure so that the default parser test class uses ↵ | Leonard Richardson | |
html.parser. | |||
2012-02-16 | Issue a warning if characters were replaced with REPLACEMENT CHARACTER ↵ | Leonard Richardson | |
during Unicode conversion. |