summaryrefslogtreecommitdiff
path: root/tests/test_lxml.py
AgeCommit message (Collapse)Author
2023-01-25Removed very old tests that were imported as part of the bzr import but not ↵Leonard Richardson
removed.
2011-05-21OK, figured that out.Leonard Richardson
2011-05-21Test that mixed-case tags are folded to lowercase.Leonard Richardson
2011-02-27Added tests to verify that bug 606662 is fixed.Leonard Richardson
2011-02-27Added a test to verify that bug 369897 is fixed.Leonard Richardson
2011-02-27Added test to verify that bug 403640 is fixed.Leonard Richardson
2011-02-27Added a tree builder for the built-in HTMLParser, and tests.Leonard Richardson
2011-02-27Renamed the beautifulsoup module to bs4 to save typing.Leonard Richardson
2011-02-26Emit an XML declaration when appropriate.Leonard Richardson
2011-02-22Solved the question of how to decide between ' (XML) and &squot; (HTML) ↵Leonard Richardson
by cutting the Gordian knot: quote the *double* quotes, which are always ".
2011-02-20Renamed constructor arguments to comply with PEP 8.Leonard Richardson
2011-02-20Use registration code to register builders. The registration code will be ↵Leonard Richardson
expanded later.
2011-02-20Discovered that html5lib can't be made to support SoupStrainers, and changed ↵Leonard Richardson
the test suite appropriately.
2011-02-20Since we can't parse in CData objects ATM, added a test for CData objects ↵Leonard Richardson
created manually, to keep the bits from rotting.
2011-02-20Made the XML treebuilder able to handle basic invalid XML.Leonard Richardson
2011-02-20Refactored some empty-element tests and added more.Leonard Richardson
2011-02-20Test that empty-element tags that get children stop being empty-element tags.Leonard Richardson
2011-02-20Added tests of custom lists of empty-element tags.Leonard Richardson
2011-02-20Added an empty-element tag test.Leonard Richardson
2011-02-20Tag.is_empty_element is determined dynamically, based on a) whether the ↵Leonard Richardson
builder used to create the tag had an explicit list of empty-element tags, and b) whether the tag actually contains anything.
2011-02-20Why is the test failing? Because I'm asserting the wrong thing.Leonard Richardson
2011-02-19Hacked in something to get lxml's behavior where any empty tag is treated as ↵Leonard Richardson
self-closing. This may or may not stay as is.
2011-02-19Oh, good, html5lib correctly handles literals in <textarea> tags.Leonard Richardson
2011-02-19Set up an lxml parser that only parses XML, though it's not very functional yet.Leonard Richardson
2011-02-18Moved in the last of the tests from TODO.Leonard Richardson
2011-02-18Ported tests of bad markup that were lying around the TODO.Leonard Richardson
2011-02-18Made Unicode, Dammit more PEP-8 compliant.Leonard Richardson
2011-02-18Fixed the test by giving it more data to sniff.Leonard Richardson
2011-02-18Pass the user-specified encoding in to html5lib rather than dropping it on ↵Leonard Richardson
the floor.
2011-02-18Have the html5lib builder set the sniffed encoding after parsing, rather ↵Leonard Richardson
than before as happens with lxml.
2011-02-18Added failing encoding conversion tests for html5lib.Leonard Richardson
2011-02-18Made conversion of markup to Unicode the responsibility of the builder, not ↵Leonard Richardson
the BeautifulSoup class itself. lxml uses Unicode, Dammit; html5lib uses its internal algorithms.
2011-02-18Moved the substitution code to the Tag constructor so that we don't have to ↵Leonard Richardson
rely on handle_starttag to trigger it.
2011-02-18Clarified wording.Leonard Richardson
2011-02-18Removed partially ported test that's now completely ported.Leonard Richardson
2011-02-18Ported the encoding tests, and split them up into logical chunks. The ↵Leonard Richardson
html5lib writer isn't setting up the charset substitution.
2011-02-13Ported more tests of bad declarations.Leonard Richardson
2011-02-13Added tests of nonsensical declarations.Leonard Richardson
2011-02-13Got the doctype tests to work for html5lib.Leonard Richardson
2011-02-13Got a variety of doctype tests working.Leonard Richardson
2011-02-13Added tests for namespaced doctypes.Leonard Richardson
2011-02-13Clarified lxml's behavior w/r/t CDATA sections.Leonard Richardson
2011-02-13Figured out the deal with CDATA sections in lxml and html5lib, and added ↵Leonard Richardson
comments and tests.
2011-02-10Added some elementary doctype handling.Leonard Richardson
2011-02-10Added more table tests.Leonard Richardson
2011-02-10Added tests illustrating the different ways lxml and html5lib handle nested ↵Leonard Richardson
tables.
2011-02-10Ported some more tests demonstrating that entities are converted to Unicode ↵Leonard Richardson
characters on the way in.
2011-02-10Added a test to verify that both lxml and html5lib convert entities to ↵Leonard Richardson
Unicode characters during parsing.
2011-01-30Stop pretending that the 'generic' builder test is different from the lxml test.Leonard Richardson
2011-01-28Refactored enough to get all of the tests to pass with test discovery, even ↵Leonard Richardson
though there are still some underlying problems.