path: root/bs4/tests/test_lxml.py
Age | Commit message | Author
2023-03-27 | Make it possible to pickle a deeply nested BeautifulSoup object. | Leonard Richardson
2023-01-31 | Consistently use pytest.mark.skipif to skip tests when the corresponding libraries are not installed. | Leonard Richardson
2023-01-27 | Check the associated filename for more warnings. | Leonard Richardson
2021-12-17 | Fix a crash when pickling a BeautifulSoup object that has no tree builder. [bug=1934003] | Leonard Richardson
2021-11-29 | Do a better job of keeping track of namespaces as an XML document is parsed, so that CSS selectors that use namespaces will do the right thing more often. [bug=1946243] | Leonard Richardson
2021-10-23 | Added a workaround for an lxml bug (https://bugs.launchpad.net/lxml/+bug/1948551) that caused problems when parsing a Unicode string beginning with BYTE ORDER MARK. [bug=1947768] | Leonard Richardson
2021-10-11 | Moved the test classes to tests/__init__.py. | Leonard Richardson
2021-10-09 | Moved testing.py into the same package as the tests. | Leonard Richardson
2021-09-12 | Ported unit tests to use pytest. | Leonard Richardson
2021-09-07 | Goodbye, Python 2. [bug=1942919] | Leonard Richardson
2021-05-31 | The html.parser tree builder can now handle named entities found in the HTML5 spec in much the same way that the html5lib tree builder does. Note that the lxml tree builder still handles named entities differently. [bug=1924908] | Leonard Richardson
2019-07-21 | Implemented line number tracking for html5lib. | Leonard Richardson
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes dictionary by passing in a replacement. None will disable the feature altogether. [bug=1832978] | Leonard Richardson
2019-01-06 | Don't track un-prefixed namespaces | Isaac Muse
2018-07-28 | When markup contains duplicate elements, a select() call that includes multiple match clauses will match all relevant elements. [bug=1770596] | Leonard Richardson
2018-07-28 | Correctly handle invalid HTML numeric character entities like &#147; which reference code points that are not Unicode code points. Note that this is only fixed when Beautiful Soup is used with the html.parser parser -- html5lib already worked and I couldn't fix it with lxml. [bug=1782933] | Leonard Richardson
2014-12-11 | Improved the lxml tree builder's handling of processing instructions. [bug=1294645] | Leonard Richardson
2013-08-19 | Combined two tests to stop a spurious test failure when tests are run by nosetests. [bug=1212445] | Leonard Richardson
2013-05-31 | Create a new lxml parser object for every new parsing strategy. | Leonard Richardson
2013-05-20 | Fixed test failures when lxml is not installed. | Leonard Richardson
2013-05-09 | Changed lxml.feed() to handle the eventuality that it may be given a bytestring. | Leonard Richardson
2013-05-07 | Improved detection of lxml version number. | Leonard Richardson
2013-05-07 | Now that lxml's segfault on invalid doctype has been fixed, fix a corresponding problem on the Beautiful Soup end that was previously invisible. [bug=984936] | Leonard Richardson
2012-04-18 | Fixed a bug that made the HTMLParser treebuilder generate XML definitions ending with two question marks instead of one. [bug=984258] | Leonard Richardson
2012-03-01 | For backwards compatibility, brought back the BeautifulStoneSoup class as a deprecated wrapper around BeautifulSoup. | Leonard Richardson
2012-02-23 | Namespaced attributes are equal if they correspond to the same string. | Leonard Richardson
2012-02-22 | Minor cleanup. | Leonard Richardson
2012-02-22 | Removed tests that merely illustrated parser behavior, behavior that wouldn't break Beautiful Soup if it changed. | Leonard Richardson
2012-02-20 | lxml tests are once again run and pass when lxml is installed. | Leonard Richardson
2012-02-20 | Changed the class structure so that the default parser test class uses html.parser. | Leonard Richardson
2012-02-15 | Added a kind of hacky way to interpret the restriction class='foo bar'. Stop generating a space before the slash that closes an empty-element tag. | Leonard Richardson
2012-02-15 | Tested and cleaned up html5lib insertBefore. | Leonard Richardson
2012-02-15 | Tested improvements to html5lib treebuilder. | Leonard Richardson
2012-02-07 | On output, always convert special XML characters to entities. | Leonard Richardson
2012-01-20 | Replaced assertEquals with assertEqual to get rid of deprecation notice. | Leonard Richardson
2012-01-20 | Actually fixed the test failure. | Leonard Richardson
2011-06-29 | Various changes so most tests pass on Python 3. | Thomas Kluyver