path: root/bs4
Age | Commit message | Author
2019-09-02 | Avoid a crash when trying to detect the declared encoding of a Unicode document. Raise an explanatory exception when the underlying parser completely rejects the incoming markup. [bug=1838877] | Leonard Richardson
2019-08-26 | It's now possible to override any of the element classes. | Leonard Richardson
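A minimal sketch of overriding element classes, assuming a Beautiful Soup release that includes this change. The `element_classes` keyword is taken from the released Beautiful Soup documentation rather than from this log, and `LoudString` and `LabelledTag` are hypothetical names used only for illustration.

```python
from bs4 import BeautifulSoup
from bs4.element import NavigableString, Tag


class LoudString(NavigableString):
    """Hypothetical string class, for illustration only."""

    def shout(self):
        return str(self).upper()


class LabelledTag(Tag):
    """Hypothetical tag class, for illustration only."""

    def label(self):
        return "<%s>" % self.name


# Map each default class to the subclass that should replace it.
soup = BeautifulSoup(
    "<p>hello</p>",
    "html.parser",
    element_classes={Tag: LabelledTag, NavigableString: LoudString},
)

paragraph = soup.p
print(type(paragraph).__name__, paragraph.label())
print(type(paragraph.string).__name__, paragraph.string.shout())
```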
2019-08-26 | Fixed the definition of the default XML namespace when using lxml 4.4. Patch by Isaac Muse. [bug=1840141] | Leonard Richardson
2019-08-22 | Test the ability to build a tree using objects other than Tag and NavigableString. | Leonard Richardson
2019-08-21 | When instantiating a BeautifulSoup object, it's now possible to provide replacement classes to be instantiated for every tag ('tag_class') or string ('string_class') encountered during parsing, rather than using the default Tag and NavigableString objects. | Leonard Richardson
2019-08-21 | Copying a Tag preserves information that was originally obtained from the TreeBuilder used to build the original Tag. [bug=1838903] | Leonard Richardson
2019-08-21 | Explicitly set preserve_whitespace_tags to None if there is no TreeBuilder. | Leonard Richardson
2019-08-21 | Fixed a crash when pretty-printing tags that were not created during initial parsing. [bug=1838903] | Leonard Richardson
2019-07-22 | Added a section about project support to the README. | Leonard Richardson
2019-07-21 | Implemented line number tracking for html5lib. | Leonard Richardson
2019-07-21 | Adapt Chris Mayo's code to track line number and position when using html.parser. | Leonard Richardson
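A short sketch of reading the tracked position back, assuming a release that includes these commits. The attribute names `sourceline` and `sourcepos` come from the released Beautiful Soup documentation, not from this log; the markup is made up for the example.

```python
from bs4 import BeautifulSoup

# Two paragraphs on separate lines, each starting at column 0.
markup = "<p>first</p>\n<p>second</p>"
soup = BeautifulSoup(markup, "html.parser")

# With html.parser, sourceline is the 1-based line number and
# sourcepos is the 0-based column at which the tag starts.
positions = [(p.sourceline, p.sourcepos) for p in soup.find_all("p")]
print(positions)
```

Other parsers may report positions differently, so the values are best treated as parser-specific.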
2019-07-16 | Prep for release. | Leonard Richardson
2019-07-16 | Suppressed warnings during tests that aren't about the warnings. | Leonard Richardson
2019-07-15 | Implemented Tag.smooth. | Leonard Richardson
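A minimal sketch of what Tag.smooth does, assuming a release that includes this commit: after a tree has been modified, adjacent strings can end up as separate nodes, and `smooth()` merges them. The markup is invented for the example.

```python
from bs4 import BeautifulSoup

soup = BeautifulSoup("<p>A one</p>", "html.parser")

# Appending a string leaves two adjacent NavigableString nodes.
soup.p.append(", a two")
before = len(soup.p.contents)

# smooth() consolidates adjacent strings in place.
soup.smooth()
after = len(soup.p.contents)

print(before, after, soup.p.string)
```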
2019-07-15 | Moved the formatter to its own class and updated its documentation. | Leonard Richardson
2019-07-15 | Improved comments in tests. | Leonard Richardson
2019-07-14 | Give the Formatter class more control over formatting decisions. | Leonard Richardson
2019-07-07 | A Formatter can now decide how (or whether) to order the attributes inside a tag. [bug=1812422] | Leonard Richardson
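A sketch of a formatter that keeps attributes in document order, assuming a release that includes this commit. Overriding `attributes()` on an `HTMLFormatter` subclass follows the released documentation; the class name `UnsortedAttributes` is only illustrative.

```python
from bs4 import BeautifulSoup
from bs4.formatter import HTMLFormatter


class UnsortedAttributes(HTMLFormatter):
    """Illustrative formatter: yield attributes in their original order."""

    def attributes(self, tag):
        for name, value in tag.attrs.items():
            yield name, value


soup = BeautifulSoup('<p z="1" m="2" a="3"></p>', "html.parser")

# The default formatter sorts attributes by name.
default_output = soup.p.decode()

# The custom formatter leaves the order untouched.
unsorted_output = soup.p.decode(formatter=UnsortedAttributes())

print(default_output)
print(unsorted_output)
```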
2019-07-07 | &apos; (which is valid in XML and XHTML, but not HTML 4) is now recognized as a named entity and converted to a single quote. [bug=1818721] | Leonard Richardson
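A small check of the entity handling described here, assuming a release that includes this commit and the html.parser tree builder. The markup is made up for the example.

```python
from bs4 import BeautifulSoup

# &apos; should come out as a plain single quote in the parsed text.
soup = BeautifulSoup("<p>it&apos;s</p>", "html.parser")
text = soup.p.get_text()
print(text)
```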
2019-07-07 | Renamed the cdata_list_attributes argument to multi_valued_attributes since it's facing the end-user and that's a more easily understandable name. | Leonard Richardson
2019-07-07 | It's now possible to override a TreeBuilder's cdata_list_attributes dictionary by passing in a replacement. None will disable the feature altogether. [bug=1832978] | Leonard Richardson
2019-07-07 | It's now possible to customize the TreeBuilder object by passing keyword arguments into the BeautifulSoup constructor. The main reason to do this right now is to change how multi-valued attributes are treated. [bug=1832978] | Leonard Richardson
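A minimal sketch of the three entries above, assuming a release that includes them: the `multi_valued_attributes` keyword is passed to the BeautifulSoup constructor and forwarded to the tree builder, and `None` turns the feature off. The markup is invented for the example.

```python
from bs4 import BeautifulSoup

markup = '<p class="body strikeout"></p>'

# Default behaviour: "class" is split into a list of values.
split_soup = BeautifulSoup(markup, "html.parser")
split_value = split_soup.p["class"]

# Passing None disables multi-valued attribute handling,
# so the attribute stays a single string.
plain_soup = BeautifulSoup(markup, "html.parser", multi_valued_attributes=None)
plain_value = plain_soup.p["class"]

print(split_value, plain_value)
```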
2019-01-06 | Prep for release. | Leonard Richardson
2019-01-06 | Tried even harder to avoid the deprecation warning originally fixed in 4.6.1. [bug=1778909] | Leonard Richardson
2019-01-06 | Fixed an incorrectly raised exception when inserting a tag before or after an identical tag. [bug=1810692] | Leonard Richardson
2019-01-06 | Don't track un-prefixed namespaces | Isaac Muse
2019-01-05 | Fix for performance with the linkage fix. The exact situations have been pinned down, and now solve current known issues without excessive and aggressive recursion. | Isaac Muse
2018-12-31 | Prep for release. | Leonard Richardson
2018-12-31 | Improved and tested error checking for insert_before and insert_after. | Leonard Richardson
2018-12-30 | Add conveniences for inserting multiple tags. Add extend method to append a list of tags. Make insert_before and insert_after accept multiple arguments. | Isaac Muse
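A sketch of the multi-tag conveniences, assuming a release that includes this commit. The helper `make_tag` and the markup are hypothetical, written only for this example.

```python
from bs4 import BeautifulSoup

soup = BeautifulSoup("<div><b>middle</b></div>", "html.parser")


def make_tag(name, text):
    """Hypothetical helper: build a new tag containing a single string."""
    tag = soup.new_tag(name)
    tag.string = text
    return tag


# insert_before accepts several elements and keeps their order.
soup.b.insert_before(make_tag("i", "one"), make_tag("u", "two"))

# extend appends every element of a list to the tag's contents.
soup.div.extend([make_tag("em", "three"), make_tag("strong", "four")])

result = str(soup.div)
print(result)
```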
2018-12-30 | Fixed a problem with multi-valued attributes where the value contained whitespace. Thanks to Jens Svalgaard for the fix. [bug=1787453] | Leonard Richardson
2018-12-30 | Merging the linkage checker and html5lib fixes by Isaac Muse found in https://code.launchpad.net/~facelessuser/beautifulsoup/html5lib-fix/+merge/361282. [bug=1809910] | Leonard Richardson
2018-12-26 | Remove dead line of code | Isaac Muse
2018-12-25 | Ensure html5lib always has valid internal linkage. html5lib, with malformed HTML, can end up with detached linkage internally. Improve the current code to ensure html5lib always has proper linkage. | Isaac Muse
2018-12-24 | Clarified the software license. | Leonard Richardson
2018-12-24 | Issue a warning and raise a more useful exception if someone tries to call Tag.select() without SoupSieve installed. | Leonard Richardson
2018-12-24 | Keep track of the namespace abbreviations found while parsing the document. This makes select() work most of the time without requiring a value for 'namespaces'. | Leonard Richardson
2018-12-23 | Merging Isaac Muse's Soup Sieve branch as-is before making some modifications. | Leonard Richardson
2018-12-23 | Merged in next_previous_fixes from Isaac Muse. [bug=1782928,1798699] | Leonard Richardson
2018-12-22 | Fix next and previous linkage issues. Fixes issues #1806598 and #1782928. | Isaac Muse
2018-12-20 | Pass flags to soupsieve. | Isaac Muse
2018-12-19 | Add Soup Sieve support | Isaac Muse
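A brief sketch of CSS selection through Soup Sieve, assuming both Beautiful Soup (with this support merged) and the soupsieve package are installed. The markup and selectors are made up for the example.

```python
from bs4 import BeautifulSoup

markup = (
    '<div>'
    '<p class="intro"><a href="/a">A</a></p>'
    '<p><a href="/b">B</a></p>'
    '</div>'
)
soup = BeautifulSoup(markup, "html.parser")

# select() returns every match for the CSS selector.
intro_links = [a["href"] for a in soup.select("p.intro > a[href]")]

# select_one() returns only the first match.
second_link = soup.select_one("p:nth-of-type(2) a").get_text()

print(intro_links, second_link)
```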
2018-08-12 | Bump up to version 4.6.3 so I can re-release. | Leonard Richardson
2018-08-12 | Converted README to Markdown format. | Leonard Richardson
2018-07-30 | Fix an exception when a custom formatter was asked to format a void element. [bug=1784408] | Leonard Richardson
2018-07-28 | Prep for release. | Leonard Richardson
2018-07-28 | When markup contains duplicate elements, a select() call that includes multiple match clauses will match all relevant elements. [bug=1770596] | Leonard Richardson
2018-07-28 | Correctly handle invalid HTML numeric character entities like &#147; which reference code points that are not Unicode code points. Note that this is only fixed when Beautiful Soup is used with the html.parser parser -- html5lib already worked and I couldn't fix it with lxml. [bug=1782933] | Leonard Richardson
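A small illustration of the numeric-entity handling, assuming a release that includes this commit and the html.parser tree builder. It relies on the understanding that references such as &#147; and &#148; are interpreted as Windows-1252 bytes; the markup is invented for the example.

```python
from bs4 import BeautifulSoup

# &#147; and &#148; are not valid Unicode references, but under
# Windows-1252 they correspond to curly double quotes.
soup = BeautifulSoup("<p>&#147;Hello&#148;</p>", "html.parser")
text = soup.p.get_text()
print(text)
```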
2018-07-21 | Clarified the deprecation warning when accessing tag.fooTag, to cover the possibility that you might really have been looking for a tag called 'fooTag'. | Leonard Richardson
2018-07-21 | Fixed a problem where the html.parser tree builder interpreted a string like '&foo ' as the character entity '&foo;' [bug=1728706] | Leonard Richardson