index
:
beautifulsoup.git
master
Unnamed repository; edit this file 'description' to name the repository.
User &
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
bs4
/
builder
Age
Commit message (
Expand
)
Author
2018-07-14
Stopped HTMLParser from raising an exception in very rare cases of
Leonard Richardson
2017-05-06
Improved the handling of empty-element tags like <br> when using the
Leonard Richardson
2017-05-06
HTML parsers treat all HTML4 and HTML5 empty element tags (aka void element t...
Leonard Richardson
2016-12-19
Fixed foster parenting when html5lib is the tree builder. Thanks to Geoffrey ...
Leonard Richardson
2016-12-19
Fixed yet another problem that caused the html5lib tree builder to
Leonard Richardson
2016-07-30
Explained why we test both unicode and bytestring processing instructions.
Leonard Richardson
2016-07-26
Fixed a reported (but not duplicated) bug involving processing instructions f...
Leonard Richardson
2016-07-16
Beautiful Soup will now work with versions of html5lib greater than
Leonard Richardson
2016-07-16
Removed imports to pdb, since pdb is not available in some environments. [bug...
Leonard Richardson
2016-07-16
The contents of <textarea> tags will no longer be modified when the
Leonard Richardson
2016-07-16
Added a separate class for XML processing instructions, which have a slightly...
Leonard Richardson
2016-07-16
Rename COPYING.txt to LICENSE. Add a reference to LICENSE in every source file.
Leonard Richardson
2015-12-08
Fix foster parenting with html5lib.
Geoffrey Sneddon
2015-12-08
Make TreeBuilderForHtml5lib strictly follow the html5lib API.
Geoffrey Sneddon
2015-09-28
Fixed a parse bug with the html5lib tree-builder. Thanks to Roel
Leonard Richardson
2015-06-28
It's now possible to pickle a BeautifulSoup object no matter which
Leonard Richardson
2015-06-28
Changed the way soup objects work under copy.copy(). Copying a
Leonard Richardson
2015-06-28
Fixed a bug where Element.extract() could create an infinite loop in
Leonard Richardson
2015-06-28
Accept 'xml' as an unambiguous identifier for the lxml XML parser, since it's...
Leonard Richardson
2015-06-27
Added an exclude_encodings argument to UnicodeDammit and to the
Leonard Richardson
2015-06-26
Added a sanity check helper method that makes sure all the elements of a tree...
Leonard Richardson
2015-06-24
Fixed an import error in Python 3.5 caused by the removal of the
Leonard Richardson
2015-06-24
Made double sure that we don't use the 'strict' constructor argument when it'...
Leonard Richardson
2015-06-24
If the initial <html> tag contains a CDATA list attribute such as
Leonard Richardson
2015-06-23
Got a hacky fix for the latest html5lib problem.
Leonard Richardson
2014-12-11
Improved the lxml tree builder's handling of processing
Leonard Richardson
2014-12-07
In Python 3.4 and above, set the new convert_charrefs argument to
Leonard Richardson
2014-12-07
Tweaked the parser warning.
Leonard Richardson
2014-12-07
Issue a warning if the BeautifulSoup constructor arguments do not explicitly ...
Leonard Richardson
2013-10-18
Fixed yet another problem that caused the html5lib tree builder to
Leonard Richardson
2013-10-01
Fixed a bug in which short Unicode input was improperly encoded to ASCII when...
Leonard Richardson
2013-08-13
* Fixed yet another problem with the html5lib tree builder, caused by
Leonard Richardson
2013-06-03
Save another Element creation.
Leonard Richardson
2013-06-03
Improved performance for html5lib.
Leonard Richardson
2013-06-03
Improved performance of _replace_cdata_list_attribute_values, and greatly red...
Leonard Richardson
2013-06-02
Merged in big encoding-detection refactoring branch.
Leonard Richardson
2013-06-02
Turns out we had two bits of code to strip byte-order marks.
Leonard Richardson
2013-06-02
It turns out most of the untested code wasn't doing anything useful.
Leonard Richardson
2013-06-02
Treat an lxml ParserError as a ParserRejectedMarkup.
Leonard Richardson
2013-05-31
The html.parser treebuilder can now handle numeric attributes in
Leonard Richardson
2013-05-31
Create a new lxml parser object for every new parsing strategy.
Leonard Richardson
2013-05-20
The default XML formatter will now replace ampersands even if they appear to ...
Leonard Richardson
2013-05-20
The .next_element attribute used during parsing was confusingly similar to th...
Leonard Richardson
2013-05-20
Fixed another bug by which the html5lib tree builder could create a
Leonard Richardson
2013-05-09
Changed lxml.feed() to handle the eventuality that it may be given a bytestring.
Leonard Richardson
2013-05-09
Added a diagnostic function for randomly generating a simple, invalid HTML do...
Leonard Richardson
2013-05-07
Now that lxml's segfault on invalid doctype has been fixed, fix a
Leonard Richardson
2012-10-11
Fix a bug in the lxml treebuilder which crashed when a tag included
Leonard Richardson
2012-09-28
Fixed package name.
Leonard Richardson
2012-08-21
We don't need a special insertComment method, we just need to make Element.ap...
Leonard Richardson
[next]