If the charset-normalizer Python module

(https://pypi.org/project/charset-normalizer/) is installed, Beautiful Soup will use it to detect the character sets of incoming documents. This is also the module used by newer versions of the Requests library. For the sake of backwards compatibility, chardet and cchardet both take precedence if installed. [bug=1955346]
author: Leonard Richardson <leonardr@segfault.org> 2021-12-19 13:56:48 -0500
committer: Leonard Richardson <leonardr@segfault.org> 2021-12-19 13:56:48 -0500
commit: 497bf714d07e491bc811abd64e5fe3391564998d (patch)
tree: fdc945b11e121fb21b8d65284d0db5a9ebc43d30 /doc
parent: 854fd52ad616d8e9c0860bba2eb4ddd93eb2dc79 (diff)
1 files changed, 4 insertions, 4 deletions
diff --git a/doc/source/index.rst b/doc/source/index.rst
index d81fccd..66bd03e 100644
--- a/doc/source/index.rst
+++ b/doc/source/index.rst
@@ -2676,10 +2676,10 @@ become Unicode::
  # 'utf-8'
 
 Unicode, Dammit's guesses will get a lot more accurate if you install
-the ``chardet`` or ``cchardet`` Python libraries. The more data you
-give Unicode, Dammit, the more accurately it will guess. If you have
-your own suspicions as to what the encoding might be, you can pass
-them in as a list::
+one of these Python libraries: ``charset-normalizer``, ``chardet``, or
+``cchardet``. The more data you give Unicode, Dammit, the more
+accurately it will guess. If you have your own suspicions as to what
+the encoding might be, you can pass them in as a list::
 
  dammit = UnicodeDammit("Sacr\xe9 bleu!", ["latin-1", "iso-8859-1"])
  print(dammit.unicode_markup)
author	Leonard Richardson <leonardr@segfault.org>	2021-12-19 13:56:48 -0500
committer	Leonard Richardson <leonardr@segfault.org>	2021-12-19 13:56:48 -0500
commit	497bf714d07e491bc811abd64e5fe3391564998d (patch)
tree	fdc945b11e121fb21b8d65284d0db5a9ebc43d30 /doc
parent	854fd52ad616d8e9c0860bba2eb4ddd93eb2dc79 (diff)