Source: html5-parser
Section: python
Priority: optional
Maintainer: Html5-Parser maintainer team
Uploaders: YOKOTA Hiroshi
Build-Depends: debhelper-compat (= 13),
 dh-python,
 python3-all-dev,
 python3-bs4,
 python3-chardet,
 python3-setuptools,
 libxml2-dev,
 python3-lxml,
 pkgconf
Standards-Version: 4.7.2
Homepage: https://github.com/kovidgoyal/html5-parser
Vcs-Browser: https://github.com/debian-calibre/html5-parser
Vcs-Git: https://github.com/debian-calibre/html5-parser.git
Rules-Requires-Root: no

Package: python3-html5-parser
Architecture: any
Depends: ${python3:Depends}, ${misc:Depends}, ${shlibs:Depends}
Description: fast, standards-compliant, C-based HTML 5 parser for Python
 A fast implementation of the HTML 5 parsing spec for Python. Parsing is
 done in C using a variant of the gumbo parser. The gumbo parse tree is
 then transformed into an lxml tree, also in C, yielding parse times that
 can be as low as a thirtieth of those of html5lib, a speedup of up to 30x.
 This differs, for instance, from the gumbo Python bindings, where the
 initial parsing is done in C but the transformation into the final
 tree is done in Python.