An issue was discovered in lxml before 4.2.5. The lxml.html.clean module (lxml/html/clean.py) does not remove javascript: URLs that are obfuscated with escaping, which allows a remote attacker to conduct XSS attacks, as demonstrated by "j a v a s c r i p t:" in Internet Explorer. This issue is similar to CVE-2014-3146.
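
The class of bypass described above can be illustrated with a minimal sketch of a scheme check that normalizes a URL before testing it. This is not lxml's code: the helper name `is_script_url` and both regular expressions are hypothetical, and the list of script schemes is an assumption chosen for illustration. The idea is to decode escapes and drop whitespace and control characters first, so that a spaced-out or percent-encoded scheme cannot slip past a plain prefix match.

```python
import re
from urllib.parse import unquote_plus

# Hypothetical helpers for illustration only; not taken from lxml.
# Whitespace and ASCII control characters that a lenient browser may ignore.
_WS_AND_CONTROL = re.compile(r"[\s\x00-\x1f]+")
# Assumed set of script-bearing schemes, matched case-insensitively.
_SCRIPT_SCHEME = re.compile(r"^(?:javascript|jscript|livescript|vbscript):", re.I)


def is_script_url(url: str) -> bool:
    """Return True if the URL resolves to a script scheme after normalization."""
    # Decode percent-escapes and '+' so "%0A" or "%20" cannot hide the scheme.
    decoded = unquote_plus(url)
    # Remove characters that could be used to split the scheme name.
    collapsed = _WS_AND_CONTROL.sub("", decoded)
    return bool(_SCRIPT_SCHEME.match(collapsed))


if __name__ == "__main__":
    for candidate in (
        "j a v a s c r i p t:alert(1)",
        "java%0Ascript:alert(1)",
        "https://example.com/?q=javascript:",
    ):
        print(repr(candidate), is_script_url(candidate))
```

A check of this kind is only a sketch of the normalization step. Upgrading to lxml 4.2.5 or later remains the actual remedy for this issue.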
https://usn.ubuntu.com/3841-2/
https://usn.ubuntu.com/3841-1/
https://lists.debian.org/debian-lts-announce/2020/11/msg00044.html
https://lists.debian.org/debian-lts-announce/2018/12/msg00001.html
https://github.com/lxml/lxml/commit/6be1d081b49c97cfd7b3fbd934a193b668629109