PT-2026-22989 · Unknown · Lxml Html Clean

Uug4Na

CVE-2026-28348

Published

2026-02-27

Updated

2026-03-11

CVSS v2.0

6.4

Medium

Vector

AV:N/AC:L/Au:N/C:P/I:P/A:N

Name of the Vulnerable Software and Affected Versions lxml html clean versions prior to 0.4.4

Description The has sneaky javascript() method in lxml html clean incorrectly strips backslashes before checking for dangerous CSS keywords. This allows CSS Unicode escape sequences to bypass the @import and expression() filters, potentially enabling external CSS loading or cross-site scripting (XSS) in older browsers. The root cause is a faulty backslash stripping operation within the clean.py file, around line 594. Specifically, the line style = style.replace('', '') transforms payloads like @69mport into @69mport, which bypasses the blacklist. Modern browsers interpret 69 as the character 'i' according to CSS specifications, effectively allowing the @import statement to execute. This bypass also affects the detection of expression(), posing a risk in older versions of Internet Explorer. A proof of concept demonstrates that a normally blocked @import statement can be successfully executed using the Unicode escape bypass. This could lead to data exfiltration through attribute selectors, UI redressing, and phishing attacks. In older browsers, it could enable full XSS via the expression() function.

Recommendations Versions prior to 0.4.4 should be updated to version 0.4.4 or later.

Exploit

Fix

Improper Encoding or Escaping of Output

Found an issue in the description? Have something to add? Feel free to write us 👾

dbugs@ptsecurity.com

Weakness Enumeration

CWE-116

Related Identifiers

BDU:2026-07380

CVE-2026-28348

GHSA-HW26-MMPG-FQFG

OPENSUSE-SU-2026:10322-1

OPENSUSE-SU-2026:20345-1

PYSEC-2026-2201

Affected Products

Lxml Html Clean

PT-2026-22989 · Unknown · Lxml Html Clean

CVE-2026-28348

Weakness Enumeration

Related Identifiers

Affected Products

References · 23