Skip to content

AssertionError: 'iso-8859-2' == 'windows-1252' during encoding test #433

Open
@awilfox

Description

@awilfox

Python 3.6.9 on Linux kernel 5.4.5 with musl libc 1.2.0.

    def test_encoding():
        for filename in get_data_files("encoding"):
            tests = _TestData(filename, b"data", encoding=None)
            for test in tests:
>               runParserEncodingTest(test[b'data'], test[b'encoding'])

html5lib/tests/test_encoding.py:102: 
_ _ _ _
data = b'<!DOCTYPE HTML>\n<script>document.write(\'<meta charset="ISO-8859-\' + \'2">\')</script>', encoding = 'iso-8859-2'

    def runParserEncodingTest(data, encoding):
        p = HTMLParser()
        assert p.documentEncoding is None
        p.parse(data, useChardet=False)
        encoding = encoding.lower().decode("ascii")
    
>       assert encoding == p.documentEncoding, errorMessage(data, encoding, p.documentEncoding)
E       AssertionError: Input:
E         b'<!DOCTYPE HTML>\n<script>document.write(\'<meta charset="ISO-8859-\' + \'2">\')</script>'
E         Expected:
E         'iso-8859-2'
E         Recieved
E         'windows-1252'
E         
E       assert 'iso-8859-2' == 'windows-1252'
E         - iso-8859-2
E         + windows-1252

html5lib/tests/test_encoding.py:84: AssertionError
==== short test summary info ====
FAILED html5lib/tests/test_encoding.py::test_encoding - AssertionError: Input:
==== 1 failed, 15715 passed, 14980 skipped, 666 xfailed in 137.44s (0:02:17) ====

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions