Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
Impala 4.2.0
-
None
-
ghx-label-2
Description
In the docker-based tests on Redhat 8 / Ubuntu 20, the ExprTest.Utf8MaskTest fails:
/home/impdev/Impala/be/src/exprs/expr-test.cc:369 Value of: GetValue(expr, ColumnType(TYPE_STRING)) Actual: "xxxx \xC3\xA1\xC3\xA4\xC3\xA8\xC3\xBC XXXX \xC3\x81\xC3\x84\xC3\x88\xC3\x9C" Expected: expected_result Which is: "xxxx xxxx XXXX XXXX" mask('abcd ABCD ')
These come with the C.UTF-8 locale installed. This error goes away if I change bin/bootstrap_system.sh to install langpacks-us (Centos) or language-pack-en (Ubuntu), which installs the en_US.UTF-8 locale.
This might be related to this code: https://github.com/apache/impala/blob/master/be/src/exprs/mask-functions-ir.cc#L150
Installing the language packs is easy, but I'm not sure if users would have those installed.
Attachments
Issue Links
- is related to
-
IMPALA-11519 Document about UTF-8 support requirements
- Resolved