Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
0.15.0
-
I'm running R on Windows 10
Description
Hello.
I'm new to the arrow package in R and I'm having a trouble regarding special characters (Icelandic). I have a large data set and everything is fine until I write the file to disk and read it in again (i.e. I use write_parquet() and then read_parquet()). When I read the data back in to R special characters turn into question mark. I.e. Veitingastaðir becomes Veitingasta�ir.
This does not happen when I use .csv.
Is there anything I can do when I write the .parquet file to disk or when I read it in to prevent this?
Attachments
Issue Links
- is related to
-
ARROW-7288 [C++][R] read_parquet() freezes on Windows with Japanese locale
- Resolved
-
ARROW-9163 [C++] Add methods to StringArray, LargeStringArray, to validate whether its values are all UTF-8
- Resolved
- links to