Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Not A Problem
-
0.17.1
-
None
-
None
Description
If I create a snappy-compressed parquet file from a DataFrame in spark or pandas, and then import this same file into R using:
arrow::read_parquet(the_file, as_data_frame=TRUE)
or
arrow::read_parquet(the_file, as_data_frame=FALSE)
Then the columns that were numeric/float before will load as strings.
Loading the same file in Python through
pd.read_parquet(the_file)
Will correctly interpret numeric columns as numeric.
Integer columns seem to be read as integers however.