[ARROW-9218] [R] Numeric columns turn to string when imported in R - ASF JIRA

If I create a snappy-compressed Parquet file from a DataFrame in Spark or pandas, and then import that same file into R using either of:

arrow::read_parquet(the_file, as_data_frame=TRUE)

arrow::read_parquet(the_file, as_data_frame=FALSE)

then the columns that were numeric/float in the original DataFrame load as strings.

Loading the same file in Python through

pd.read_parquet(the_file)

will correctly interpret the numeric columns as numeric.

Integer columns, however, seem to be read correctly as integers.