Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-9218

[R] Numeric columns turn to string when imported in R

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Not A Problem
    • 0.17.1
    • None
    • R
    • None

    Description

      If I create a snappy-compressed parquet file from a DataFrame in spark or pandas, and then import this same file into R using:

      arrow::read_parquet(the_file, as_data_frame=TRUE)
      

      or

      arrow::read_parquet(the_file, as_data_frame=FALSE)

      Then the columns that were numeric/float before will load as strings.

      Loading the same file in Python through

      pd.read_parquet(the_file)
      

      Will correctly interpret numeric columns as numeric.

       

      Integer columns seem to be read as integers however.

      Attachments

        Activity

          People

            Unassigned Unassigned
            david.cortes David Cortes
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: