Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-1398

[Python] No support reading columns of type decimal(19,4)

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.6.0
    • Fix Version/s: 0.8.0
    • Component/s: Python
    • Labels:
    • Environment:
      Ubuntu 16.04, Python 3.6

      Description

      I have a localy saved parquet database created in spark from querrying an SQL database. When I run:

      import pyarrow.parquet as pq
      
      path = "path/to/parquet/dataset"
      dataset = pq.ParquetDataset(path)
      dataset.read()
      

      an error indicating that there is no support for reading columns of type decimal(19,4). It's quite a common type used in SQL databases and I saw in the source code that there is an implementation for decimals. I'm stuck trying to figuring out a solution. Is there a walk around (conversion of decimals to integers during reading)?

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                cpcloud Phillip Cloud
                Reporter:
                LudwikB Ludwik Bielczynski
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: