Apache Arrow / ARROW-15544

[Go][Parquet] pqarrow.getOriginSchema error while decoding ARROW:schema


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 7.0.0
    • Fix Version/s: 8.0.0
    • Component/s: Go, Parquet
    • Environment: go1.17, python3.8

    Description

      Hello!

      This is my first time participating in the open source community as a junior developer, and I would like to thank you all for your hard work.

      While using the new pqarrow package in our project Metronlab/bow to read Parquet files previously written by pandas, I found that the function getOriginSchema returns an error if the base64-encoded "ARROW:schema" value ends with padding characters.
      This is caused by the use of base64.RawStdEncoding, which omits padding characters and therefore rejects padded input.
      Is there any reason for using the raw encoding instead of the standard one?

      Here is a repo with a test script to demonstrate the problem: antoinegelloz/arrowparquet

      Thank you in advance for your help,

      Antoine Gelloz

            People

              Assignee: Matthew Topol (zeroshade)
              Reporter: Antoine Gelloz (antoinegelloz)

                Time Tracking

                  Original Estimate: Not Specified
                  Remaining Estimate: 0h
                  Time Spent: 1h