Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-12007

[C++] Loading parquet file returns "Invalid UTF8 payload" error

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.0.0
    • 5.0.0
    • C++

    Description

      While loading a specific parquet file (arrow::read_parquet(file = file)), the following error is returned:

      Error in parquet__arrow_FileReader_ReadTable1(self) :
      Invalid: Invalid UTF8 payload

      I managed to load several other parquet files, it is just this specific file due to which I presume it may be due to some syntax used in this file. As there any known bug in terms of handling the UTF8 encoding?

      Attachments

        Issue Links

          Activity

            People

              hideaki Hideaki Hayashi
              ebotman Emiel Botman
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 10m
                  1h 10m