Beam / BEAM-6697

ParquetIO performance test is failing on the GCS filesystem

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: P0
    • Resolution: Fixed
    • None
    • 2.11.0
    • None

    Description

      Relevant failure logs: 

      Caused by: java.lang.RuntimeException: org.apache.beam.sdk.io.parquet.ParquetIO$ReadFiles$BeamParquetInputFile@2de8303e is not a Parquet file (too small length: -1)
          	at org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:514)
          	at org.apache.parquet.hadoop.ParquetFileReader.<init>(ParquetFileReader.java:689)
          	at org.apache.parquet.hadoop.ParquetFileReader.open(ParquetFileReader.java:595)
          	at org.apache.parquet.hadoop.ParquetReader.initReader(ParquetReader.java:152)
          	at org.apache.parquet.hadoop.ParquetReader.read(ParquetReader.java:135)
          	at org.apache.beam.sdk.io.parquet.ParquetIO$ReadFiles$ReadFn.processElement(ParquetIO.java:221)
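
      For context on the message above: a Parquet file begins and ends with the 4-byte magic "PAR1", and a 4-byte footer length sits just before the trailing magic, so any valid file is at least 12 bytes long. A reported length of -1 can never pass that check, whatever the file actually contains. The following is a minimal sketch of such a length check, written to mirror the wording of the exception. It is not the Parquet library's source; the class and method names (ParquetLengthCheck, requirePlausibleLength, and so on) are hypothetical and exist only for illustration. It does not attempt to explain why the length was reported as -1 on GCS.

```java
import java.nio.charset.StandardCharsets;

public class ParquetLengthCheck {
    // Parquet files start and end with the 4-byte magic "PAR1".
    static final byte[] MAGIC = "PAR1".getBytes(StandardCharsets.US_ASCII);
    // The footer length is stored as a 4-byte integer before the trailing magic.
    static final int FOOTER_LENGTH_SIZE = 4;

    // Smallest length a well-formed Parquet file can have: magic + footer length + magic.
    static long minimumFileLength() {
        return MAGIC.length + FOOTER_LENGTH_SIZE + MAGIC.length;
    }

    // True when the reported length is large enough to hold both magics and the footer length.
    static boolean isPlausibleLength(long fileLen) {
        return fileLen >= minimumFileLength();
    }

    // Hypothetical helper: rejects inputs whose reported length is too small,
    // using a message shaped like the one in the failure log.
    static void requirePlausibleLength(String name, long fileLen) {
        if (!isPlausibleLength(fileLen)) {
            throw new RuntimeException(
                name + " is not a Parquet file (too small length: " + fileLen + ")");
        }
    }

    public static void main(String[] args) {
        try {
            // A length of -1 stands in for "size unknown", as seen in the log.
            requirePlausibleLength("example-input", -1L);
        } catch (RuntimeException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

      Under these assumptions, the failure is triggered by the length the input file wrapper reports, before any file contents are read, so the place to look is how that length is obtained for GCS-backed inputs.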

       

      Full logs can be found here: https://builds.apache.org/view/A-D/view/Beam/view/PerformanceTests/job/beam_PerformanceTests_ParquetIOIT/

       

       

       


            People

              Assignee: Chamikara Madhusanka Jayalath (chamikara)
              Reporter: Lukasz Gajowy (ŁukaszG)
              Votes: 0
              Watchers: 4


                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h 10m