  1. Parquet
  2. PARQUET-1542

Merge multiple I/O operations into a single I/O when reading the footer

Details

    Description

      Currently, the readFooter method of class ParquetFileReader invokes converter.readParquetMetadata. Decoding the file metadata this way can trigger a large number of separate underlying I/O operations, which may significantly slow down Parquet read performance.

      A better approach is to read all the footer bytes in a single I/O, since the footer length is already known at that point, and pass the complete byte array down to the underlying metadata decoding.
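
The idea can be illustrated with a minimal sketch. It is not the actual parquet-mr change: the class `FooterReadSketch` and its methods are hypothetical names, and plain `java.io` stands in for Parquet's own input stream types. It relies only on the Parquet file layout, where the file ends with the serialized footer, a 4-byte little-endian footer length, and the `PAR1` magic. Once the length is known, the whole footer is fetched with one seek and one bulk read, and the decoder then works on an in-memory stream.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.RandomAccessFile;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of parquet-mr.
final class FooterReadSketch {
    private static final byte[] MAGIC = "PAR1".getBytes(StandardCharsets.US_ASCII);
    // File tail: 4-byte little-endian footer length followed by the 4-byte magic.
    private static final int TAIL_LENGTH = 4 + MAGIC.length;

    /** Returns the serialized footer, fetched with one seek and one bulk read request. */
    static byte[] readFooterBytes(RandomAccessFile file) throws IOException {
        long fileLength = file.length();
        // Smallest possible file: leading magic + empty footer + tail.
        if (fileLength < MAGIC.length + TAIL_LENGTH) {
            throw new IOException("File too small to be a Parquet file: " + fileLength);
        }

        // Read the fixed-size tail to learn the footer length.
        byte[] tail = new byte[TAIL_LENGTH];
        file.seek(fileLength - TAIL_LENGTH);
        file.readFully(tail);
        if (!Arrays.equals(Arrays.copyOfRange(tail, 4, TAIL_LENGTH), MAGIC)) {
            throw new IOException("Missing PAR1 magic at end of file");
        }
        int footerLength = ByteBuffer.wrap(tail, 0, 4).order(ByteOrder.LITTLE_ENDIAN).getInt();

        long footerStart = fileLength - TAIL_LENGTH - footerLength;
        if (footerLength < 0 || footerStart < MAGIC.length) {
            throw new IOException("Corrupt footer length: " + footerLength);
        }

        // Fetch the entire footer at once instead of letting the decoder
        // pull it from the file in many small reads.
        byte[] footer = new byte[footerLength];
        file.seek(footerStart);
        file.readFully(footer);
        return footer;
    }

    /** Wraps the footer bytes so a metadata decoder can consume them from memory. */
    static InputStream footerStream(RandomAccessFile file) throws IOException {
        return new ByteArrayInputStream(readFooterBytes(file));
    }
}
```

In this sketch, a metadata decoder would be handed the stream returned by `footerStream` rather than the file stream itself, so every small read it performs during decoding is served from memory.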

       

      We tested this in our environment: the change can bring a 50 percent improvement in some cases.

       

            People

              gwang3 Wang, Gang