Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-1422

[C++] Use Arrow IO interfaces natively rather than current parquet:: wrappers

    XMLWordPrintableJSON

Details

    Description

      We are beginning to do some work on asynchronous IO in Arrow and it would be great to be able to leverage this in the Parquet core internals.

      I am proposing to remove the Parquet-specific virtual file interfaces in

      https://github.com/apache/arrow/blob/master/cpp/src/parquet/util/memory.h#L221

      and instead rely directly on the Arrow ones in arrow::io. In addition to reducing the amount of code we have to maintain, we will also be able to improve performance of Parquet by utilizing common utilities for managing asynchronous / background IO

      cc mdeepak xhochy

      Attachments

        Issue Links

          Activity

            People

              wesm Wes McKinney
              wesm Wes McKinney
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 9.5h
                  9.5h