Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-1142

Avoid leaking Hadoop API to downstream libraries

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.9.0
    • Fix Version/s: 1.10.0
    • Component/s: parquet-mr
    • Labels:
      None

      Description

      Parquet currently leaks the Hadoop API by requiring callers to pass Path and Configuration instances, and by using Hadoop codecs. InputFile and SeekableInputStream add alternatives to Hadoop classes in some parts of the read path, but this needs to be extended to the write path and to avoid passing options through Configuration.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                rdblue Ryan Blue
                Reporter:
                rdblue Ryan Blue
              • Votes:
                1 Vote for this issue
                Watchers:
                6 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: