Uploaded image for project: 'Sqoop'
  1. Sqoop
  2. SQOOP-1366

Propose to add Parquet support

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.4.4
    • Fix Version/s: 1.4.6
    • Component/s: tools
    • Labels:

      Description

      Parquet is a file format that stores data organized by column rather than by record. This JIRA proposes to add Parquet support for Sqoop 1. It will cover the use cases as follows:

      • Import data from database as Parquet files into HDFS.
      • Import data from database into Hive as Parquet file
      • Export Parquet files from HDFS to database.

        Attachments

        1. parquet_support_design.pdf
          114 kB
          Qian Xu

          Activity

            People

            • Assignee:
              stanleyxu2005 Qian Xu
              Reporter:
              stanleyxu2005 Qian Xu
            • Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: