HIVE-14270 (Hive; sub-task of HIVE-14269 Performance optimizations for data on S3)

Write temporary data to HDFS when doing inserts on tables located on S3


Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.3.0
    • Component/s: None
    • Labels: None

    Description

      Currently, when running INSERT statements on tables located on S3, Hive also writes its temporary (intermediate) files to S3 and reads them back from there.

      If HDFS is still the default filesystem for Hive, we can keep these temporary files on HDFS so that the queries run faster.
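
      The decision described above can be sketched roughly as follows. This is a minimal illustration and not the code from the attached patches: the function name, its parameters, the set of S3 schemes, and the `.staging-` directory layout are all hypothetical and chosen only to show the idea.

```python
from urllib.parse import urlparse

# Hypothetical set of URI schemes treated as S3-backed (an assumption,
# not taken from the Hive patch).
S3_SCHEMES = {"s3", "s3a", "s3n"}


def choose_staging_dir(table_location, default_fs, hdfs_scratch_dir, query_id):
    """Pick a directory for the intermediate files of an INSERT.

    table_location   -- URI of the destination table, e.g. "s3a://bucket/warehouse/t"
    default_fs       -- URI of the default filesystem, e.g. "hdfs://nn:8020"
    hdfs_scratch_dir -- absolute scratch path on the default filesystem (hypothetical)
    query_id         -- identifier that keeps concurrent queries apart (hypothetical)
    """
    table_scheme = urlparse(table_location).scheme.lower()
    default_scheme = urlparse(default_fs).scheme.lower()

    if table_scheme in S3_SCHEMES and default_scheme == "hdfs":
        # The table lives on S3 but HDFS is the default filesystem:
        # keep the intermediate files on HDFS.
        return default_fs.rstrip("/") + "/" + hdfs_scratch_dir.strip("/") + "/" + query_id

    # Otherwise place the intermediate files alongside the table
    # (this layout is an assumption made for the sketch).
    return table_location.rstrip("/") + "/.staging-" + query_id
```

      With these assumptions, an S3 table on a cluster whose default filesystem is HDFS gets an HDFS staging path, while an HDFS table, or an S3 table on a cluster without HDFS as the default, keeps its staging directory next to the table. Only the final results then need to be written to S3.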

      Attachments

        1. HIVE-14270.1.patch
          8 kB
          Sergio Peña
        2. HIVE-14270.2.patch
          19 kB
          Sergio Peña
        3. HIVE-14270.3.patch
          20 kB
          Sergio Peña
        4. HIVE-14270.4.patch
          15 kB
          Sergio Peña
        5. HIVE-14270.5.patch
          17 kB
          Sergio Peña
        6. HIVE-14270.6.patch
          16 kB
          Sergio Peña


          People

            Assignee: Sergio Peña
            Reporter: Sergio Peña
            Votes: 0
            Watchers: 12
