Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-577

Batch hdfsWrite calls for hdfs-text-writer

    XMLWordPrintableJSON

Details

    • Task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • Impala 1.1.1
    • Impala 1.2
    • None
    • None

    Description

      We should look into the right batch size to send to hdfswrite. We previously called it once per row, which lead to very poor performance. Now we batch based on the input batch size.

      This is not effective for partitioned tables where the input batch is split. We should also see if this is the best size to pass to hdfs in general.

      Attachments

        Activity

          People

            nong_impala_60e1 Nong Li
            nong_impala_60e1 Nong Li
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: