Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-1108

Impala should check the number of opened files/partition during insert

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • Impala 1.4
    • None
    • Frontend

    Description

      For insert, when Impala is inserting into a huge number of partition, Impala might be opening too many files. HDFS will return an error, but the error is incomprehensible as "Error(12): Cannot allocate memory".

      We can do better to improve the error message. Here are two suggestions:

      1. During planning, if there's stats, we know how many partitions are being inserted per Impalad. Based on that, we can determine if we'll be opening too many files. Either return an error or a warning message.

      2. During query execution, keep track of the number of files opened for read and write. If we're opening too many files for write, abort the query and returns a proper error message.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              alan@cloudera.com Alan Choi
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: