  1. Hive
  2. HIVE-17718 Hive on Spark Debugging Improvements
  3. HIVE-20512

Improve record and memory usage logging in SparkRecordHandler

Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.0.0-alpha-1
    • Component/s: Spark
    • Labels: None

    Description

      We currently log memory usage and the number of records processed in Spark tasks, but we should improve how frequently this information is logged. The current code is:

      private long getNextLogThreshold(long currentThreshold) {
        // A very simple counter to keep track of number of rows processed by the
        // reducer. It dumps every 1 million times, and quickly before that
        if (currentThreshold >= 1000000) {
          return currentThreshold + 1000000;
        }
        return 10 * currentThreshold;
      }
      

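      The issue is that the threshold grows by a factor of 10 at each step until it reaches 1 million, and by 1 million records after that, so after the first few log lines a task has to process a very large number of records before the next one is triggered. A slow task can therefore go a long time without logging anything.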
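
To make the growth concrete, the following sketch lists the first few thresholds the method above produces. The starting value of 1 and the class and helper names are assumptions for illustration only; they are not taken from SparkRecordHandler.

```java
import java.util.ArrayList;
import java.util.List;

class LogThresholdDemo {
    // Same logic as the snippet quoted in this issue.
    static long getNextLogThreshold(long currentThreshold) {
        if (currentThreshold >= 1000000) {
            return currentThreshold + 1000000;
        }
        return 10 * currentThreshold;
    }

    // Hypothetical helper: collects `count` successive thresholds,
    // beginning with `start` (an assumed initial threshold).
    static List<Long> thresholds(long start, int count) {
        List<Long> out = new ArrayList<>();
        long t = start;
        for (int i = 0; i < count; i++) {
            out.add(t);
            t = getNextLogThreshold(t);
        }
        return out;
    }

    public static void main(String[] args) {
        // prints [1, 10, 100, 1000, 10000, 100000, 1000000, 2000000, 3000000]
        System.out.println(thresholds(1, 9));
    }
}
```

Under that assumption, the gap between consecutive log lines is 900,000 records just before the 1 million mark and 1 million records from then on, regardless of how long those records take to process.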

      A better approach would be to log this info at a given interval. This would help in debugging tasks that are seemingly hung.
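
One way the interval-based idea could look is sketched below. This is not the attached patch; the class name, the injected clock, and the message format are all assumptions made for illustration. It checks elapsed time on every record and reports when a log line is due.

```java
import java.util.function.LongSupplier;

// Hypothetical helper, not part of Hive: decides when to log based on
// elapsed time instead of a record-count threshold.
class IntervalLogger {
    private final long intervalMillis;
    private final LongSupplier clock; // injected so the timing can be tested
    private long lastLogMillis;
    private long records;

    IntervalLogger(long intervalMillis, LongSupplier clock) {
        this.intervalMillis = intervalMillis;
        this.clock = clock;
        this.lastLogMillis = clock.getAsLong();
    }

    // Counts one record and returns true when at least intervalMillis
    // have passed since the last log line.
    boolean recordProcessed() {
        records++;
        long now = clock.getAsLong();
        if (now - lastLogMillis >= intervalMillis) {
            lastLogMillis = now;
            return true;
        }
        return false;
    }

    // Builds a status line with the record count and approximate heap usage.
    String statusLine() {
        Runtime rt = Runtime.getRuntime();
        long usedBytes = rt.totalMemory() - rt.freeMemory();
        return "processed " + records + " records; used memory: " + usedBytes + " bytes";
    }

    public static void main(String[] args) {
        IntervalLogger logger = new IntervalLogger(1000, System::currentTimeMillis);
        for (int i = 0; i < 5; i++) {
            if (logger.recordProcessed()) {
                System.out.println(logger.statusLine());
            }
        }
    }
}
```

A check made only when a record arrives stays silent if the task stops receiving records entirely. Running the log call on a background timer, for example with `java.util.concurrent.ScheduledExecutorService`, would also cover that case, at the cost of making the counters thread-safe.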

      Attachments

        1. HIVE-20512.1.patch
          3 kB
          Bharath Krishna
        2. HIVE-20512.2.patch
          8 kB
          Bharath Krishna
        3. HIVE-20512.3.patch
          9 kB
          Bharath Krishna
        4. HIVE-20512.4.patch
          10 kB
          Bharath Krishna
        5. HIVE-20512.5.patch
          10 kB
          Bharath Krishna
        6. HIVE-20512.6.patch
          10 kB
          Bharath Krishna
        7. HIVE-20512.7.patch
          10 kB
          Bharath Krishna
        8. HIVE-20512.8.patch
          10 kB
          Bharath Krishna
        9. HIVE-20512.9.patch
          10 kB
          Bharath Krishna
        10. HIVE-20512.91.patch
          10 kB
          Bharath Krishna
        11. HIVE-20512.92.patch
          10 kB
          Bharath Krishna

            People

              Assignee: Bharath Krishna (bharos92)
              Reporter: Sahil Takiar (stakiar)
              Votes: 0
              Watchers: 4

              Dates

                Created:
                Updated:
                Resolved: