  1. Hive
  2. HIVE-20279

HiveContextAwareRecordReader slows down Druid Scan queries.

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.0.0-alpha-1
    • Component/s: None
    • Labels: None

    Description

      HiveContextAwareRecordReader adds a lot of overhead to Druid Scan queries.
      See the attached flame graph.
      It looks like the operations that check for the existence of a footer/header buffer take most of the time. For Druid and other storage handlers that do not have a footer buffer, we should skip the existence check, at least for storage handlers.
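
The idea in the description can be illustrated with a minimal sketch. This is not the attached patch and not Hive's actual code: `SkipHeaderFooterSketch`, `SketchReader`, `usesStorageHandler`, `headerCount`, and `checksPerformed` are all hypothetical names chosen for illustration. The sketch assumes the reader can learn at construction time whether the table is backed by a storage handler, and decides once whether the per-row header/footer check is needed at all.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class SkipHeaderFooterSketch {

    /** Hypothetical reader wrapper; not Hive's real class or API. */
    static final class SketchReader {
        private final Iterator<String> rows;
        private final boolean needsHeaderFooterCheck;
        private int headerRowsLeft;
        int checksPerformed = 0; // counts how often the per-row check ran

        SketchReader(List<String> rows, boolean usesStorageHandler, int headerCount) {
            this.rows = rows.iterator();
            this.headerRowsLeft = headerCount;
            // Decided once up front: storage-handler tables (assumed here to
            // have no header/footer) never enter the per-row check.
            this.needsHeaderFooterCheck = !usesStorageHandler && headerCount > 0;
        }

        /** Returns the next data row, or null when the input is exhausted. */
        String next() {
            if (needsHeaderFooterCheck) {
                checksPerformed++;
                while (headerRowsLeft > 0 && rows.hasNext()) {
                    rows.next(); // discard a header row
                    headerRowsLeft--;
                }
            }
            return rows.hasNext() ? rows.next() : null;
        }
    }

    public static void main(String[] args) {
        // A storage-handler-backed table (e.g. Druid) takes the fast path.
        SketchReader druid = new SketchReader(Arrays.asList("r1", "r2"), true, 0);
        while (druid.next() != null) {
            // drain all rows
        }
        System.out.println("checks for storage-handler table: " + druid.checksPerformed);
    }
}
```

The design choice is to hoist the decision out of the per-row path: the flag is computed once, so a scan over a storage-handler table pays only a single boolean test per row.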

      Attachments

        1. scan2.svg
          110 kB
          Nishant Bangarwa
        2. HIVE-20279.patch
          3 kB
          Nishant Bangarwa
        3. HIVE-20279.2.patch
          1 kB
          Nishant Bangarwa
        4. HIVE-20279.1.patch
          3 kB
          Nishant Bangarwa

          People

            Assignee: Nishant Bangarwa (nishantbangarwa)
            Reporter: Nishant Bangarwa (nishantbangarwa)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated:
              Resolved: