Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-2068

Speed up query "select xx,xx from xxx LIMIT xxx" if no filtering or aggregation

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.8.0
    • Query Processor
    • None
    • Reviewed

    Description

      Currently, "select xx,xx from xxx where ...(only partition conditions) LIMIT xxx" will start a MapReduce job with input to be the whole table or partition. The latency can be huge if the table or partition is big. We could reduce number of input files to speed up the queries.

      Attachments

        1. HIVE-2068.6.patch
          79 kB
          Siying Dong
        2. HIVE-2068.5.patch
          79 kB
          Siying Dong
        3. HIVE-2068.4.patch
          80 kB
          Siying Dong
        4. HIVE-2068.3.patch
          78 kB
          Siying Dong
        5. HIVE-2068.2.patch
          77 kB
          Siying Dong
        6. HIVE-2068.1.patch
          72 kB
          Siying Dong

        Activity

          People

            sdong Siying Dong
            sdong Siying Dong
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: