Hive / HIVE-2068

Speed up query "select xx,xx from xxx LIMIT xxx" if no filtering or aggregation

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.8.0
    • Component/s: Query Processor
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Currently, a query of the form "select xx,xx from xxx where ...(only partition conditions) LIMIT xxx" starts a MapReduce job whose input is the whole table or partition. The latency can be very high if the table or partition is large. Because such a query does no filtering or aggregation, it only needs enough rows to satisfy the LIMIT, so we could reduce the number of input files to speed it up.
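
      The idea of reducing the number of input files can be illustrated with a minimal sketch. This is not the code from the attached patches; it only assumes that every row occupies at least some known number of bytes, so that a prefix of the files can be guaranteed to hold enough rows. The function name choose_input_files and the parameters assumed_row_bytes and max_files are hypothetical and used for illustration only.

```python
from typing import List, Optional, Tuple


def choose_input_files(
    files: List[Tuple[str, int]],  # (path, size in bytes); hypothetical input shape
    limit: int,                    # the LIMIT value of the query
    assumed_row_bytes: int,        # assumed upper bound on the size of one row
    max_files: int,                # hypothetical cap on how many files to sample
) -> Optional[List[str]]:
    """Pick a prefix of the files that should contain at least `limit` rows.

    Returns None when no small subset is enough, in which case the caller
    would fall back to reading the whole table or partition.
    """
    # If no row is larger than assumed_row_bytes, this many bytes of input
    # must contain at least `limit` rows.
    needed_bytes = limit * assumed_row_bytes

    chosen: List[str] = []
    total_bytes = 0
    for path, size in files:
        if total_bytes >= needed_bytes:
            break  # enough data selected already
        if len(chosen) == max_files:
            return None  # would need too many files; not worth sampling
        chosen.append(path)
        total_bytes += size

    if total_bytes < needed_bytes:
        return None  # even all files are not guaranteed to satisfy the LIMIT
    return chosen
```

      With three files of 100, 200 and 300 bytes, a LIMIT of 2 and an assumed row size of 100 bytes, the sketch selects only the first two files. When the cap on files is reached or the table is too small for the guarantee, it returns None and the query would run over the full input as before.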

        Attachments

        1. HIVE-2068.1.patch
          72 kB
          Siying Dong
        2. HIVE-2068.2.patch
          77 kB
          Siying Dong
        3. HIVE-2068.3.patch
          78 kB
          Siying Dong
        4. HIVE-2068.4.patch
          80 kB
          Siying Dong
        5. HIVE-2068.5.patch
          79 kB
          Siying Dong
        6. HIVE-2068.6.patch
          79 kB
          Siying Dong

            People

            • Assignee:
              sdong Siying Dong
            • Reporter:
              sdong Siying Dong