Hive
  1. Hive
  2. HIVE-2068

Speed up query "select xx,xx from xxx LIMIT xxx" if no filtering or aggregation

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.8.0
    • Component/s: Query Processor
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Currently, "select xx,xx from xxx where ...(only partition conditions) LIMIT xxx" will start a MapReduce job with input to be the whole table or partition. The latency can be huge if the table or partition is big. We could reduce number of input files to speed up the queries.

      1. HIVE-2068.6.patch
        79 kB
        Siying Dong
      2. HIVE-2068.5.patch
        79 kB
        Siying Dong
      3. HIVE-2068.4.patch
        80 kB
        Siying Dong
      4. HIVE-2068.3.patch
        78 kB
        Siying Dong
      5. HIVE-2068.2.patch
        77 kB
        Siying Dong
      6. HIVE-2068.1.patch
        72 kB
        Siying Dong

        Activity

        No work has yet been logged on this issue.

          People

          • Assignee:
            Siying Dong
            Reporter:
            Siying Dong
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development