Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-2324

Enhancing local mode execution with non-HDFS related queries

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      Local mode execution are decided based on the input size. There are queries that do not involve data resides in HDFS at all. We should enhance local mode execution with these types of queries:

      1) metadata only queries (HIVE-178, HIVE-1003). eg.,

        SELECT max(ds) From T WHERE ts = 'some_partition';  -- both ds and ts are partition columns
      

      2) DUAL table (HIVE-1558):

        SELECT MYUDF('constant1', 'constant2') FROM DUAL;
        SELECT MYUDAF(...) FROM ( SELECT 'const1' col1, 'const2' col2 FROM DUAL ) A GROUP BY col1;
      

        Activity

        Hide
        jvs John Sichi added a comment -

        Currently the HBase handler returns 0 from its splits' getLength(), because that's what the underlying HBase TableSplit implementation does, so it's in this category too.

        Show
        jvs John Sichi added a comment - Currently the HBase handler returns 0 from its splits' getLength(), because that's what the underlying HBase TableSplit implementation does, so it's in this category too.

          People

          • Assignee:
            Unassigned
            Reporter:
            nzhang Ning Zhang
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:

              Development