Hive / HIVE-2324

Enhancing local mode execution with non-HDFS related queries

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels: None

      Description

      Whether a query runs in local mode is currently decided by its input size. Some queries, however, do not involve data residing in HDFS at all. We should extend local mode execution to cover these types of queries:

      1) Metadata-only queries (HIVE-178, HIVE-1003), e.g.:

        SELECT max(ds) FROM T WHERE ts = 'some_partition';  -- both ds and ts are partition columns
      

      2) DUAL table (HIVE-1558):

        SELECT MYUDF('constant1', 'constant2') FROM DUAL;
        SELECT MYUDAF(...) FROM ( SELECT 'const1' col1, 'const2' col2 FROM DUAL ) A GROUP BY col1;
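
      For context, the size-based decision described above is driven by session settings along the following lines. This is a minimal sketch, not part of the original report: the property names are the standard Hive local-mode settings (their availability depends on the Hive version), and the threshold values are illustrative assumptions rather than recommendations.

      ```sql
      -- Let Hive pick local mode automatically when the input is small enough.
      SET hive.exec.mode.local.auto=true;
      -- Upper bound on total input size in bytes (illustrative value: 128 MB).
      SET hive.exec.mode.local.auto.inputbytes.max=134217728;
      -- Upper bound on the number of input files (illustrative value).
      SET hive.exec.mode.local.auto.input.files.max=4;

      -- Metadata-only query from 1); T, ds and ts are the names used in that example.
      SELECT max(ds) FROM T WHERE ts = 'some_partition';
      ```

      Under this proposal, queries such as the last one would presumably qualify for local mode regardless of the size thresholds, since they read no data from HDFS.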
      

        Activity

        John Sichi added a comment -

        Currently the HBase handler returns 0 from its splits' getLength(), because that's what the underlying HBase TableSplit implementation does, so it's in this category too.


          People

          • Assignee: Unassigned
          • Reporter: Ning Zhang
          • Votes: 0
          • Watchers: 1

            Dates

            • Created:
              Updated:
