Tajo / TAJO-1403

Improve 'Simple Query' with only partition columns and constant values

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.11.0
    • Component/s: None
    • Labels:
      None

      Description

      Tajo responds very quickly to a simple query (https://cwiki.apache.org/confluence/display/TAJO/Simple+Query+and+Forwarded+Query) such as the following:

      select * from t1 limit 10;
      

      However, in many cases, tables have partitions.

      create external table t1(id int) using csv with ('csvfile.delimiter'='|') partition by column(dt text) location '/data';
      select * from t1 where dt='2015-03-15' limit 10;
      

      If every predicate in the WHERE clause is an 'EQUAL' comparison between a partition column and a constant value, I think Tajo can handle these cases very quickly as well.
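
      The condition above can be sketched roughly as follows. This is only an illustration of the proposed check, not Tajo's planner code: the classes `Column`, `Const`, `Equals`, `And` and the function `qualifies_as_simple_query` are hypothetical names, and it assumes the WHERE clause is a conjunction (AND) of predicates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    name: str


@dataclass(frozen=True)
class Const:
    value: object


@dataclass(frozen=True)
class Equals:
    left: object   # Column or Const
    right: object  # Column or Const


@dataclass(frozen=True)
class And:
    left: object
    right: object


def conjuncts(pred):
    """Flatten nested AND nodes into a flat list of leaf predicates."""
    if isinstance(pred, And):
        return conjuncts(pred.left) + conjuncts(pred.right)
    return [pred]


def is_partition_equality(pred, partition_columns):
    """True if pred is `<partition column> = <constant>` (either operand order)."""
    if not isinstance(pred, Equals):
        return False
    sides = (pred.left, pred.right)
    cols = [s for s in sides if isinstance(s, Column)]
    consts = [s for s in sides if isinstance(s, Const)]
    return (len(cols) == 1 and len(consts) == 1
            and cols[0].name in partition_columns)


def qualifies_as_simple_query(where, partition_columns):
    """Hypothetical eligibility test for the fast path described in this issue.

    Assumption: a query with no WHERE clause (where is None) already
    counts as a simple query, as in `select * from t1 limit 10`.
    """
    if where is None:
        return True
    return all(is_partition_equality(p, partition_columns)
               for p in conjuncts(where))


# where dt = '2015-03-15' on a table partitioned by dt
on_partition = Equals(Column("dt"), Const("2015-03-15"))
# where id = 1 filters on a data column, so rows would have to be scanned
on_data = Equals(Column("id"), Const(1))

print(qualifies_as_simple_query(on_partition, {"dt"}))
print(qualifies_as_simple_query(And(on_partition, on_data), {"dt"}))
```

      Under these assumptions the first call reports the query as eligible and the second does not, because `id` is not a partition column.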

      This kind of query is very popular among DevOps users and in simple ETL apps.

        Attachments

        1. TAJO-1403.patch
          12 kB
          Dongjoon Hyun
        2. TAJO-1403_jihoon.patch
          13 kB
          Jihoon Son

            People

            • Assignee:
              Dongjoon Hyun
              Reporter:
              Dongjoon Hyun
            • Votes:
              0
              Watchers:
              4