Details

    • Type: Wish
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.19.0
    • Fix Version/s: 0.19.0
    • Component/s: None
    • Labels:
      None
    • Environment:

      N/A

    • Hadoop Flags:
      Reviewed
    • Release Note:
      Introduced the Hive data warehouse, built on top of Hadoop, which structures Hadoop files as tables and partitions and lets users query this data in a SQL-like language through a command-line interface.

      Description

      Hive is a data warehouse built on top of flat files (stored primarily in HDFS). It includes:

      • Data Organization into Tables with logical and hash partitioning
      • A Metastore that stores metadata about tables, partitions, etc.
      • A SQL-like query language over object data stored in tables
      • DDL commands to define and load external data into tables
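
The first item above, logical and hash partitioning, can be pictured with a minimal sketch. It is not Hive's actual storage layout or hash function: the column names `ds` and `user_id`, the bucket count, and the use of CRC32 are all assumptions made for illustration.

```python
import zlib
from collections import defaultdict

NUM_BUCKETS = 4  # hypothetical bucket count, chosen for illustration


def bucket_for(key: str, num_buckets: int = NUM_BUCKETS) -> int:
    """Hash partitioning: map a key to a bucket using a stable hash (CRC32)."""
    return zlib.crc32(key.encode("utf-8")) % num_buckets


def place_row(row: dict) -> tuple:
    """Return (logical partition, hash bucket) for one row.

    The logical partition is the value of the 'ds' (date) column and the
    bucket is derived from the 'user_id' column. Both column names are
    assumptions of this sketch, not names taken from Hive.
    """
    return (row["ds"], bucket_for(row["user_id"]))


def organize(rows):
    """Group rows by their (partition, bucket) placement."""
    layout = defaultdict(list)
    for row in rows:
        layout[place_row(row)].append(row)
    return dict(layout)
```

Rows that share a date land in the same logical partition, and within it a stable hash of the key decides the bucket, so the same key always maps to the same bucket.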

      Hive queries are executed with Hadoop map-reduce as the execution engine. A query can run as a single map-reduce stage or as multiple stages. Hive has a native format for tables, but it can handle any data set (for example JSON, Thrift, or XML) using an IO library framework.
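
To make the single-stage versus multi-stage distinction concrete, here is a small in-memory sketch. It does not show how Hive plans or runs queries on Hadoop; `run_stage`, the mapper and reducer names, and the sample rows are hypothetical. The first stage behaves like a grouped count, and the second stage consumes the first stage's output, which is what makes the pipeline multi-stage.

```python
from collections import defaultdict


def run_stage(records, mapper, reducer):
    """Run one map-reduce stage in memory: map, shuffle by key, reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)  # shuffle: collect values per key
    output = []
    for key in sorted(groups):  # process keys in sorted order
        output.extend(reducer(key, groups[key]))
    return output


# Stage 1: count rows per page (a grouped count).
def map_page(row):
    yield row["page"], 1


def reduce_count(page, ones):
    yield {"page": page, "views": sum(ones)}


# Stage 2: regroup the stage 1 output by view count.
def map_by_views(row):
    yield row["views"], row["page"]


def reduce_pages(views, pages):
    yield {"views": views, "pages": sorted(pages)}


rows = [{"page": "home"}, {"page": "about"}, {"page": "home"}, {"page": "faq"}]
single_stage = run_stage(rows, map_page, reduce_count)
multi_stage = run_stage(single_stage, map_by_views, reduce_pages)
```

Here `single_stage` holds one count per page, and `multi_stage` lists the pages that share each count. A query needing only the per-page counts would stop after the first stage.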

      Hive uses ANTLR for query parsing and Apache JEXL for expression evaluation, and it may use Apache Derby as an embedded database for the Metastore. ANTLR has a BSD license and should be compatible with the Apache license.

      We are currently thinking of contributing to the 0.17 branch as a contrib project (since that is the version under which it will be tested internally), but we are looking for advice on the best release path.

        Attachments

        1. ant.log
          60 kB
          Ashish Thusoo
        2. hive.tgz
          7.79 MB
          Ashish Thusoo
        3. hive.tgz
          7.84 MB
          Ashish Thusoo
        4. hive.tgz
          7.84 MB
          Ashish Thusoo
        5. HiveTutorial.pdf
          67 kB
          Ashish Thusoo

            People

            • Assignee:
              athusoo Ashish Thusoo
              Reporter:
              jsensarma Joydeep Sen Sarma
            • Votes:
              4
              Watchers:
              29

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                1,080h
                Remaining:
                1,080h
                Logged:
                Not Specified