Details

    • Type: Wish Wish
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: 0.19.0
    • Fix Version/s: 0.19.0
    • Component/s: None
    • Labels:
      None
    • Environment:

      N/A

    • Hadoop Flags:
      Reviewed
    • Release Note:
      Hide
      Introduced Hive Data Warehouse built on top of Hadoop that enables structuring Hadoop files as tables and partitions and allows users to query this data through a SQL like language using a command line interface.
      Show
      Introduced Hive Data Warehouse built on top of Hadoop that enables structuring Hadoop files as tables and partitions and allows users to query this data through a SQL like language using a command line interface.

      Description

      Hive is a data warehouse built on top of flat files (stored primarily in HDFS). It includes:

      • Data Organization into Tables with logical and hash partitioning
      • A Metastore to store metadata about Tables/Partitions etc
      • A SQL like query language over object data stored in Tables
      • DDL commands to define and load external data into tables

      Hive's query language is executed using Hadoop map-reduce as the execution engine. Queries can use either single stage or multi-stage map-reduce. Hive has a native format for tables - but can handle any data set (for example json/thrift/xml) using an IO library framework.

      Hive uses Antlr for query parsing, Apache JEXL for expression evaluation and may use Apache Derby as an embedded database for MetaStore. Antlr has a BSD license and should be compatible with Apache license.

      We are currently thinking of contributing to the 0.17 branch as a contrib project (since that is the version under which it will get tested internally) - but looking for advice on the best release path.

      1. HiveTutorial.pdf
        67 kB
        Ashish Thusoo
      2. hive.tgz
        7.84 MB
        Ashish Thusoo
      3. hive.tgz
        7.84 MB
        Ashish Thusoo
      4. hive.tgz
        7.79 MB
        Ashish Thusoo
      5. ant.log
        60 kB
        Ashish Thusoo

        Activity

          People

          • Assignee:
            Ashish Thusoo
            Reporter:
            Joydeep Sen Sarma
          • Votes:
            4 Vote for this issue
            Watchers:
            29 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Time Tracking

              Estimated:
              Original Estimate - 1,080h
              1,080h
              Remaining:
              Remaining Estimate - 1,080h
              1,080h
              Logged:
              Time Spent - Not Specified
              Not Specified

                Development