Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-2121

Input Sampling By Splits

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.8.0
    • Query Processor
    • None
    • Reviewed
    • This patch adds support for the 'TABLESAMPLE(x PERCENT)' clause.

    Description

      We need a better input sampling to serve at least two purposes:
      1. test their queries against a smaller data set
      2. understand more about how the data look like without scanning the whole table.
      A simple function that gives a subset splits will help in those cases. It doesn't have to be strict sampling.

      Attachments

        1. HIVE-2121.8.patch
          754 kB
          Siying Dong
        2. HIVE-2121.7.patch
          266 kB
          Siying Dong
        3. HIVE-2121.6.patch
          266 kB
          Siying Dong
        4. HIVE-2121.5.patch
          218 kB
          Siying Dong
        5. HIVE-2121.4.patch
          209 kB
          Siying Dong
        6. HIVE-2121.3.patch
          210 kB
          Siying Dong
        7. HIVE-2121.2.patch
          208 kB
          Siying Dong
        8. HIVE-2121.1.patch
          37 kB
          Siying Dong

        Activity

          People

            sdong Siying Dong
            sdong Siying Dong
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: