[HIVE-2121] Input Sampling By Splits - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.8.0
Component/s: Query Processor
Labels: None
Hadoop Flags: Reviewed
Release Note:
This patch adds support for the 'TABLESAMPLE(x PERCENT)' clause.

Description

We need better input sampling to serve at least two purposes:
1. let users test their queries against a smaller data set
2. let users understand what the data looks like without scanning the whole table.
A simple function that returns a subset of the input splits will help in those cases. It doesn't have to be strict sampling.
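
To make the idea concrete, here is a minimal sketch of picking a subset of splits that covers roughly x percent of the input. It is not the code from the attached patches: the Split type, the sample_splits function, and the policy of taking whole splits in order until a byte target is met are all assumptions made for illustration.

```python
from typing import List, NamedTuple


class Split(NamedTuple):
    """Hypothetical stand-in for an input split: a file path and a byte length."""
    path: str
    length: int


def sample_splits(splits: List[Split], percent: float) -> List[Split]:
    """Return a prefix of `splits` covering at least `percent` of the total bytes.

    This is not strict sampling: whole splits are taken in their given order
    until the byte target is reached, so the result can overshoot the target.
    """
    if not 0 < percent <= 100:
        raise ValueError("percent must be in (0, 100]")

    total_bytes = sum(s.length for s in splits)
    target_bytes = total_bytes * percent / 100.0

    chosen: List[Split] = []
    covered = 0
    for split in splits:
        # Stop as soon as the chosen splits already cover the target.
        if covered >= target_bytes:
            break
        chosen.append(split)
        covered += split.length
    return chosen
```

Because only whole splits are selected, a request for 30 percent of four equal-sized splits returns two of them (50 percent of the data). That overshoot is in line with the note above that the sampling does not have to be strict.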

Attachments

HIVE-2121.8.patch, 29/Apr/11 09:42, 754 kB, Siying Dong
HIVE-2121.7.patch, 28/Apr/11 22:17, 266 kB, Siying Dong
HIVE-2121.6.patch, 28/Apr/11 19:41, 266 kB, Siying Dong
HIVE-2121.5.patch, 28/Apr/11 08:28, 218 kB, Siying Dong
HIVE-2121.4.patch, 26/Apr/11 23:47, 209 kB, Siying Dong
HIVE-2121.3.patch, 26/Apr/11 21:16, 210 kB, Siying Dong
HIVE-2121.2.patch, 20/Apr/11 18:19, 208 kB, Siying Dong
HIVE-2121.1.patch, 20/Apr/11 00:34, 37 kB, Siying Dong

People

Assignee: Siying Dong

Reporter: Siying Dong

Votes: 0

Watchers: 3

Dates

Created: 20/Apr/11 00:26

Updated: 15/Jun/12 19:17

Resolved: 29/Apr/11 22:51