[HBASE-15482] Provide an option to skip calculating block locations for SnapshotInputFormat - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.0-beta-1, 2.0.0
Component/s: mapreduce
Labels:
None

Hadoop Flags:

Reviewed

Description

When a MR job is reading from SnapshotInputFormat, it needs to calculate the splits based on the block locations in order to get best locality. However, this process may take a long time for large snapshots.

In some setup, the computing layer, Spark, Hive or Presto could run out side of HBase cluster. In these scenarios, the block locality doesn't matter. Therefore, it will be great to have an option to skip calculating the block locations for every job. That will super useful for the Hive/Presto/Spark connectors.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HBASE-15482.master.000.patch
06/Dec/17 17:10
13 kB
Xiang Li
HBASE-15482.master.001.patch
08/Dec/17 04:25
17 kB
Xiang Li
HBASE-15482.master.002.patch
08/Dec/17 07:01
17 kB
Xiang Li
15482.v3.txt
12/Dec/17 05:44
16 kB
Ted Yu
HBASE-15482.master.003.patch
19/Dec/17 16:53
17 kB
Xiang Li

Activity

People

Assignee:: Xiang Li

Reporter:: Liyin Tang

Votes:: 0 Vote for this issue

Watchers:: 11 Start watching this issue

Dates

Created:: 18/Mar/16 04:18

Updated:: 21/Mar/18 22:21

Resolved:: 20/Dec/17 15:49