[HBASE-14796] Enhance the Gets in the connector - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.0
Component/s: None
Labels:
None

Hadoop Flags:

Reviewed
Release Note:
spark.hbase.bulkGetSize in HBaseSparkConf is for grouping bulkGet, and default value is 1000.

Description

Current the Spark-Module Spark SQL implementation gets records from HBase from the driver if there is something like the following found in the SQL.

rowkey = 123

The reason for this original was normal sql will not have many equal operations in a single where clause.

Zhan, had brought up too points that have value.
1. The SQL may be generated and may have many many equal statements in it so moving the work to an executor protects the driver from load
2. In the correct implementation the drive is connecting to HBase and exceptions may cause trouble with the Spark application and not just with the a single task execution

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HBASE-14976.patch
23/Dec/15 19:42
11 kB
Zhan Zhang
HBASE-14796-1.patch
28/Dec/15 19:49
11 kB
Zhan Zhang

Issue Links

is part of

HBASE-14789 Enhance the current spark-hbase connector

Closed

links to

reviewboard

Activity

People

Assignee:: Zhan Zhang

Reporter:: Theodore michael Malaska

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 11/Nov/15 00:00

Updated:: 29/Dec/15 03:27

Resolved:: 28/Dec/15 23:48