[PIG-4585] Use newAPIHadoopRDD instead of newAPIHadoopFile - ASF JIRA

Details

Type: Sub-task
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: spark-branch
Fix Version/s: spark-branch
Component/s: spark
Labels: None

Description

LoadConverter currently uses SparkContext.newAPIHadoopFile, which does not work for non-filesystem-based input sources such as HBase.

newAPIHadoopFile assumes a FileInputFormat and attempts to verify this in the constructor, which fails for HBaseTableInputFormat (which is not a FileInputFormat):

  NewFileInputFormat.setInputPaths(job, path)
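
For illustration only, here is a minimal sketch of reading from HBase through SparkContext.newAPIHadoopRDD, which takes a Hadoop Configuration and the InputFormat class directly instead of a file path. It is not the code from the attached patches. It assumes HBase's own TableInputFormat rather than Pig's HBaseTableInputFormat, a hypothetical table name ("example_table"), a local Spark master, and a reachable HBase cluster.

```scala
import org.apache.hadoop.hbase.HBaseConfiguration
import org.apache.hadoop.hbase.client.Result
import org.apache.hadoop.hbase.io.ImmutableBytesWritable
import org.apache.hadoop.hbase.mapreduce.TableInputFormat
import org.apache.spark.{SparkConf, SparkContext}

object HBaseLoadSketch {
  def main(args: Array[String]): Unit = {
    // Assumption: local master, chosen only to keep the sketch self-contained.
    val sc = new SparkContext(
      new SparkConf().setAppName("hbase-load-sketch").setMaster("local[*]"))

    // The input source is described entirely by configuration, not by a path.
    val conf = HBaseConfiguration.create()
    conf.set(TableInputFormat.INPUT_TABLE, "example_table") // hypothetical table name

    // newAPIHadoopRDD receives the configuration and the InputFormat class,
    // so no FileInputFormat-specific path setup is involved.
    val rdd = sc.newAPIHadoopRDD(
      conf,
      classOf[TableInputFormat],
      classOf[ImmutableBytesWritable],
      classOf[Result])

    println(s"rows read: ${rdd.count()}")
    sc.stop()
  }
}
```

File-based loaders can go through the same call as long as the input path is already set in the Configuration, which is presumably why a single newAPIHadoopRDD code path can cover both kinds of source.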

Attachments

- PIG-4585.patch, 03/Jun/15 22:58, 13 kB, Mohit Sabharwal
- PIG-4585.2.patch, 10/Jun/15 19:20, 12 kB, Mohit Sabharwal
- PIG-4585.1.patch, 09/Jun/15 18:25, 11 kB, Mohit Sabharwal

Issue Links

links to: review board

People

Assignee: Mohit Sabharwal
Reporter: Mohit Sabharwal
Votes: 0
Watchers: 3

Dates

Created: 03/Jun/15 22:52
Updated: 21/Jun/17 09:18
Resolved: 11/Jun/15 13:16