[BLUR-18] Rework the MapReduce Library to implement Input/OutputFormats - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

Description

Currently the only way to implement indexing is to use the BlurReducer. A better way to implement this would be to support Hadoop input/outputformats in both the new and old api's. This would allow an easier integration with other Hadoop projects such as Hive and Pig.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

0001-BLUR-ID-18-Created-New-Version-of-Files.patch
03/Nov/12 11:12
31 kB
Gagandeep Singh
0001-BLUR-ID-18-New-Writables.patch
05/Nov/12 13:34
16 kB
Gagandeep Singh

Sub-Tasks

1.	Create Bulk OutputFormat for direct indexing to HDFS	Closed	Unassigned
2.	Create Thrift Based OutputFormat to write data directly to the shard servers	Open	Unassigned
3.	Create a Thrift based InputFormat that reads the output of a query into a MR job	Closed	Unassigned

Activity

People

Assignee:: Unassigned

Reporter:: Aaron McCurry

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 07/Sep/12 13:09

Updated:: 02/Jun/15 12:45