Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Currently the only way to implement indexing is to use the BlurReducer. A better way to implement this would be to support Hadoop input/outputformats in both the new and old api's. This would allow an easier integration with other Hadoop projects such as Hive and Pig.
Attachments
Attachments
1.
|
Create Bulk OutputFormat for direct indexing to HDFS | Closed | Unassigned | |
2.
|
Create Thrift Based OutputFormat to write data directly to the shard servers | Open | Unassigned | |
3.
|
Create a Thrift based InputFormat that reads the output of a query into a MR job | Closed | Unassigned |