Details
Type: Improvement
Status: Closed
Priority: Major
Resolution: Duplicate
0.90.3
None
None
Description
Some requirements:
- Being able to have multiple tables as your input path
- Being able to filter on specific columns/column families
- Providing the source location (table/row/column) to the results
- Reading from multiple clusters
- Handling tables with different schemas.
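To make the requirements above concrete, here is a minimal sketch in plain Python. It does not use the HBase API; the tables are in-memory dicts, and every name in it (`SourcedCell`, `scan_inputs`, the cluster and table names) is hypothetical and introduced only for illustration. It shows several inputs being read in one pass, an optional column-family filter, and each result carrying its source location.

```python
from dataclasses import dataclass
from typing import Dict, Iterator, Optional, Set, Tuple

# Hypothetical stand-in for a table: row key -> {"family:qualifier": value}.
Table = Dict[str, Dict[str, bytes]]


@dataclass(frozen=True)
class SourcedCell:
    """One value plus where it came from (assumed shape, not an HBase type)."""
    cluster: str
    table: str
    row: str
    column: str
    value: bytes


def scan_inputs(
    inputs: Dict[Tuple[str, str], Table],
    families: Optional[Set[str]] = None,
) -> Iterator[SourcedCell]:
    """Yield every cell from every (cluster, table) input.

    If `families` is given, only columns whose family is in the set are kept.
    """
    for (cluster, table_name), table in inputs.items():
        for row in sorted(table):
            for column, value in sorted(table[row].items()):
                family = column.split(":", 1)[0]
                if families is None or family in families:
                    yield SourcedCell(cluster, table_name, row, column, value)


# Two inputs on different (made-up) clusters with different schemas.
inputs = {
    ("clusterA", "users"): {"r1": {"info:name": b"ann", "meta:ts": b"1"}},
    ("clusterB", "orders"): {"r9": {"info:total": b"42"}},
}
cells = list(scan_inputs(inputs, families={"info"}))
```

In this sketch the mapper-side code would receive `SourcedCell` values, so it never has to assume that all inputs share a schema; it can branch on `cell.table` instead.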
Overall, this seems difficult for now, so I am going to punt on it. On the other hand, it would be easy enough to write all of the MapReduce (MR) output values into an intermediate table and then work from there.
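The intermediate-table workaround could look roughly like the sketch below. It is plain Python over an in-memory dict rather than real HBase code, and the `stage` helper and the `source_table/row` key layout are assumptions made for illustration only. Each upstream job writes its output into one staging table, prefixing the row key with the source table so the origin is not lost, and a later job reads that single table.

```python
from typing import Dict, Iterable, Tuple

# Hypothetical staging table: row key -> {"family:qualifier": value}.
Staging = Dict[str, Dict[str, bytes]]


def stage(
    staging: Staging,
    source_table: str,
    records: Iterable[Tuple[str, str, bytes]],
) -> Staging:
    """Write (row, column, value) records into the staging table.

    The row key is prefixed with the source table name (assumed layout),
    so rows with the same key from different tables do not collide.
    """
    for row, column, value in records:
        key = f"{source_table}/{row}"
        staging.setdefault(key, {})[column] = value
    return staging


staging: Staging = {}
stage(staging, "users", [("r1", "info:name", b"ann")])
stage(staging, "orders", [("r1", "info:total", b"42")])

# A downstream single-input job now only needs to read `staging`.
column_counts = {key: len(cols) for key, cols in staging.items()}
```

The `/` separator is only safe if table names never contain it; a real key layout would need a delimiter or fixed-width prefix chosen with that in mind.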