Details
Type: New Feature
Status: Open
Priority: Major
Resolution: Unresolved
Description
Like HADOOP-372, we should have a multi-format Reducer too. Someone suggested that if I need different reducer and map implementations (as I do), I would be better off writing two jobs. I don't quite agree. I am calculating two big matrices that must be computed in the map step, summed in the reducers, multiplied, and then written to a file. The first mapper sums a matrix keyed by the (i, j) index into the file, and the second mapper adds the N*1 vector, which uses a new line as its key. These keys must be passed as they are to the reduce process.
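To make the data flow concrete, here is a rough sketch in plain Python (not the Hadoop API) of how two mappers with different key formats could feed one reduce step. Everything in it is an assumption made for illustration: the function names, the "M"/"V" key tags, the use of a row index as the vector key, and the choice of matrix-times-vector as the multiplication.

```python
from collections import defaultdict


def matrix_mapper(records):
    """Hypothetical first mapper: key each partial matrix entry by (i, j)."""
    for i, j, value in records:
        # The "M" tag marks the key as a matrix key (an assumed convention).
        yield ("M", i, j), value


def vector_mapper(records):
    """Hypothetical second mapper: key each partial vector entry by its row."""
    for row, value in records:
        # The "V" tag marks the key as a vector key (an assumed convention).
        yield ("V", row), value


def reduce_sum(pairs):
    """Sum all values that share a key; keys are passed through unchanged."""
    totals = defaultdict(float)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def multiply(totals, n):
    """Multiply the summed n x n matrix by the summed n x 1 vector."""
    result = [0.0] * n
    for key, value in totals.items():
        if key[0] == "M":
            _, i, j = key
            result[i] += value * totals.get(("V", j), 0.0)
    return result


def run_job(matrix_records, vector_records, n):
    """Run both mappers, then a single reduce step over the mixed keys."""
    pairs = list(matrix_mapper(matrix_records))
    pairs += list(vector_mapper(vector_records))
    return multiply(reduce_sum(pairs), n)
```

The point of the sketch is only that the reducer has to accept both key formats in one job, which is what running two separate jobs would avoid at the cost of an extra pass over the data.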