  HBase / HBASE-5166

MultiThreaded Table Mapper analogous to the MultithreadedMapper in Hadoop

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.94.0
    • Component/s: None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      New MultiThreadedTableMapper facility

      Description

      HBase currently has no MultiThreadedTableMapper, whereas Hadoop provides a MultithreadedMapper for IO-bound jobs.
      Use case, web crawler: take input (URLs) from an HBase table and put the results (URLs, content) back into HBase.
      Running this kind of HBase MapReduce job with the normal TableMapper is quite slow because the CPU is not fully utilized (the job is network-IO bound).

      Moreover, I would like to know whether it is a good or bad idea to use HBase for this kind of use case.
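
      To make the motivation concrete, here is a minimal sketch of the general idea: one map task hands each input row to a small pool of worker threads, so that slow network calls overlap instead of running one after another. It uses only java.util.concurrent and is not the MultiThreadedTableMapper that shipped with this issue. The class name MultiThreadedMapSketch, the helper mapInParallel, and the stand-in fakeFetch are all hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

// Illustrative sketch only: NOT the HBase MultiThreadedTableMapper implementation.
public class MultiThreadedMapSketch {

    // Applies mapFn to every row using a fixed pool of worker threads and
    // collects (row -> result) pairs. Results must be non-null because
    // ConcurrentHashMap rejects null values.
    public static <K, V> Map<K, V> mapInParallel(List<K> rows, Function<K, V> mapFn, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        Map<K, V> output = new ConcurrentHashMap<>();
        try {
            List<Future<?>> pending = new ArrayList<>();
            for (K row : rows) {
                // Each row becomes one task; blocking I/O in one task does
                // not stall the others.
                pending.add(pool.submit(() -> {
                    output.put(row, mapFn.apply(row));
                }));
            }
            // Wait for every task so the caller sees a complete result.
            for (Future<?> f : pending) {
                f.get();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while mapping", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("map function failed", e.getCause());
        } finally {
            pool.shutdown();
        }
        return output;
    }

    // Hypothetical stand-in for a network fetch: sleeps to imitate latency.
    static String fakeFetch(String url) {
        try {
            Thread.sleep(50);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return "content of " + url;
    }

    public static void main(String[] args) {
        List<String> urls = List.of("http://a.example", "http://b.example",
                                    "http://c.example", "http://d.example");
        Map<String, String> out = mapInParallel(urls, MultiThreadedMapSketch::fakeFetch, 4);
        System.out.println(out.size() + " rows mapped");
    }
}
```

      With four threads the four simulated fetches overlap, so the wall-clock time is close to one fetch rather than four. The same reasoning is why a multithreaded mapper helps a crawler-style job, while a CPU-bound job would gain little from it.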

            People

            • Assignee:
              flukebox Jai Kumar Singh
              Reporter:
              flukebox Jai Kumar Singh
            • Votes:
              0
              Watchers:
              2

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                0.5h
                Remaining:
                0.5h
                Logged:
                Not Specified