[HBASE-5166] MultiThreaded Table Mapper analogous to MultiThreaded Mapper in hadoop - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.94.0
Component/s: None
Labels:
- multithreaded
- tablemapper

Hadoop Flags: Reviewed
Release Note: New MultiThreadedTableMapper facility

Description

HBase currently has no MultiThreadedTableMapper, the counterpart to Hadoop's MultithreadedMapper for I/O-bound jobs.
Use case (web crawler): read the input (URLs) from an HBase table and write the results (URL, content) back into HBase.
Running this kind of HBase MapReduce job with the normal TableMapper is quite slow, because the job is network I/O bound and does not fully utilize the CPU.
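
To make the motivation concrete, here is a minimal sketch of the idea, not the attached patch: a single task hands each input record to a fixed pool of threads, so several slow fetches are in flight at once. It uses only the Java standard library. The class name `MultithreadedMapSketch`, the `fetch` stub (a sleep standing in for a network call), and the in-memory output map (standing in for writes back to the table) are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class MultithreadedMapSketch {

    // Hypothetical stand-in for a network fetch: sleeps to mimic I/O latency.
    static String fetch(String url) throws InterruptedException {
        Thread.sleep(50);
        return "content-of-" + url;
    }

    // Applies the per-record "map" step on a fixed pool of threads and
    // collects the (url, content) pairs in a thread-safe map.
    static Map<String, String> mapAll(List<String> urls, int threads) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        Map<String, String> out = new ConcurrentHashMap<>();
        try {
            List<Future<?>> pending = new ArrayList<>();
            for (String url : urls) {
                // Returning null makes this lambda a Callable, so the
                // checked InterruptedException from fetch is allowed.
                pending.add(pool.submit(() -> {
                    out.put(url, fetch(url));
                    return null;
                }));
            }
            // Wait for every record; get() rethrows any failure from a worker.
            for (Future<?> f : pending) {
                f.get();
            }
        } finally {
            pool.shutdown();
        }
        return out;
    }

    public static void main(String[] args) throws Exception {
        List<String> urls = Arrays.asList("u1", "u2", "u3", "u4");
        long start = System.nanoTime();
        Map<String, String> result = mapAll(urls, 4);
        long elapsedMs = (System.nanoTime() - start) / 1_000_000;
        System.out.println(result.size() + " records in about " + elapsedMs + " ms");
    }
}
```

With one thread the four simulated fetches run back to back; with four threads they overlap, which is the gain an I/O-bound mapper is after. The mapper logic must be thread-safe for this to work, which is why the sketch writes to a `ConcurrentHashMap`.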

Moreover, I would like to know whether using HBase for this kind of use case is a good or a bad idea.

Attachments

- 0001-Added-MultithreadedTableMapper-HBASE-5166.patch (10/Jan/12 11:39, 9 kB, Jai Kumar Singh)
- 0003-Added-MultithreadedTableMapper-HBASE-5166.patch (16/Jan/12 19:11, 8 kB, Jai Kumar Singh)
- 0005-HBASE-5166-Added-MultithreadedTableMapper.patch (21/Feb/12 10:13, 31 kB, Jai Kumar Singh)
- 0006-HBASE-5166-Added-MultithreadedTableMapper.patch (21/Feb/12 12:38, 19 kB, Jai Kumar Singh)
- 0008-HBASE-5166-Added-MultithreadedTableMapper.patch (23/Feb/12 06:15, 18 kB, Jai Kumar Singh)
- 5166-v9.txt (23/Feb/12 17:34, 18 kB, Michael Stack)

People

Assignee: Jai Kumar Singh

Reporter: Jai Kumar Singh

Votes: 0

Watchers: 2

Dates

Created: 10/Jan/12 10:23

Updated: 12/Oct/12 05:35

Resolved: 24/Feb/12 06:39

Time Tracking

Estimated: 0.5h

Remaining: 0.5h

Logged: Not Specified