[HBASE-270] [HBase] Build a Lucene index on an HBase table - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

Description

This patch provides a Reducer class and other related classes which help to build a Lucene index on an HBase table. The index build part is similar to that of Nutch.

Each row is modeled as a Lucene document: row key is indexed in its untokenized form, column name-value pairs are Lucene field name-value pairs.

IndexConf is used to configure various Lucene parameters, specify whether to optimize an index and which columns to index and/or store, in tokenized or untokenized form, etc.

The number of reduce tasks decides the number of indexes (partitions). The index(es) is stored in the output path of job configuration.

The index build process is done in the reduce phase. Users can use the map phase to join rows from different tables or to pre-parse/analyze column content, etc.

A junit test is added to test the build of an index on an HBase table with an identity mapper. It also serves as an example on how to use the new classes.

BuildTableIndex is provided to help building an index on an HBase table. It should be moved to examples package if HBase decides to have one.

This patch requires the inclusion of the Lucene library.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

build_table_index.take8.patch
28/Sep/07 05:52
69 kB
Michael Stack
build_table_index.take7.patch
27/Sep/07 21:31
69 kB
Michael Stack
build_table_index.take6.patch
24/Sep/07 19:43
39 kB
Michael Stack
build_table_index.take5.patch
22/Sep/07 01:25
35 kB
Michael Stack
build_table_index.take4.patch
21/Sep/07 14:11
44 kB
Michael Stack
build_table_index.take3.patch
21/Sep/07 01:31
39 kB
Ning Li
build_table_index.take2.again.patch
18/Sep/07 14:19
37 kB
Ning Li
build_table_index.take2.patch
18/Sep/07 02:31
37 kB
Ning Li
build_table_index.patch
17/Sep/07 23:31
38 kB
Ning Li

Activity

People

Assignee:: Unassigned

Reporter:: Ning Li

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 17/Sep/07 23:30

Updated:: 04/Feb/08 18:41

Resolved:: 28/Sep/07 16:32