[HBASE-9970] HBase BulkLoad, table is creating with the timestamp key also as a column to the table. - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.94.11
Fix Version/s: 0.98.0, 0.96.1, 0.94.14
Component/s: None
Labels:
None

Hadoop Flags:

Reviewed

Description

If BulkLoad job is running with out creating a table.
job itself will create the table if table is not found.

if (!doesTableExist(tableName)) {
        createTable(conf, tableName);
      }

if columns contains timestamp also then table is creating with defined columns and timestamp key.

eg: -Dimporttsv.columns=HBASE_ROW_KEY,HBASE_TS_KEY,d:num

table is creating with the following columnFamilies.
'HBASE_TS_KEY' and 'd'

while iterating timestamp key also need to avoid while describing the column descriptors.

private static void createTable(HBaseAdmin admin, String tableName, String[] columns)
      throws IOException {
    HTableDescriptor htd = new HTableDescriptor(TableName.valueOf(tableName));
    Set<String> cfSet = new HashSet<String>();
    for (String aColumn : columns) {
      if (TsvParser.ROWKEY_COLUMN_SPEC.equals(aColumn)) continue;
      // we are only concerned with the first one (in case this is a cf:cq)
      cfSet.add(aColumn.split(":", 2)[0]);
    }
    for (String cf : cfSet) {
      HColumnDescriptor hcd = new HColumnDescriptor(Bytes.toBytes(cf));
      htd.addFamily(hcd);
    }
    LOG.warn(format("Creating table '%s' with '%s' columns and default descriptors.",
      tableName, cfSet));
    admin.createTable(htd);
  }

Index: hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/ImportTsv.java
===================================================================
--- hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/ImportTsv.java	(revision 1539967)
+++ hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/ImportTsv.java	(working copy)
@@ -413,7 +413,8 @@
     HTableDescriptor htd = new HTableDescriptor(TableName.valueOf(tableName));
     Set<String> cfSet = new HashSet<String>();
     for (String aColumn : columns) {
-      if (TsvParser.ROWKEY_COLUMN_SPEC.equals(aColumn)) continue;
+      if (TsvParser.ROWKEY_COLUMN_SPEC.equals(aColumn)
+          || TsvParser.TIMESTAMPKEY_COLUMN_SPEC.equals(aColumn)) continue;
       // we are only concerned with the first one (in case this is a cf:cq)
       cfSet.add(aColumn.split(":", 2)[0]);
     }

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HBASE-9970.java.patch
15/Nov/13 05:02
0.8 kB
Y. SREENIVASULU REDDY
HBASE-9970_94.patch
15/Nov/13 04:58
0.8 kB
Y. SREENIVASULU REDDY

Activity

People

Assignee:: Y. SREENIVASULU REDDY

Reporter:: Y. SREENIVASULU REDDY

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 14/Nov/13 12:36

Updated:: 17/Dec/13 16:09

Resolved:: 16/Nov/13 03:47