Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-6072

Make TableRecordReaderImpl more easily extended

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Not A Problem
    • None
    • None
    • mapreduce
    • None

    Description

      We have a MR job that is very memory bound. It reads a potentially large row from hbase, then deserializes it into an (even larger) object representation, then does a fair amount of computation requiring memory. After converting the Result into our object representation we want to free the memory holding the Result to be available for the actual computation of output values.

      Currently we have our own custom modified copy of TableRecordReaderImpl to be able to set the Result value to null after reading it, but it's almost entirely a duplicate of hbase's TableRecordReaderImpl so we have to manually keep it up to date with changes to the hbase version. If the value field of TableRecordReaderImpl were protected instead of private we could use a very simple subclass instead.

      Are there any philosophical guidelines about what parts of HBase should or should not be easily extensible?

      Attachments

        Activity

          People

            Unassigned Unassigned
            davelatham Dave Latham
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: