Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-735

The underlying data structure, ByteArrayOutputStream, for buffer type of Hadoop record is inappropriate

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.9.2
    • 0.11.0
    • record
    • None

    Description

      With ByteArrayOutputStream as the underlying data structure for a buffer, the user is forced to convert it into a byte [] object in order to do any operations other than sequence append on the buffer. The convertion will create a new copy of bytes. That will cause huge performance problem.

      It seems BytesWritable is a better replacement.

      Attachments

        1. BytesWritable.patch
          33 kB
          Milind Barve

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            milindb Milind Barve
            runping Runping Qi
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment