Details

    • Type: New Feature New Feature
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.18.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      Created SequenceFileAsBinaryOutputFormat to write raw bytes as keys and values to a SequenceFile.

      Description

      Add an OutputFormat to write raw bytes as keys and values to a SequenceFile.

      In C++-Pipes, we're using SequenceFileAsBinaryInputFormat to read Sequencefiles.
      However, we current don't have a way to write a sequencefile efficiently without going through extra (de)serializations.

      I'd like to store the correct classnames for key/values but use BytesWritable to write
      (in order for the next java or pig code to be able to read this sequencefile).

      1. HADOOP-3460-part3.patch
        18 kB
        Koji Noguchi
      2. HADOOP-3460-part2.patch
        16 kB
        Koji Noguchi
      3. HADOOP-3460-part1.patch
        12 kB
        Koji Noguchi

        Activity

        No work has yet been logged on this issue.

          People

          • Assignee:
            Koji Noguchi
            Reporter:
            Koji Noguchi
          • Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development