Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-2602

Allow setting of end-of-record delimiter for TextInputFormat (for the old API)

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.23.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Since there are users who are still using the old MR API, it will be useful to modify the org.apache.hadoop.mapred.LineRecordReader and org.apache.hadoop.mapred.TextInputFormat to be able to use custom (user-specified) end-of-record delimiters. This will make use of the LineReader improvement introduced in HADOOP-7096 that enables the LineReader to break lines at user-specified delimiters.

      Note: MAPREDUCE-2254 already added this improvement to the new API (but not the old API).

        Attachments

        1. MAPREDUCE-2602.patch
          10 kB
          Ahmed Radwan
        2. MAPREDUCE-2602_rev2.patch
          10 kB
          Ahmed Radwan

          Activity

            People

            • Assignee:
              ahmed.radwan Ahmed Radwan
              Reporter:
              ahmed.radwan Ahmed Radwan
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: