Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-2602

Allow setting of end-of-record delimiter for TextInputFormat (for the old API)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.23.0
    • None
    • None
    • Reviewed

    Description

      Since there are users who are still using the old MR API, it will be useful to modify the org.apache.hadoop.mapred.LineRecordReader and org.apache.hadoop.mapred.TextInputFormat to be able to use custom (user-specified) end-of-record delimiters. This will make use of the LineReader improvement introduced in HADOOP-7096 that enables the LineReader to break lines at user-specified delimiters.

      Note: MAPREDUCE-2254 already added this improvement to the new API (but not the old API).

      Attachments

        1. MAPREDUCE-2602.patch
          10 kB
          Ahmed Radwan
        2. MAPREDUCE-2602_rev2.patch
          10 kB
          Ahmed Radwan

        Activity

          People

            ahmed.radwan Ahmed Radwan
            ahmed.radwan Ahmed Radwan
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: