Hadoop Map/Reduce / MAPREDUCE-579

Streaming "slowmatch" documentation

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 2.6.0
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: contrib/streaming
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      The documentation for the Streaming module does not mention the "slowmatch" parameter, which checks for CDATA sections while looking for XML records.

      An important point is that "slowmatch=true" violates the principle of least surprise: the "begin" and "end" parameters are treated as regular expressions instead of exact strings. This is probably a useful feature, but it should definitely be documented, since users will be tempted to use the XML record reader on not-strictly-XML files, where the "begin" and "end" patterns may need escaping.
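
      To illustrate the escaping concern, here is a minimal standalone sketch using plain java.util.regex, not the streaming code itself. The class name, the helper names, and the "[record]" marker are hypothetical. It assumes only what is described above, namely that the marker is handed to the regex engine when slowmatch is enabled.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class; not part of Hadoop Streaming.
public class SlowmatchEscaping {

    // Exact-string search: returns the marker if it occurs literally, else null.
    static String firstLiteralMatch(String text, String marker) {
        return text.contains(marker) ? marker : null;
    }

    // Regex search: compiles the marker as a pattern and returns the
    // first substring it matches, or null if nothing matches.
    static String firstRegexMatch(String text, String marker) {
        Matcher m = Pattern.compile(marker).matcher(text);
        return m.find() ? m.group() : null;
    }

    public static void main(String[] args) {
        String text = "[record] payload [/record]";
        String begin = "[record]"; // marker from a not-strictly-XML file

        // As an exact string, the marker matches itself.
        System.out.println(firstLiteralMatch(text, begin));

        // As a regex, "[record]" is a character class, so it matches
        // a single character instead of the whole marker.
        System.out.println(firstRegexMatch(text, begin));

        // Quoting the marker restores the literal meaning.
        System.out.println(firstRegexMatch(text, Pattern.quote(begin)));
    }
}
```

      Pattern.quote wraps the marker in \Q...\E. Whether a quoted value survives shell and job-configuration quoting unchanged when passed as "begin" or "end" has not been checked here, so treat that part as an assumption.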

      Attachments

        1. MAPREDUCE-579.patch
          2 kB
          Harsh J

          People

            Assignee: Harsh J (qwertymaniac)
            Reporter: Bo Adler (thumper)
            Votes: 0
            Watchers: 5
