Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 2.6.0
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: contrib/streaming
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      The documentation for the Streaming module does not mention the "slowmatch" parameter, which checks for CDATA sections while looking for XML records.

      An important point is that "slowmatch=true" violates the principle of least surprise: the "begin" and "end" parameters are treated as regular expressions instead of exact strings. This is probably a useful feature, but it should definitely be documented, since users will be tempted to use the XML record reader on files that are not strictly XML, and such files may require escaping the "begin" and "end" patterns.
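      To illustrate why escaping matters, here is a minimal sketch of the difference between exact-string and regular-expression matching of a marker. It is not the Streaming reader's code: Python's "re" module stands in for the regex engine Hadoop uses, and the sample record, the "[item]" marker, and the helper names find_exact and find_regex are all hypothetical.

```python
import re


def find_exact(text, marker):
    """Return the index of the first exact occurrence of marker, or -1."""
    return text.find(marker)


def find_regex(text, marker):
    """Return the index of the first regex match of marker, or -1."""
    match = re.search(marker, text)
    return match.start() if match else -1


# Hypothetical not-strictly-XML record delimited by bracketed markers.
record = "header [item] a=1 [/item] trailer"
begin = "[item]"

# Exact matching finds the literal marker.
exact = find_exact(record, begin)

# As a regex, "[item]" is a character class matching one of i, t, e, m,
# so it matches the first "e" in "header" instead of the marker.
as_regex = find_regex(record, begin)

# Escaping the marker restores the literal meaning under regex matching.
escaped = find_regex(record, re.escape(begin))

print(exact, as_regex, escaped)
```

      With markers such as "<page>" that contain no regex metacharacters, both modes find the same position, which is why the change in behaviour is easy to miss. Whether a given escaping style is accepted by the "begin" and "end" parameters themselves is not confirmed here and should be checked against the reader.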

            People

            • Assignee:
              qwertymaniac Harsh J
              Reporter:
              thumper Bo Adler
            • Votes:
              0
              Watchers:
              5
