Hadoop Common / HADOOP-4842

Streaming combiner should allow command, not just JavaClass

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.21.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      Streaming option -combiner allows any streaming command (not just Java class) to be a combiner.

      Description

      Streaming jobs are much slower than Java jobs for many reasons, and preventing shell-only programmers from using the combiner feature certainly doesn't help. The streaming usage currently says:

      -mapper   <cmd|JavaClassName>  The streaming command to run
      -combiner <JavaClassName>      Combiner has to be a Java class
      -reducer  <cmd|JavaClassName>  The streaming command to run
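
To illustrate what a command-based combiner might look like once -combiner accepts a streaming command, here is a minimal sketch of a script that pre-aggregates counts per key. It is not taken from the attached patches. The script, its function names (`parse`, `combine`, `main`), and the tab-separated `key<TAB>count` record layout are all assumptions made for illustration.

```python
#!/usr/bin/env python3
"""Hypothetical streaming combiner: sums integer counts per key.

Assumption (not stated in the issue): records arrive on stdin as
tab-separated 'key<TAB>count' lines, one record per line.
"""
import sys
from itertools import groupby


def parse(lines):
    """Yield (key, count) pairs, skipping blank lines."""
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        key, _, value = line.partition("\t")
        yield key, int(value)


def combine(lines):
    """Sum counts over each run of consecutive identical keys.

    If identical keys are not adjacent, the same key is emitted more
    than once as separate partial sums; a summing reducer would still
    produce the correct totals from them.
    """
    for key, group in groupby(parse(lines), key=lambda kv: kv[0]):
        yield key, sum(count for _, count in group)


def main():
    # Emit one 'key<TAB>partial_sum' line per run of identical keys.
    for key, total in combine(sys.stdin):
        sys.stdout.write(f"{key}\t{total}\n")


if __name__ == "__main__":
    main()
```

A script like this could then be supplied as the -combiner command in the same way a command is supplied to -mapper or -reducer. Because summing is associative and commutative, running it zero, one, or several times before the reducer should not change the final totals.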

        Attachments

        1. patch-4842-3.txt
          7 kB
          Amareshwari Sriramadasu
        2. patch-4842-2.txt
          8 kB
          Amareshwari Sriramadasu
        3. patch-4842-1.txt
          6 kB
          Amareshwari Sriramadasu
        4. patch-4842.txt
          6 kB
          Amareshwari Sriramadasu

            People

            • Assignee:
              amareshwari Amareshwari Sriramadasu
              Reporter:
              menicosia Marco Nicosia
            • Votes:
              0
              Watchers:
              3
