Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-2586

A better plan/data flow visualizer

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • impl

    Description

      Pig supports a dot graph style plan to visualize the logical/physical/mapreduce plan (explain with -dot option, see http://ofps.oreilly.com/titles/9781449302641/developing_and_testing.html). However, dot graph takes extra step to generate the plan graph and the quality of the output is not good. It's better we can implement a better visualizer for Pig. It should:
      1. show operator type and alias
      2. turn on/off output schema
      3. dive into foreach inner plan on demand
      4. provide a way to show operator source code, eg, tooltip of an operator (plan don't currently have this information, but you can assume this is in place)
      5. besides visualize logical/physical/mapreduce plan, visualize the script itself is also useful
      6. may rely on some java graphic library such as Swing

      This is a candidate project for Google summer of code 2013. More information about the program can be found at https://cwiki.apache.org/confluence/display/PIG/GSoc2013

      Functionality implemented so far, is available at
      https://reviews.apache.org/r/12077/

      Attachments

        1. patch04
          35 kB
          Allan Avendaño
        2. visualize.zip
          119 kB
          Allan Avendaño
        3. graph.zip
          117 kB
          Allan Avendaño

        Activity

          People

            xalan Allan Avendaño
            daijy Daniel Dai
            Votes:
            1 Vote for this issue
            Watchers:
            11 Start watching this issue

            Dates

              Created:
              Updated: