Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-2586

A better plan/data flow visualizer

Add voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • impl

    Description

      Pig supports a dot graph style plan to visualize the logical/physical/mapreduce plan (explain with -dot option, see http://ofps.oreilly.com/titles/9781449302641/developing_and_testing.html). However, dot graph takes extra step to generate the plan graph and the quality of the output is not good. It's better we can implement a better visualizer for Pig. It should:
      1. show operator type and alias
      2. turn on/off output schema
      3. dive into foreach inner plan on demand
      4. provide a way to show operator source code, eg, tooltip of an operator (plan don't currently have this information, but you can assume this is in place)
      5. besides visualize logical/physical/mapreduce plan, visualize the script itself is also useful
      6. may rely on some java graphic library such as Swing

      This is a candidate project for Google summer of code 2013. More information about the program can be found at https://cwiki.apache.org/confluence/display/PIG/GSoc2013

      Functionality implemented so far, is available at
      https://reviews.apache.org/r/12077/

      Attachments

        1. graph.zip
          117 kB
          Allan Avendaño
        2. patch04
          35 kB
          Allan Avendaño
        3. visualize.zip
          119 kB
          Allan Avendaño

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            xalan Allan Avendaño
            daijy Daniel Dai

            Dates

              Created:
              Updated:

              Slack

                Issue deployment