Pig
  1. Pig
  2. PIG-3204

Change script parsing to parse entire script instead of line by line

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.10.1
    • Fix Version/s: 0.12.0
    • Component/s: None
    • Labels:
      None

      Description

      Currently there are a lot of NN calls made to determine if there is a schema file for a path in a LOAD statement. When there is a slow NN(caused by whole bunch of other issues), it takes a lot of time for this and we found the scripts spending anywhere from 5 mins to 40 mins depending upon the script. It seems to be a good place for optimization.

      1. PIG-3204-6.patch
        73 kB
        Rohini Palaniswamy
      2. PIG-3204-5.patch
        73 kB
        Rohini Palaniswamy
      3. PIG-3204-4.patch
        73 kB
        Rohini Palaniswamy
      4. PIG-3204-3.patch
        17 kB
        Rohini Palaniswamy
      5. PIG-3204-2.patch
        8 kB
        Rohini Palaniswamy
      6. PIG-3204-1.patch
        8 kB
        Rohini Palaniswamy

        Issue Links

          Activity

          Cheolsoo Park made changes -
          Link This issue breaks PIG-4106 [ PIG-4106 ]
          Daniel Dai made changes -
          Status Resolved [ 5 ] Closed [ 6 ]
          Rohini Palaniswamy made changes -
          Status Patch Available [ 10002 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Rohini Palaniswamy made changes -
          Summary Reduce the number of getSchema calls during script parsing Change script parsing to parse entire script instead of line by line
          Rohini Palaniswamy made changes -
          Attachment PIG-3204-6.patch [ 12598083 ]
          Rohini Palaniswamy made changes -
          Attachment PIG-3204-5.patch [ 12597998 ]
          Rohini Palaniswamy made changes -
          Attachment PIG-3204-4.patch [ 12597728 ]
          Rohini Palaniswamy made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Rohini Palaniswamy made changes -
          Attachment PIG-3204-3.patch [ 12597727 ]
          Rohini Palaniswamy made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Rohini Palaniswamy made changes -
          Attachment PIG-3204-2.patch [ 12587490 ]
          Rohini Palaniswamy made changes -
          Summary Optimize the number of FS calls to get schema to cut down time before job launch Reduce the number of getSchema calls during script parsing
          Rohini Palaniswamy made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Assignee Rohini Palaniswamy [ rohini ]
          Fix Version/s 0.12 [ 12323380 ]
          Rohini Palaniswamy made changes -
          Attachment PIG-3204-1.patch [ 12587349 ]
          Rohini Palaniswamy made changes -
          Field Original Value New Value
          Affects Version/s 0.10.1 [ 12320547 ]
          Rohini Palaniswamy created issue -

            People

            • Assignee:
              Rohini Palaniswamy
              Reporter:
              Rohini Palaniswamy
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development