Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-953

Enable merge join in pig to work with loaders and store functions which can internally index sorted data

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.3.0
    • 0.6.0
    • None
    • None
    • Reviewed

    Description

      Currently merge join implementation in pig includes construction of an index on sorted data and use of that index to seek into the "right input" to efficiently perform the join operation. Some loaders (notably the zebra loader) internally implement an index on sorted data and can perform this seek efficiently using their index. So the use of the index needs to be abstracted in such a way that when the loader supports indexing, pig uses it (indirectly through the loader) and does not construct an index.

      Attachments

        1. PIG-953_missing_files.diff
          37 kB
          Dmitriy V. Ryaboy
        2. PIG-953-9.patch
          100 kB
          Pradeep Kamath
        3. PIG-953-8.patch
          98 kB
          Pradeep Kamath
        4. PIG-953-7.patch
          96 kB
          Pradeep Kamath
        5. PIG-953-6.patch
          95 kB
          Pradeep Kamath
        6. PIG-953-5.patch
          95 kB
          Pradeep Kamath
        7. PIG-953-4.patch
          86 kB
          Pradeep Kamath
        8. PIG-953-3.patch
          86 kB
          Pradeep Kamath
        9. PIG-953-2.patch
          72 kB
          Pradeep Kamath
        10. PIG-953.patch
          72 kB
          Pradeep Kamath

        Issue Links

          Activity

            People

              pkamath Pradeep Kamath
              pkamath Pradeep Kamath
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: