Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-9790

Hybrid Hybrid Grace Hash Join: improve side file serialization

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      We have discussed it in the past; the current method is very wasteful, if serializes helper object for each row (so there's expensive serialization and also bunch of unneeded stuff serialized), whereas at the "memory-insert vs side-file-spill" decision point we can produce bytes that are directly usable by hashtable in one method call. So we should do that... at load point, again there's no expensive deserialization, and no helpers, bytes can go into hashtable directly pretty much

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                sershe Sergey Shelukhin
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated: