Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-9790

Hybrid Hybrid Grace Hash Join: improve side file serialization

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      We have discussed it in the past; the current method is very wasteful, if serializes helper object for each row (so there's expensive serialization and also bunch of unneeded stuff serialized), whereas at the "memory-insert vs side-file-spill" decision point we can produce bytes that are directly usable by hashtable in one method call. So we should do that... at load point, again there's no expensive deserialization, and no helpers, bytes can go into hashtable directly pretty much

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              sershe Sergey Shelukhin
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: