Hive / HIVE-17209

ObjectCacheFactory should return null when tez shared object registry is not setup


Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      HIVE-15269 introduced the dynamic min/max Bloom filter ("hive.tez.dynamic.semijoin.reduction=true"). This feature needs access to ObjectCache, and in Tez an ObjectCache can only be created by TezProcessor.

      The AM (Tez application master) never creates an ObjectCache, so the following call path, which runs in the AM, ends up throwing a large number of NPEs:

      AM --> splits --> OrcInputFormat.pickStripes::evaluatePredicateMinMax --> DynamicValue.getLiteral --> objectCache access

      The ORC reader catches these exceptions, skips PPD, and proceeds. For example, Q95 throws ~30,000 NPEs before it finishes computing split information.

      ObjectCacheFactory should return null when the Tez shared object registry is not set up.
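
      The proposed behavior can be sketched roughly as follows. This is a minimal, self-contained illustration and not the attached patch: SketchObjectCache, SketchRegistryHolder, SketchObjectCacheFactory, and SketchDynamicValue are hypothetical stand-ins for the Hive and Tez classes named above, and the null check in the caller is an assumption about how a caller could avoid the NPE.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Supplier;

// Hypothetical stand-in for an object cache; not Hive's real ObjectCache API.
interface SketchObjectCache {
    <T> T retrieve(String key, Supplier<T> loader);
}

// Hypothetical stand-in for the shared object registry. It stays null until
// a processor sets it up, which never happens in the AM.
final class SketchRegistryHolder {
    private static volatile Map<String, Object> registry;

    static void setUp() { registry = new ConcurrentHashMap<>(); }
    static void reset() { registry = null; }
    static Map<String, Object> get() { return registry; }
}

final class SketchObjectCacheFactory {
    // Returns null instead of a cache that would fail on first use.
    static SketchObjectCache getCache() {
        final Map<String, Object> registry = SketchRegistryHolder.get();
        if (registry == null) {
            return null; // registry not set up (e.g. running in the AM)
        }
        return new SketchObjectCache() {
            @Override
            @SuppressWarnings("unchecked")
            public <T> T retrieve(String key, Supplier<T> loader) {
                return (T) registry.computeIfAbsent(key, k -> loader.get());
            }
        };
    }
}

final class SketchDynamicValue {
    // Assumed caller-side guard: one null check replaces a thrown NPE.
    static Object getLiteral(String id) {
        SketchObjectCache cache = SketchObjectCacheFactory.getCache();
        if (cache == null) {
            return null; // value unavailable; caller can skip min/max evaluation
        }
        return cache.retrieve(id, () -> 42L); // 42L is a placeholder value
    }
}

final class ObjectCacheSketchDemo {
    public static void main(String[] args) {
        // No registry yet: the lookup yields null without throwing.
        System.out.println(SketchDynamicValue.getLiteral("min") == null);
        SketchRegistryHolder.setUp();
        // Registry present: the lookup yields the cached placeholder.
        System.out.println(SketchDynamicValue.getLiteral("min"));
    }
}
```

      With a guard like this, code running where no registry exists pays for a null check per lookup rather than constructing and catching an exception each time.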

      Attachments

        1. HIVE-17209.1.patch
          3 kB
          Rajesh Balamohan


          People

            Assignee: Rajesh Balamohan
            Reporter: Rajesh Balamohan
            Votes: 0
            Watchers: 3
