Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-10671

yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.3.0, 2.0.0
    • Spark
    • None

    Description

      With Hive on Spark, users noticed that in certain cases spark.master=yarn-client offers 2x or 3x better performance than if spark.master=yarn-cluster. However, yarn-cluster is what we recommend and support. Thus, we should investigate and fix the problem. One of the such queries is TPC-H 22.

      Attachments

        1. HIVE-10671.1-spark.patch
          7 kB
          Rui Li
        2. HIVE-10671.2-spark.patch
          8 kB
          Rui Li

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            lirui Rui Li Assign to me
            xuefuz Xuefu Zhang
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment