Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-17718

Hive on Spark Debugging Improvements

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Spark
    • None

    Description

      There are multiple places where it is hard to debug HoS - e.g. the HoS Remote Driver and Client, the Spark RDD graph, etc.

      Attachments

        Issue Links

        1.
        Explain plan should show if a Map/Reduce Work is being cached Sub-task Open liyunzhang Actions
        2.
        Custom Hive on Spark Tab in Spark Web UI Sub-task Open Unassigned Actions
        3.
        Race condition in RemoteSparkJobMonitor Sub-task Open Sahil Takiar Actions
        4.
        Hive logs in Spark Executor and Driver should show thread-id. Sub-task Open Unassigned Actions
        5.
        Improve SparkTask OOM Error Parsing Logic Sub-task Open Unassigned Actions
        6.
        SparkClientImpl should react to errors sent from the RemoteDriver Sub-task Open Unassigned Actions
        7.
        Better console logging for lifecycle of a Spark job Sub-task Open Sahil Takiar Actions
        8.
        Add units to displayed Spark metrics Sub-task Open Unassigned Actions
        9.
        Organize Spark metrics into multiple groups Sub-task Open Unassigned Actions
        10.
        Create Docker env for running HoS locally Sub-task Open Aihua Xu Actions
        11.
        Race condition when timeout task is invoked during SASL negotation Sub-task In Progress Aihua Xu Actions
        12.
        hive.spark.log.dir isn't honored for TestSparkCliDriver Sub-task Open Unassigned Actions
        13.
        NPE in SparkTask#printConsoleMetrics Sub-task Open Unassigned Actions
        14.
        Propagate ExecutionExceptions from the driver thread to the client Sub-task Open Unassigned Actions
        15.
        Re-add HIVE-19787: Log message when spark-submit has completed Sub-task Open Sahil Takiar Actions
        16.
        Typo in MetricsCollection for OutputMetrics Sub-task Patch Available Adesh Kumar Rao Actions
        17.
        Improve logging when HoS Driver is killed due to exceeding memory limits Sub-task Open Unassigned Actions
        18.
        JobResultSerializer uses wrong registration id in KyroMessageCodec Sub-task Open Sahil Takiar Actions
        19.
        Remove 30m min value for hive.spark.session.timeout Sub-task Patch Available Unassigned Actions
        20.
        SparkSession should be able to close a session while it is being opened Sub-task Open Antal Sinkovits Actions
        21.
        Parse Spark error blacklist errors Sub-task Open Unassigned Actions

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned Assign to me
            stakiar Sahil Takiar

            Dates

              Created:
              Updated:

              Slack

                Issue deployment