Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-3062

savepoint rollback of last but one savepoint fails

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Not A Problem
    • None
    • 0.11.0
    • None

    Description

      so, I created 2 savepoints as below. 

      c1, c2, c3, sp1, c4, sp2, c5.

      tried savepoint rollback for sp2 and it worked. but left trailing rollback meta files. 

      again tried to savepoint roll back with sp1 and it failed. stacktrace does not have sufficient info. 

      21/12/18 06:20:00 INFO HoodieActiveTimeline: Loaded instants upto : Option{val=[==>20211218061954430__rollback__REQUESTED]}
      21/12/18 06:20:00 INFO BaseRollbackPlanActionExecutor: Requesting Rollback with instant time [==>20211218061954430__rollback__REQUESTED]
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 66
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 26
      21/12/18 06:20:00 INFO BlockManagerInfo: Removed broadcast_3_piece0 on 192.168.1.4:54359 in memory (size: 25.5 KB, free: 366.2 MB)
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 110
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 99
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 47
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 21
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 43
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 55
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 104
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 124
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 29
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 91
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 123
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 120
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 25
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 32
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 92
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 76
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 89
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 102
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 50
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 49
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 116
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 96
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 118
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 44
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 60
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 87
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 77
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 75
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 9
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 72
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 2
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 37
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 113
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 67
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 28
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 95
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 59
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 68
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 45
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 39
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 74
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 20
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 90
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 56
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 58
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 61
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 13
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 46
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 101
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 105
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 81
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 63
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 78
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 4
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 31
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 71
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 3
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 1
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 114
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 51
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 18
      21/12/18 06:20:00 INFO HoodieActiveTimeline: Loaded instants upto : Option{val=[==>20211218061954430__rollback__REQUESTED]}
      21/12/18 06:20:00 INFO BlockManagerInfo: Removed broadcast_4_piece0 on 192.168.1.4:54359 in memory (size: 34.6 KB, free: 366.3 MB)
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 109
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 0
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 40
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 119
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 117
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 84
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 41
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 16
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 107
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 24
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 62
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 93
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 22
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 115
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 54
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 14
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 86
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 65
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 12
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 10
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 42
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 82
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 79
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 30
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 6
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 64
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 112
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 7
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 53
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 33
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 17
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 80
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 35
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 48
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 69
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 100
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 108
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 111
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 5
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 34
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 52
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 85
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 121
      21/12/18 06:20:00 INFO BlockManagerInfo: Removed broadcast_2_piece0 on 192.168.1.4:54359 in memory (size: 25.5 KB, free: 366.3 MB)
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 106
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 57
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 122
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 88
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 98
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 15
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 94
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 97
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 19
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 36
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 23
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 38
      21/12/18 06:20:00 INFO BlockManagerInfo: Removed broadcast_1_piece0 on 192.168.1.4:54359 in memory (size: 25.5 KB, free: 366.3 MB)
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 103
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 83
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 27
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 70
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 73
      21/12/18 06:20:00 INFO ContextCleaner: Cleaned accumulator 11
      21/12/18 06:20:00 INFO HoodieActiveTimeline: Loaded instants upto : Option{val=[==>20211218061954430__rollback__REQUESTED]}
      21/12/18 06:20:00 WARN SparkMain: The commit "20211217183516921" failed to roll back.
      21/12/18 06:20:00 INFO SparkUI: Stopped Spark web UI at http://192.168.1.4:4042
      21/12/18 06:20:00 INFO MapOutputTrackerMasterEndpoint: MapOutputTrackerMasterEndpoint stopped!
      21/12/18 06:20:00 INFO MemoryStore: MemoryStore cleared
      21/12/18 06:20:00 INFO BlockManager: BlockManager stopped
      21/12/18 06:20:00 INFO BlockManagerMaster: BlockManagerMaster stopped
      21/12/18 06:20:00 INFO OutputCommitCoordinator$OutputCommitCoordinatorEndpoint: OutputCommitCoordinator stopped!
      21/12/18 06:20:00 INFO SparkContext: Successfully stopped SparkContext
      21/12/18 06:20:00 INFO ShutdownHookManager: Shutdown hook called
      21/12/18 06:20:00 INFO ShutdownHookManager: Deleting directory /private/var/folders/ym/8yjkm3n90kq8tk4gfmvk7y140000gn/T/spark-983167c8-60f0-493c-9d31-9d69131ddcc1
      21/12/18 06:20:00 INFO ShutdownHookManager: Deleting directory /private/var/folders/ym/8yjkm3n90kq8tk4gfmvk7y140000gn/T/spark-a11f663a-03a7-47a7-87f1-39859094f0cf
      hudi:hudi_trips_cow->37488835 [Spring Shell] INFO  org.apache.hudi.common.table.HoodieTableMetaClient  - Loading HoodieTableMetaClient from /tmp/hudi_trips_cow
      37488867 [Spring Shell] INFO  org.apache.hudi.common.table.HoodieTableConfig  - Loading table properties from /tmp/hudi_trips_cow/.hoodie/hoodie.properties
      37488867 [Spring Shell] INFO  org.apache.hudi.common.table.HoodieTableMetaClient  - Finished Loading Table of type COPY_ON_WRITE(version=1, baseFileFormat=PARQUET) from /tmp/hudi_trips_cow
      Savepoint "20211217183516921" failed to roll back 

       

      and timeilne is at an inflight state.  

      -rw-r--r--  1 nsb  wheel     0 Dec 17 19:57 20211217195708258.commit.requested
      -rw-r--r--  1 nsb  wheel  2594 Dec 17 19:57 20211217195708258.inflight
      -rw-r--r--  1 nsb  wheel  4425 Dec 17 19:57 20211217195708258.commit
      -rw-r--r--  1 nsb  wheel     0 Dec 17 19:57 20211217195708258.savepoint.inflight
      -rw-r--r--  1 nsb  wheel  1168 Dec 17 19:57 20211217195708258.savepoint
      -rw-r--r--  1 nsb  wheel     0 Dec 17 20:00 20211217200028051.restore.inflight
      -rw-r--r--  1 nsb  wheel  1703 Dec 17 20:00 20211217200028099.rollback.requested
      -rw-r--r--  1 nsb  wheel  1703 Dec 17 20:00 20211217200028099.rollback.inflight
      -rw-r--r--  1 nsb  wheel  2770 Dec 17 20:00 20211217200028051.restore
      
      -rw-r--r--  1 nsb  wheel     0 Dec 18 06:19 20211218061954381.restore.inflight
      -rw-r--r--  1 nsb  wheel  1703 Dec 18 06:20 20211218061954430.rollback.requested 

       

      there is a minor bug w/ cli wrt savepoint. refer to https://issues.apache.org/jira/browse/HUDI-3059 for more info

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            shivnarayan sivabalan narayanan
            shivnarayan sivabalan narayanan
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Agile

                Completed Sprint:
                Hudi-Sprint-Mar-22 ended 05/Apr/22
                View on Board

                Slack

                  Issue deployment