Apache IoTDB / IOTDB-5019

[write] Data region leader writes many WAL files after restarting the DataNode on it


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version: 0.14.0-SNAPSHOT
    • Fix Version: None
    • Component: Core/Cluster
    • Sprint: 2022-11-Cluster

    Description

      [write] The data region leader writes many WAL files after the DataNode it runs on is restarted.

      environment:
      3C3D cluster (3 ConfigNodes, 3 DataNodes), Nov. 21

      reproduction:
      1. Use iotdb-benchmarks to write data to the IoTDB cluster for more than 6 hours: only 1 device with 1 sensor holding double values, and 2 replicas.
      2. Node 46 failed to write data, so I restarted its DataNode, which hosts the data region leader.
      3. Continue writing data to the same timeseries for about 8 hours. Most of the data ends up on node 44.

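      Questions:

      1. Before node 46 was restarted, data was distributed fairly evenly between nodes 44 and 46. Why, after restarting 46, are the WAL files written almost exclusively on 44?

      2. Why were so many WAL files written, far more than the number and size of the data would account for?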

       

      show regions (identical before and after restarting the DataNode):
       |RegionId|Type|Status|Database|SeriesSlotId|TimeSlotId|DataNodeId|Host|RpcPort|Role|
       |10|SchemaRegion|Running|root.aggr.g_0|1|0|1|172.20.70.44|6667|Follower|
       |10|SchemaRegion|Running|root.aggr.g_0|1|0|5|172.20.70.46|6667|Leader|
       |11|DataRegion|Running|root.aggr.g_0|1|10|1|172.20.70.44|6667|Follower|
       |11|DataRegion|Running|root.aggr.g_0|1|10|5|172.20.70.46|6667|Leader|

      iotdb-1: 44
      iotdb-2: 45
      iotdb-3: 46
      files:

      [atmos@i-rh6m726k root.aggr.g_0]$ ansible allnodes -m shell -a "find $IOTDB_HOME/data/datanode/data/sequence/root.aggr.g_0 -type f | wc -l"
      iotdb-1 | CHANGED | rc=0 >>
      1694
      iotdb-2 | CHANGED | rc=0 >>
      966
      iotdb-3 | CHANGED | rc=0 >>
      183
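
      To put the per-node counts above in proportion, here is a minimal Python sketch that parses the pasted ansible output and computes each node's share of the files. It assumes the output format shown above (a `host | CHANGED | rc=0 >>` line followed by a bare count). The helper names `parse_ansible_counts` and `shares` are hypothetical and not part of IoTDB or ansible.

```python
import re

# Output pasted from the ansible command in this report.
SAMPLE = """\
iotdb-1 | CHANGED | rc=0 >>
1694
iotdb-2 | CHANGED | rc=0 >>
966
iotdb-3 | CHANGED | rc=0 >>
183
"""

# Assumed header shape: "<host> | CHANGED | rc=0 >>"
HEADER = re.compile(r"^(\S+) \| CHANGED \| rc=0 >>$")


def parse_ansible_counts(output: str) -> dict[str, int]:
    """Map each host to the file count printed on the line after its header."""
    counts: dict[str, int] = {}
    host = None
    for raw in output.splitlines():
        line = raw.strip()
        match = HEADER.match(line)
        if match:
            host = match.group(1)
        elif host is not None and line.isdigit():
            counts[host] = int(line)
            host = None
    return counts


def shares(counts: dict[str, int]) -> dict[str, float]:
    """Return each host's fraction of the total file count."""
    total = sum(counts.values())
    if total == 0:
        return {h: 0.0 for h in counts}
    return {h: n / total for h, n in counts.items()}


counts = parse_ansible_counts(SAMPLE)
for host, frac in shares(counts).items():
    print(f"{host}: {counts[host]} files ({frac:.1%})")
```

      Note that the command counts files under `data/sequence`, so the figures describe that directory rather than the WAL directory itself. Running the same `find` against the WAL path would be needed to compare WAL file counts directly.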
      

      monitor:

          People

            Mrquan Quan Siyi
            changxue changxue


              Agile

                Completed Sprint:
                2022-11-Cluster ended 11/Dec/22