Hadoop HDFS / HDFS-10714

Issue in handling checksum errors in write pipeline when fault DN is LAST_IN_PIPELINE

Details

    • Type: Bug
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved

    Description

      We came across an issue where a write fails even though 7 DNs are available, because of a network fault at one datanode that is LAST_IN_PIPELINE. It is similar to HDFS-6937.

      Scenario: DN3 has a network fault and the minimum replication is 2.

      Write pipeline:
      DN1->DN2->DN3 => DN3 gives an ERROR_CHECKSUM ack, so DN2 is marked as bad
      DN1->DN4->DN3 => DN3 gives an ERROR_CHECKSUM ack, so DN4 is marked as bad
      ….
      And so on (every time, DN3 is LAST_IN_PIPELINE). This continues until there are no more datanodes left to construct the pipeline.
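
The scenario above can be illustrated with a small stand-alone simulation. This is only a sketch of the blame policy as described in this report, not the actual HDFS client code: the class `PipelineBlameSketch`, the methods `ackFromLast` and `simulate`, and the spare node names DN5 to DN7 are hypothetical and exist only for illustration. It assumes that whenever the last node acks ERROR_CHECKSUM, the node immediately upstream of it is marked bad and replaced.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

// Hypothetical model of the behaviour described in this issue; not HDFS code.
public class PipelineBlameSketch {
    enum Ack { SUCCESS, ERROR_CHECKSUM }

    // Assumption: a node with a network fault always sees corrupt data on
    // receipt and therefore always acks ERROR_CHECKSUM.
    static Ack ackFromLast(String last, String faulty) {
        return last.equals(faulty) ? Ack.ERROR_CHECKSUM : Ack.SUCCESS;
    }

    // Rebuilds the pipeline first -> middle -> last with each spare in turn
    // as the middle node. On ERROR_CHECKSUM from the last node, the middle
    // node is blamed. Returns the nodes marked bad, in order.
    static List<String> simulate(String last, String faulty, List<String> spares) {
        Deque<String> remaining = new ArrayDeque<>(spares);
        List<String> markedBad = new ArrayList<>();
        while (!remaining.isEmpty()) {
            String middle = remaining.poll();
            if (ackFromLast(last, faulty) == Ack.SUCCESS) {
                break; // the write would proceed normally
            }
            markedBad.add(middle); // upstream node blamed; faulty node stays
        }
        return markedBad;
    }

    public static void main(String[] args) {
        List<String> bad = simulate("DN3", "DN3",
                Arrays.asList("DN2", "DN4", "DN5", "DN6", "DN7"));
        // Every healthy middle node is blamed; DN3 itself is never marked bad.
        System.out.println(bad); // prints [DN2, DN4, DN5, DN6, DN7]
    }
}
```

In this model the faulty node is never a candidate for removal, so the loop ends only when the spares run out, which matches the write failure reported here.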

      Attachments

        1. HDFS-10714-01-draft.patch
          19 kB
          Vinayakumar B

          People

            vinayakumarb Vinayakumar B
            brahmareddy Brahma Reddy Battula
            Votes: 0
            Watchers: 16
