Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-10226

Track and use BlockScheduled size for DatanodeDescriptor instead of count.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Tracking count will result in inaccurate estimation of remaining space in case of different block sized files being written.

      This issue can happen when parallel write happen with different block size.

      For Example:
      Datanode Capacity is 10GB, available is 2GB
      ClientA wants to write 2 blocks with block size 1GB
      ClientB wants to write 2 blocks with block size 128MB

      Here ClientB thinks scheduled size 128MB *2 = 256MB and write success where clientA write will fail.

      Attachments

        Activity

          People

            brahmareddy Brahma Reddy Battula
            brahmareddy Brahma Reddy Battula
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: