Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-324

Generate a network infrastructre map

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • None
    • None
    • None
    • None

    Description

      Assuming an inhomogeneous network it might be sensible to not only collect metrics about to and from the nodes but also

      • collect latency between nodes
      • network io between nodes
      • daytime depending network io between nodes (i.e. network is much slower in the morning time and in the early evening time)
      • network failures

      Assuming the we collect aging information over a period of time, it would allow us to create a "learning cloud" in senses of its infrastructure. From this a network map we can

      • see subclusters of high speed linked nodes*
      • see unreliable connections between nodes
      • see heavily used links over the time

      These information can be refed into the DFS (for data distribution as well as for the balancer) logic to increase its reliability and its performance a lot.

      *Note : This could already be managed with the rack awareness, but the aging approach would make this much more fine grained and in an automatic manner.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              mokai Kai Mosebach
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: