Hadoop Common / HADOOP-257

starting one data node thread to manage multiple data directories

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 0.2.0
    • Fix Version/s: 0.4.0
    • Component/s: None
    • Labels: None

    Description

      If a data node is configured with multiple data directories, the current DFS implementation starts multiple data node threads, each of which manages one data directory and talks to the name node independently. From the name node's point of view, there are multiple data nodes instead of one.

      I think a more scalable solution would be to start a single data node thread that manages all of the data directories. That single thread would then need to handle the block allocation problem itself, that is, deciding which data directory each new block is written to.
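
      As a rough illustration of the proposal, the sketch below shows one data node object that owns several data directories, chooses a directory for each new block, and reports a single combined capacity. It is not taken from the Hadoop code base: the class name MultiDirDataNode, its methods, and the round-robin allocation policy are all assumptions made for this example, and the issue itself does not say which policy should be used.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: a single data node that manages several data directories. */
class MultiDirDataNode {

    /** One configured data directory and the space still available in it. */
    static final class Volume {
        final String path;
        long freeBytes;

        Volume(String path, long freeBytes) {
            this.path = path;
            this.freeBytes = freeBytes;
        }
    }

    private final List<Volume> volumes = new ArrayList<>();
    private int next = 0; // round-robin cursor (assumed policy, not from the issue)

    synchronized void addVolume(String path, long freeBytes) {
        volumes.add(new Volume(path, freeBytes));
    }

    /**
     * Chooses a data directory for a new block: visits directories in
     * round-robin order and skips any that cannot hold the block.
     * Returns the path of the chosen directory.
     */
    synchronized String allocateBlock(long blockSize) {
        for (int i = 0; i < volumes.size(); i++) {
            Volume v = volumes.get(next);
            next = (next + 1) % volumes.size();
            if (v.freeBytes >= blockSize) {
                v.freeBytes -= blockSize;
                return v.path;
            }
        }
        throw new IllegalStateException(
            "no data directory has " + blockSize + " bytes free");
    }

    /** Free space summed over all directories, as one data node would report it. */
    synchronized long totalFreeBytes() {
        long sum = 0;
        for (Volume v : volumes) {
            sum += v.freeBytes;
        }
        return sum;
    }
}
```

      With this shape, the name node would see one data node and one capacity figure, and the choice of directory stays a local decision. A policy based on the most free space would fit behind the same allocateBlock method.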

            People

              Assignee: Konstantin Shvachko (shv)
              Reporter: Hairong Kuang (hairong)
