Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1075

Separately configure connect timeouts from read timeouts in data path

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • datanode, hdfs-client
    • None

    Description

      The timeout configurations in the write pipeline overload the read timeout to also include a connect timeout. In my experience, if a node is down it can take many seconds to get back an exception connect, whereas if it is up it will accept almost immediately, even if heavily loaded (the kernel listen backlog picks it up very fast). So in the interest of faster dead node detection from the writer perspective, the connect timeout should be configured separately, usually to a much lower time than the read timeout.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tlipcon Todd Lipcon
              Votes:
              0 Vote for this issue
              Watchers:
              15 Start watching this issue

              Dates

                Created:
                Updated: