Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-14676

Calls to libhdfs (via pyarrow) hang after a while

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: 3.0.3
    • Fix Version/s: None
    • Component/s: hdfs, libhdfs
    • Labels:
      None
    • Environment:

      hadoop-3.0.3 
      python 3.6 
      Centos7

      Description

      I'm using the pyarrow HDFS client in a long running (forever) app that makes connections to HDFS (via libhdfs) as external requests come in and destroys the connection as soon as the request is handled. This happens a large amount of times on separate threads and everything works great.

      The problem is, after the app idles for a while (perhaps hours) and no HDFS connections are made during this time, when the next connection is attempted, it hangs. No exceptions are thrown. As soon as I restart my python app, the HDFS connection works just fine again.

      I'm using the precompiled libhdfs.so directly from the hadoop-3.0.3 distribution. Do I typically need to recompile libhdfs.so for my OS, or is the one out of the box typically fine?

      I've checked with the Arrow community first- they've recommended I check with the Hadoop community since all the pyarrow client does is pass through the commands to libhdfs.

      Any suggestions on debugging this hanging issue would be appreciated.

       

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              ftzeng82 Fred Tzeng
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated: