Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-13535

connect hdfs error

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 5.0.0
    • None
    • Python
    • None

    Description

      when i use pyarrow to connect my hdfs, I meet error 

      I use 

      from pyarrow import fs
      print(fs.FileSystem.from_uri("hdfs://"))

      the error shows loadFileSystems error:
      (unable to get root cause for java.lang.NoClassDefFoundError)
      (unable to get stack trace for java.lang.NoClassDefFoundError)
      hdfsBuilderConnect(forceNewInstance=1, nn=hdfs://, port=0, kerbTicketCachePath=(NULL), userName=(NULL)) error:
      (unable to get root cause for java.lang.NoClassDefFoundError)
      (unable to get stack trace for java.lang.NoClassDefFoundError)
      /arrow/cpp/src/arrow/filesystem/hdfs.cc:51: Failed to disconnect hdfs client: IOError: HDFS hdfsFS::Disconnect failed, errno: 9 (Bad file descriptor)
      Traceback (most recent call last):
      File "/home/tdops/fucheng.pan/ray-code/read.py", line 15, in <module>
      print(fs.FileSystem.from_uri("hdfs:"))
      File "pyarrow/_fs.pyx", line 347, in pyarrow._fs.FileSystem.from_uri
      File "pyarrow/error.pxi", line 122, in pyarrow.lib.pyarrow_internal_check_status
      File "pyarrow/error.pxi", line 99, in pyarrow.lib.check_status
      OSError: HDFS connection failed

      Attachments

        Activity

          People

            Unassigned Unassigned
            pfcqlj cheng pan
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: