Apache Arrow / ARROW-4820

[Python] Derived Hadoop classpath is not correct


Details

    Description

      In hdfs.py, the method _derive_hadoop_classpath adds the jar files under $HADOOP_HOME to the Hadoop classpath, but the Hadoop configuration directory is not included in the classpath.

      When Hadoop HA mode is enabled, the HDFS URI looks like this: hdfs://ns

      When the HADOOP_CONF_DIR directory is not on the Hadoop classpath, libhdfs cannot locate the correct hdfs-site.xml. In HA mode, the HDFS service name is then parsed as a host name, which is incorrect.
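
A minimal sketch of the kind of fix this report suggests, not the actual pyarrow implementation: build the classpath from the jar files under the Hadoop installation and put the configuration directory in front of them. The function name derive_classpath_with_conf, its parameters, and the etc/hadoop fallback are assumptions made for illustration.

```python
import os
from pathlib import Path


def derive_classpath_with_conf(hadoop_home, conf_dir=None):
    """Hypothetical helper: config directory followed by all jars under hadoop_home."""
    # Collect every jar below the Hadoop installation, sorted for stable output.
    jars = sorted(str(p) for p in Path(hadoop_home).rglob("*.jar"))
    if conf_dir is None:
        # Assumption: prefer HADOOP_CONF_DIR, else the conventional etc/hadoop layout.
        conf_dir = os.environ.get("HADOOP_CONF_DIR") or os.path.join(
            hadoop_home, "etc", "hadoop"
        )
    # The config directory goes first so hdfs-site.xml can be found on the classpath.
    return os.pathsep.join([str(conf_dir)] + jars)
```

Assigning the result to the CLASSPATH environment variable before connecting should let libhdfs read hdfs-site.xml, so that a nameservice such as ns in hdfs://ns is resolved through the HA configuration instead of being treated as a host name.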

       

            People

              Tiger068 Tiger068

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 50m