  1. Apache Arrow
  2. ARROW-6756

[C++][Python] Include HDFS `getfacl` in `pyarrow.hdfs.HadoopFileSystem`


Details

    • Type: Wish
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Do
    • Affects Version/s: 0.13.0
    • Fix Version/s: None
    • Component/s: Python

    Description

      Extended HDFS filesystem attributes are exposed through the `getfacl` command.

      It would be immensely helpful to have this information accessible via:

      pyarrow.hdfs.HadoopFileSystem

      Link to the official Hadoop docs where this is discussed in more detail:

      https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/FileSystemShell.html#getfacl

      Sample output from the *nix shell:

      $ hadoop fs -getfacl /path/to/hdfs/dir
      # file: /path/to/hdfs/dir
      # owner: hive
      # group: hive
      user::rwx
      group:unix_group_with_acl_privs_defined:rwx
      group::---
      user:hive:rwx
      group:hive:rwx
      mask::rwx
      other::--x
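
      Until something like this is available in pyarrow, one possible workaround is to call the Hadoop CLI and parse its text output. The sketch below is only an illustration of the kind of structured result this request is after. It is not a pyarrow API: `AclEntry`, `parse_getfacl`, and `getfacl` are hypothetical names, and it assumes the `hadoop` executable is on the PATH and prints output in the format shown above.

```python
import subprocess
from typing import NamedTuple


class AclEntry(NamedTuple):
    """One ACL line (hypothetical helper type, not part of pyarrow)."""
    kind: str         # "user", "group", "mask", or "other"
    name: str         # empty string for the unnamed owner/group entry
    permissions: str  # e.g. "rwx" or "--x"
    is_default: bool  # True for "default:" entries on directories


def parse_getfacl(text):
    """Parse text printed by `hadoop fs -getfacl` into a dict."""
    result = {"file": None, "owner": None, "group": None, "entries": []}
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("#"):
            # Header lines look like "# owner: hive".
            key, _, value = line.lstrip("# ").partition(":")
            if key in ("file", "owner", "group"):
                result[key] = value.strip()
            continue
        # Drop any trailing comment such as "#effective:r--".
        line = line.split("#", 1)[0].strip()
        parts = line.split(":")
        is_default = parts[0] == "default"
        if is_default:
            parts = parts[1:]
        if len(parts) != 3:
            continue  # not a recognizable ACL entry; skip it
        kind, name, perms = parts
        result["entries"].append(AclEntry(kind, name, perms, is_default))
    return result


def getfacl(path):
    """Run the Hadoop CLI (assumed to be on PATH) and parse its output."""
    completed = subprocess.run(
        ["hadoop", "fs", "-getfacl", path],
        check=True, capture_output=True, text=True,
    )
    return parse_getfacl(completed.stdout)
```

      With this sketch, `getfacl("/path/to/hdfs/dir")["entries"]` would give a list of `AclEntry` tuples that can be filtered by `kind` or `name`. Spawning a JVM process per call is slow, which is part of why native access from `HadoopFileSystem` would be preferable.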

          People

            Assignee: Unassigned
            Reporter: bbapache bb
            Votes: 0
            Watchers: 4