Apache Arrow / ARROW-6756

[C++][Python] Include HDFS `getfacl` in `pyarrow.hdfs.HadoopFileSystem`

    Details

    • Type: Wish
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.13.0
    • Fix Version/s: None
    • Component/s: Python
    • Labels:

      Description

      HDFS access control lists (ACLs), which extend the basic owner/group/other permissions, are exposed through the `hadoop fs -getfacl` command.

      It would be immensely helpful to have this information accessible via:

      pyarrow.hdfs.HadoopFileSystem

       

      Link to the official Hadoop docs where this is discussed in more detail:

      https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/FileSystemShell.html#getfacl

      Sample output from the *nix shell:

      $ hadoop fs -getfacl /path/to/hdfs/dir
       # file: /path/to/hdfs/dir
       # owner: hive
       # group: hive
       user::rwx
       group:unix_group_with_acl_privs_defined:rwx
       group::---
       user:hive:rwx
       group:hive:rwx
       mask::rwx
       other::--x
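
Until something like this exists in `pyarrow.hdfs.HadoopFileSystem`, one possible workaround is to shell out to the Hadoop CLI and parse the text shown above. The sketch below is only an illustration of the kind of structured result this request has in mind: `parse_getfacl` and `hdfs_getfacl` are hypothetical helper names, not part of pyarrow, and the code assumes a `hadoop` executable is on the `PATH` and that the output follows the format in the sample.

```python
import subprocess


def parse_getfacl(text):
    """Parse the text printed by `hadoop fs -getfacl` into a dict.

    Hypothetical helper, not a pyarrow API. Assumes the output format
    shown in the sample above.
    """
    acl = {"file": None, "owner": None, "group": None, "entries": []}
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("#"):
            # Header lines look like "# owner: hive".
            key, _, value = line.lstrip("# ").partition(":")
            key = key.strip()
            if key in ("file", "owner", "group"):
                acl[key] = value.strip()
            continue
        # Drop a trailing "#effective:..." comment if one is present.
        line = line.split("#", 1)[0].strip()
        # Default ACL entries carry a "default:" prefix.
        is_default = line.startswith("default:")
        if is_default:
            line = line[len("default:"):]
        parts = line.split(":")
        if len(parts) != 3:
            continue  # Not an entry of the form type:name:perms; skip it.
        kind, name, perms = parts
        acl["entries"].append(
            {"type": kind, "name": name, "perms": perms, "default": is_default}
        )
    return acl


def hdfs_getfacl(path):
    """Run `hadoop fs -getfacl <path>` and return the parsed ACL.

    Assumes the `hadoop` CLI is installed and configured for the cluster.
    """
    completed = subprocess.run(
        ["hadoop", "fs", "-getfacl", path],
        check=True,
        capture_output=True,
        text=True,
    )
    return parse_getfacl(completed.stdout)
```

With a working Hadoop client, `hdfs_getfacl("/path/to/hdfs/dir")` would return the owner, group, and a list of ACL entries for that directory. Spawning a JVM per call is slow, which is part of why native support in the filesystem class would be preferable.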

            People

            • Assignee: Unassigned
            • Reporter: bbapache bb
            • Votes: 0
            • Watchers: 2