Apache Arrow / ARROW-9682

[Python] Unable to specify the partition style with pq.write_to_dataset


Details

    Description

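      I can import and test DirectoryPartitioning, but I cannot find a way to write a dataset that uses it. pq.write_to_dataset appears to always write the "hive" style (key=value directory names), with no option to choose another partitioning style. Is there a way to write a dataset with directory-style partitioning so that I can test this?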

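      import pyarrow as pa
      from pyarrow.dataset import DirectoryPartitioning
      
      # Directory-style partitioning: path segments map to the schema fields in order
      partitioning = DirectoryPartitioning(pa.schema([("year", pa.int16()), ("month", pa.int8()), ("day", pa.int8())]))
      
      # Parse a path into an expression on year, month, and day
      print(partitioning.parse("/2009/11/3"))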
      
      

       

People

    Assignee: Unassigned
    Reporter: Lance Dacey (ldacey)