Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-17089

[Python] Use `.arrow` as extension for IPC file dataset

    XMLWordPrintableJSON

Details

    Description

      Same as ARROW-17088

      As noted in the following document, the recommended extension for IPC files is now `.arrow`.

      > We recommend the “.arrow” extension for files created with this format.
      https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format

      However, currently when writing a dataset with the pyarrow.dataset.write_dataset function, the default extension is .feather when arrow or ipc or feather is selected.
      https://github.com/apache/arrow/blob/b8067151db9bfc04860285fdd8b5e73703346037/python/pyarrow/_dataset.pyx#L1149-L1151

      Attachments

        Issue Links

          Activity

            People

              eitsupi SHIMA Tatsuya
              eitsupi SHIMA Tatsuya
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 3h 50m
                  3h 50m