Apache Arrow / ARROW-7906

[C++][Python] Full functionality for ORC format

Details

    Description

      Like Parquet, ORC has a large following in the big data community, and it performs better than Parquet in some use cases.
      However, Python currently has no standard function for writing ORC files.

      The ORC project maintains its own C++ implementation (https://github.com/apache/orc/tree/master/c%2B%2B), so it should not take much effort to integrate it into Arrow (C++) and expose it to Python.

            People

              yingzhou474 Ian Alexander Joiner
              PereTang HAOFENG DENG
              Votes: 2
              Watchers: 6

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 40h 10m