Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-17303

[Java] Read "arrow" (IPC and streaming) files using org.apache.arrow.dataset.jni.NativeDatasetFactory

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 9.0.0
    • 10.0.0
    • Java

    Description

      Fetch "arrow" (IPC and streaming) files using org.apache.arrow.dataset.jni.NativeDatasetFactory in Java API.  This functionality required to implement Arrow file/Stream input format in my use case to process large amount of existing geospatial ARROW format data in Apache Spark data source. Optimized Analytics Package (OAP) for Spark also can leverage this feature of Dataset on JVM. They use FileSystemDatasetFactory in this [Spark gazelle_plugin adapter

      Attachments

        Activity

          People

            igor.suhorukov Igor Suhorukov
            igor.suhorukov Igor Suhorukov
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 2h 50m
                2h 50m