Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-57

Format: Draft data headers IDL for data interchange

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.1.0
    • Format
    • None

    Description

      From the mailing list discussion, we need to define the structure of data headers to describe a "page" of Arrow data. In Apache Parquet these are referred to as the data page headers and are separate from the schema metadata.

      As part of this JIRA:

      • Choose a technology (it need not be the only one, but it would be good to have a reference tool) for representing the data page header in "serialized" form (as can be stored in shared memory and interpreted by any system with access to that memory). Google's Flatbuffers libraries has been discussed for this use case.
      • Draft IDL specification for the data headers

      Attachments

        Activity

          People

            wesm Wes McKinney
            wesm Wes McKinney
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: