Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-1647

[Java] support for Arrow's float16

    XMLWordPrintableJSON

Details

    Description

      DESCRIPTION

       

      I'm wondering if there's any interest in supporting Arrow's float16 type in Parquet.

      There seem to be one or two float16 / halffloat tickets here (e.g., PARQUET-1403) but nothing that speaks to adding half-float support to Parquet in-general.

       

      PLANS

      I'm able to spend some time on this, if someone points me  in the right direction.

       

      1. Add the HALFFLOAT or FLOAT16 enum (any preferred naming convention?) to https://github.com/apache/parquet-format/blob/master/src/main/thrift/parquet.thrift#L32
      2. Add HALFFLOAT to org.apache.parquet.schema.PrimitiveType
      3. Add HALFFLOAT support to org.apache.parquet.arrow.schema.SchemaConverter
      4. Add encoding for new type at org.apache.parquet.column.Encoding
      5. ??

      If anyone has any interest in this, pointers, or comments, they would be greatly appreciated!

      Attachments

        Issue Links

          Activity

            People

              Jiashen Zhang Jiashen Zhang
              the_alchemist The Alchemist
              Votes:
              3 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 10m
                  10m