Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-660

Writing Protobuf messages with extensions results in an error or data corruption.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.8.1
    • 1.9.0, 1.8.2
    • parquet-mr
    • None
    • Patch

    Description

      Currently, in ProtoParquetWriter there's not any support for extended protobuf messages. An attempt to write a protobuf message with extension either:

      • Results in an uninformative error like this:
        parquet.proto.ProtoWriteSupport: Cannot write message 
        YOUR_EXTENSION_TYPE cannot be cast to A_BASE_FIELD_TYPE : BASE_FIELD_NAME 
        
      • Results in a data corruption, if an extension type is compatible with a base field type with a corresponding index.

      Imo there are two possible solutions:

      1. Ignore extension fields.
      2. When an extension field is detected in a message, throw an informative error.

      Attachments

        Activity

          People

            jkukul Jakub Kukul
            jkukul Jakub Kukul
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: