  1. Parquet
  2. PARQUET-354

Question on parquet-protobuf and parquet-pig


Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.8.0
    • Fix Version/s: None
    • Component/s: parquet-mr

    Description

      Hi,

      I have a question about protobuf-to-Parquet conversion.
      I have a message like this (simplified):

       
      message MyMessage {
        repeated string language = 1;
      }
      

      parquet-protobuf converts it to the following schema:

      message MyMessage {
        repeated binary language (UTF8);
      }
      

      But, according to TestPigSchemaConverter.java, the correct schema should be:

      message MyMessage {
        optional group language (LIST) {
          repeated binary value (UTF8);
        }
      }
      

      The language field is an optional list of languages: I want to store zero or more languages.

      Which one produces the correct schema for my case: parquet-protobuf or parquet-pig?
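
To make the difference between the two layouts concrete, here is a minimal Python sketch. It assumes the Dremel-style (repetition level, definition level, value) encoding that Parquet uses for nested columns. The function names are hypothetical and this is not code from parquet-protobuf or parquet-pig. It only illustrates which states each schema can represent for a single record.

```python
from typing import List, Optional, Tuple

# One column entry: (repetition level, definition level, value or None)
Levels = List[Tuple[int, int, Optional[str]]]


def shred_bare_repeated(languages: Optional[List[str]]) -> Levels:
    """Schema: repeated binary language (UTF8); max definition level is 1."""
    if not languages:
        # None and [] both collapse to "no values": the schema has no
        # optional parent that could mark the list itself as missing.
        return [(0, 0, None)]
    return [(0 if i == 0 else 1, 1, v) for i, v in enumerate(languages)]


def shred_list_group(languages: Optional[List[str]]) -> Levels:
    """Schema: optional group language (LIST) { repeated binary value (UTF8); }
    Max definition level is 2 (one for the optional group, one for the value)."""
    if languages is None:
        return [(0, 0, None)]  # the group itself is absent (null list)
    if not languages:
        return [(0, 1, None)]  # the group is present but has no elements
    return [(0 if i == 0 else 1, 2, v) for i, v in enumerate(languages)]


if __name__ == "__main__":
    for record in (None, [], ["en", "fr"]):
        print(record, shred_bare_repeated(record), shred_list_group(record))
```

Under these assumptions, both layouts can store zero or more languages. Only the LIST-annotated group can additionally tell a missing list apart from an empty one. A protobuf repeated field has no separate "unset" state, so which layout is appropriate may depend on whether that distinction matters to the readers of the file.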

          People

            Assignee: Unassigned
            Reporter: Christian Nguyen Van Than (cnguyen@nextperf.com)
