Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-2484

Unable to create bloomfilter when writing duplicates values for a field

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 1.13.1
    • None
    • parquet-mr
    • None

    Description

      I'm unable to create a bloomfilter for a field, when I perform writes with repeating values. The bloomfilter returned is null when I try to read such a parquet file. If there are no repeating values, the bloomfilter is created without any issue.
      The working and non-working case in captured in the below repo

      https://github.com/MaheshGPai/parquet-mr-test
      https://github.com/MaheshGPai/parquet-mr-test/blob/main/src/test/java/com/mahesh/test/AppTest.java#L73

      Attachments

        Activity

          People

            Unassigned Unassigned
            maheshgpai Mahesh Pai
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: