Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-177

MemoryManager ensure minimum Column Chunk size

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 1.6.0
    • 1.6.0
    • parquet-mr
    • None

    Description

      The memory manager currently has no limit to how small it will make row groups. This is problematic because jobs that have a large number of writers can result in tiny row groups that hurt performance.

      The following patch will allow a configurable minimum size before killing the job. Default is currently no limit.

      Attachments

        Issue Links

          Activity

            People

              dweeks-netflix Daniel Weeks
              dweeks-netflix Daniel Weeks
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: