Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-11190

grouping on categorical columns should not require Singleton partitioning

Details

    • Improvement
    • Status: Resolved
    • P3
    • Resolution: Fixed
    • None
    • 2.26.0
    • sdk-py-core
    • None

    Description

      Currently groupby with observed=False (the default) requires aggregating in the Singleton partition since it would otherwise produce results with every index value within every partition.

      Attachments

        Issue Links

          Activity

            People

              bhulette Brian Hulette
              bhulette Brian Hulette
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 40m
                  40m