Beam / BEAM-14107

DataFrame API: Support deferred GroupBy.__len__ and GroupBy.ngroups

Details

    Description

      We should be able to build a deferred operation that calculates the number of groups for GroupBy.ngroups.

      len(groupby) might also be possible, but I think Python will complain if __len__ returns something other than an integer (such as a deferred operation). If that's the case, we should raise an error that directs users to alternatives (e.g. ngroups).
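
      A minimal sketch of the concern above, in plain Python with no Beam dependency. The Deferred, NaiveGroupBy, and DeferredGroupBy classes are hypothetical stand-ins, not Beam's actual classes, and NotImplementedError is only a placeholder for whatever error type the DataFrame API would really raise. It shows that len() rejects a non-integer return value from __len__, and how an explicit error could point users to ngroups instead.

```python
class Deferred:
    """Hypothetical stand-in for a deferred (not yet computed) value."""

    def __init__(self, compute):
        self._compute = compute

    def evaluate(self):
        # Run the postponed computation and return its concrete result.
        return self._compute()


class NaiveGroupBy:
    """Hypothetical: tries to return a deferred value from __len__."""

    def __init__(self, keys):
        self._keys = keys

    def __len__(self):
        # Python requires __len__ to return an integer, so len() will
        # raise TypeError when it receives this Deferred object.
        return Deferred(lambda: len(set(self._keys)))


class DeferredGroupBy:
    """Hypothetical: deferred ngroups, and a helpful error from __len__."""

    def __init__(self, keys):
        self._keys = keys

    @property
    def ngroups(self):
        # The number of groups is the number of distinct keys,
        # computed only when the deferred value is evaluated.
        return Deferred(lambda: len(set(self._keys)))

    def __len__(self):
        # Placeholder error type; the message directs users to ngroups.
        raise NotImplementedError(
            "len(groupby) cannot return a deferred value; "
            "use groupby.ngroups instead."
        )


naive_error = None
try:
    len(NaiveGroupBy(["a", "b", "a"]))
except TypeError as exc:
    naive_error = exc  # Python's own, less helpful complaint

gb = DeferredGroupBy(["a", "b", "a"])
helpful_error = None
try:
    len(gb)
except NotImplementedError as exc:
    helpful_error = exc  # message mentions ngroups

group_count = gb.ngroups.evaluate()
```

      In this sketch the naive version fails with Python's generic TypeError, while the second version fails with a message that names ngroups, whose deferred result can still be evaluated later.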

            People

              Assignee: Unassigned
              Reporter: Andy Ye (yeandy)