Uploaded image for project: 'Crunch (Retired)'
  1. Crunch (Retired)
  2. CRUNCH-139

PCollection#length doesn't always reduce the count to a single value

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.5.0
    • None
    • None

    Description

      PCollection#length doesn't explicitly set the number of reducers to 1, which means that the output of the counting mappers can be partitioned. This results only a partial count (i.e. an incorrect value) being returned in the length PObject if the input PCollection spans multiple default reduce partitions.

      Attachments

        1. CRUNCH-139.patch
          1 kB
          Gabriel Reid

        Activity

          People

            gabriel.reid Gabriel Reid
            gabriel.reid Gabriel Reid
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: