Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-12132

DataFrame API: Consider allowing partitioning by column in addition to Index

    XMLWordPrintableJSON

    Details

      Description

      For some DataFrame use-cases it may be beneficial to partition a dataset across the columns as well as across the index.

      One example might be computing a correlation in a DataFrame with a very large number of columns. It would be beneficial to be able to perform pairwise column correlations on separate workers.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                bhulette Brian Hulette
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated: