  1. Beam
  2. BEAM-12132

DataFrame API: Consider allowing partitioning by column in addition to Index

Details

    Description

      For some DataFrame use cases, it may be beneficial to partition a dataset by column as well as by index.

      One example is computing correlations in a DataFrame with a very large number of columns. Each pairwise column correlation depends on only two columns, so it would be beneficial to be able to compute the pairs on separate workers.
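
      To make the correlation example concrete, here is a minimal sketch of the idea in plain Python. It does not use the Beam DataFrame API, and it is not a proposed implementation. The helper names (`pearson`, `column_pair_tasks`, `pairwise_correlations`) and the toy data are hypothetical, and a thread pool stands in for separate workers. The point it illustrates is that once the data is keyed by column pair, each pair is an independent unit of work.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def column_pair_tasks(columns):
    """Yield one independent unit of work per unordered column pair."""
    for a, b in combinations(sorted(columns), 2):
        yield (a, b), columns[a], columns[b]


def pairwise_correlations(columns, max_workers=4):
    """Compute each pairwise correlation as a separate task.

    The thread pool is only a stand-in for distributed workers.
    """
    tasks = list(column_pair_tasks(columns))
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        values = pool.map(lambda t: pearson(t[1], t[2]), tasks)
        return {key: value for (key, _, _), value in zip(tasks, values)}


# Hypothetical toy data: column name -> column values.
columns = {
    "a": [1.0, 2.0, 3.0, 4.0],
    "b": [2.0, 4.0, 6.0, 8.0],
    "c": [4.0, 3.0, 2.0, 1.0],
}
correlations = pairwise_correlations(columns)
```

      With n columns this produces n(n-1)/2 tasks, each touching only two columns. That is the kind of work that partitioning by index alone cannot spread across workers.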


            People

              Assignee: Unassigned
              Reporter: Brian Hulette (bhulette)
