Uploaded image for project: 'SystemDS'
  1. SystemDS
  2. SYSTEMDS-3254

CountDistinct Col and Row & Unique

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • SystemDS 3.1
    • SystemDS 3.1
    • None

    Description

       

      This task is to extend the countDistinct to support column and row counting and add another alias for the same operation called 'unique'.

      example test of countDistinct:

      src/test/scripts/functions/countDistinct/countDistinct.dml

       

      the operations to implement in this task are:

       

      unique(X)

      colUnique(X)

      rowUnique(X)

       

      colCountDistinct(X)

      rowCountDistinct(X)

       

      There are multiple options for the implementation, one suggestion is to extend

      src/main/java/org/apache/sysds/runtime/matrix/data/sketch/MatrixSketch.java

      and move the countDistinctValuesNaive method to your new implementation.

      src/main/java/org/apache/sysds/runtime/matrix/data/LibMatrixCountDistinct.java

       

       

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              baunsgaard Sebastian Baunsgaard
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: