Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-12360

Support using 64-bit long type in SparkR

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Incomplete
    • 1.5.2
    • None
    • SparkR

    Description

      R has no support for 64-bit integers. While in Scala/Java API, some methods have one or more arguments of long type. Currently we support only passing an integer cast from a numeric to Scala/Java side for parameters of long type of such methods. This may have problem covering large data sets.

      Storing a 64-bit integer in a double obviously does not work as some 64-bit integers can not be exactly represented in double format, so x and x+1 can't be distinguished.

      There is a bit64 package (https://cran.r-project.org/web/packages/bit64/index.html) in CRAN which supports vectors of 64-bit integers. We can investigate if it can be used for this purpose.
      two questions are:
      1. Is the license acceptable?
      2. This will have SparkR depends on a non-base third-party package, which may complicate the deployment.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              sunrui Sun Rui
              Votes:
              2 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: