Spark / SPARK-10228

Integer overflow in VertexRDDImpl.count

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 1.4.1
    • Fix Version/s: None
    • Component/s: GraphX
    • Labels: None

    Description

      VertexRDDImpl overrides RDD.count() but aggregates Int instead of Long:

      /** The number of vertices in the RDD. */
      override def count(): Long = {
        partitionsRDD.map(_.size).reduce(_ + _)
      }

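      Each partition size is an Int, so the sum is computed in Int arithmetic and wraps to a negative value once the total exceeds Int.MaxValue (2,147,483,647). Pregel then stops iterating because the number of messages appears "negative", giving incorrect results.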
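The overflow can be reproduced without a Spark cluster. The following is a minimal sketch, not the project's code: `CountOverflowSketch`, `countAsInt`, and `countAsLong` are hypothetical names, and a plain `Seq[Int]` stands in for the per-partition sizes that `partitionsRDD.map(_.size)` would produce. It contrasts summing in Int with widening each size to Long before the reduce, which is one plausible way to avoid the wraparound.

```scala
// Hypothetical stand-in for the reported pattern; a Seq[Int] plays the role
// of the per-partition vertex counts produced by partitionsRDD.map(_.size).
object CountOverflowSketch {

  // Mirrors the reported behavior: sizes are added as Int and only the
  // final (possibly wrapped) result is widened to Long.
  def countAsInt(partitionSizes: Seq[Int]): Long = {
    val total: Int = partitionSizes.reduce(_ + _)
    total.toLong
  }

  // Assumed remedy: widen each size to Long before reducing, so the
  // addition itself happens in 64-bit arithmetic.
  def countAsLong(partitionSizes: Seq[Int]): Long =
    partitionSizes.map(_.toLong).reduce(_ + _)

  def main(args: Array[String]): Unit = {
    // Two "partitions" whose combined size is one past Int.MaxValue.
    val sizes = Seq(Int.MaxValue, 1)
    println(countAsInt(sizes))  // -2147483648 (wrapped)
    println(countAsLong(sizes)) // 2147483648
  }
}
```

Applied to the method quoted above, the same idea would presumably read `partitionsRDD.map(_.size.toLong).reduce(_ + _)`. This is shown as an illustration of the technique rather than as the change that resolved the issue.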

            People

              Assignee: Unassigned
              Reporter: Robin Cheng (hpmv118)
              Votes: 0
              Watchers: 1
