Description
parallelize uses integer multiplication, which cannot handle size over ~47000. This cause issues with lapply
SparkR:::parallelize(sc, 1:47000, 47000) Error in rep(start, end - start) : invalid 'times' argument Error in rep(start, end - start) : invalid 'times' argument In addition: Warning message: In x * length(coll) : NAs produced by integer overflow