Running a variant of PageRank on a 6-node cluster with a 30Gb input dataset. Recently upgraded to Spark 1.1. The workload fails with the following error message(s):
In order to identify the problem, I carried out change set analysis. As I go back in time, the error message changes to:
All the way until Aug 4th. Turns out the problem changeset is 4fde28c.