The current implementation of personalized pagerank in GraphX uses uniform initialization over the full graph - every vertex will get initially activated.
Leads to all vertices being set to resetProb (0.15), which is different from the behavior described in
SPARK-5854, where only the source node should be activated.
The risk is that, after a few iterations, the most activated nodes are the source node and the nodes that were untouched by the propagation. For example in the above example the vertex 2L will always have an activation of 0.15:
Which leads into a higher score for 2L than for 7L and 5L, even though there's no outbound path from 3L to 2L.