Description
Current schema inferance for local python collections halts as soon as there are no NullTypes. This is different than when we specify a sampling ratio of 1.0 on a distributed collection. This could result in incomplete schema information.
Repro:
input = [{"a": 1}, {"b": "coffee"}] df = sqlContext.createDataFrame(input) print df.schema
Discovered while looking at SPARK-2870