Description
When working with the output of a query against a sqlContext, the result a Row object with fields accessible via dot notation rather than something similar to a dictionary.
The issue is that there are libraries (such as pandas and ggplot) that do not know how to operate on Rows but you can supply field names for the libraries to use. When the library attempts to use the row, it expects a data type like a dictionary, indexable using brackets and named field i.e. obj["fieldname"].
Attachments
Issue Links
- is related to
-
SPARK-4561 PySparkSQL's Row.asDict() should convert nested rows to dictionaries
- Resolved
- links to