[CARBONDATA-322] Integration with spark 2.x - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 0.2.0-incubating
Fix Version/s: 1.0.0-incubating
Component/s: spark-integration
Labels:
None

Description

Since spark 2.0 released. there are many nice features such as more efficient parser, vectorized execution, adaptive execution.
It is good to integrate with spark 2.x

current integration up to Spark v1.6 is tightly coupled with spark, we would like to cleanup the interface with following design points in mind:

1. decoupled with Spark, integration based on Spark's v2 datasource API
2. Enable vectorized carbon reader
3. Support saving DataFrame to Carbondata file through Carbondata's output format.
...