Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
0.2.0-incubating
-
None
Description
Since spark 2.0 released. there are many nice features such as more efficient parser, vectorized execution, adaptive execution.
It is good to integrate with spark 2.x
current integration up to Spark v1.6 is tightly coupled with spark, we would like to cleanup the interface with following design points in mind:
1. decoupled with Spark, integration based on Spark's v2 datasource API
2. Enable vectorized carbon reader
3. Support saving DataFrame to Carbondata file through Carbondata's output format.
...
Attachments
1.
|
Clean partitioner in RDD package | Resolved | Jacky Li |
|
||||||||
2.
|
Clean up carbonTableSchema.scala before moving to spark-common package | Resolved | Jacky Li |
|
||||||||
3.
|
Extract spark-common module | Resolved | Jacky Li |
|
||||||||
4.
|
spark 2 stable datasource api integration | Resolved | Fei Wang |
|
||||||||
5.
|
spark2 decimal issue | Resolved | Fei Wang |
|
||||||||
6.
|
do not use runnablecommand in spark2 | Resolved | Fei Wang |
|
||||||||
7.
|
Use carbon property to get the store path/kettle home | Resolved | Fei Wang |
|
||||||||
8.
|
Executor can not get the read support class | Resolved | Fei Wang |
|
||||||||
9.
|
Depends on more stable class of spark in spark2 | Resolved | Fei Wang |
|