Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Incomplete
None
None
Description
This idea came up in a discussion at Spark Summit today: it would be great to investigate the possibility of implementing a new SQL server that speaks Postgres' wire protocol, in lieu of Hive ThriftServer 2. I'm creating this ticket to track the idea, in case others have feedback.
Such a server could have a simpler architecture, and it would let users leverage the wide range of tools that are already available for Postgres (and for the many commercial database systems based on Postgres).
Some of the problems we'd need to figure out are:
1. What is the Postgres protocol? Is there official documentation for it?
2. How difficult would it be to implement that protocol in Spark (on the JVM in particular)?
3. How does data type mapping work?
4. How do system commands work? Would Spark need to support all of Postgres' commands?
5. Are there any restrictions on supporting nested data?
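
To make questions 1 to 3 and 5 a bit more concrete, here is a rough sketch of the byte-level framing used by version 3.0 of the Postgres frontend/backend protocol. It is written in Python only for brevity; a JVM implementation would produce the same bytes. The function names, the `SPARK_TO_PG` table, and the decision to fall back to `text` for nested types are all assumptions made for illustration, not a proposed design. The message layouts and type OIDs follow the Postgres documentation and the `pg_type` catalog as I understand them.

```python
import struct

PROTOCOL_V3 = 196608  # (3 << 16) | 0, i.e. protocol version 3.0

# Hypothetical mapping: Spark SQL type name -> (Postgres type OID, type size).
# The OIDs come from pg_type; which Spark type maps to which OID is an assumption.
SPARK_TO_PG = {
    "boolean": (16, 1),   # bool
    "int": (23, 4),       # int4
    "bigint": (20, 8),    # int8
    "double": (701, 8),   # float8
    "string": (25, -1),   # text (variable length)
}
# Assumption: nested types (array, map, struct) are rendered as text.
FALLBACK = (25, -1)


def message(type_byte: bytes, payload: bytes) -> bytes:
    """Frame a message: 1 type byte, then an Int32 length that counts
    itself and the payload but not the type byte, then the payload."""
    return type_byte + struct.pack("!I", len(payload) + 4) + payload


def parse_startup(packet: bytes) -> dict:
    """Parse a StartupMessage: Int32 length, Int32 version, then
    NUL-terminated key/value strings ending with one extra NUL."""
    length, version = struct.unpack("!II", packet[:8])
    if length != len(packet) or version != PROTOCOL_V3:
        raise ValueError("not a protocol 3.0 startup message")
    # Drop the final terminator, split on NUL, discard the trailing empty piece.
    parts = [p.decode() for p in packet[8:-1].split(b"\x00")[:-1]]
    return dict(zip(parts[0::2], parts[1::2]))


def row_description(columns) -> bytes:
    """Build a RowDescription ('T') message from (name, spark_type) pairs."""
    payload = struct.pack("!h", len(columns))
    for name, spark_type in columns:
        oid, size = SPARK_TO_PG.get(spark_type, FALLBACK)
        payload += name.encode() + b"\x00"
        # table OID, column number, type OID, type size, type modifier, format (0 = text)
        payload += struct.pack("!ihihih", 0, 0, oid, size, -1, 0)
    return message(b"T", payload)


def data_row(values) -> bytes:
    """Build a DataRow ('D') message with text-format values; None becomes NULL."""
    payload = struct.pack("!h", len(values))
    for value in values:
        if value is None:
            payload += struct.pack("!i", -1)  # length -1 marks NULL
        else:
            # str() is a simplification: Postgres defines its own text form
            # per type (booleans, for example, are sent as 't' / 'f').
            text = str(value).encode()
            payload += struct.pack("!i", len(text)) + text
    return message(b"D", payload)
```

For example, `row_description([("id", "bigint"), ("tags", "array<string>")])` would advertise `id` as `int8` and, under the fallback assumption above, `tags` as `text`. Whether clients cope well with nested data flattened to text is exactly the open issue in question 5.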