Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-3050

[C++] Adopt HiveServer2 client C++ codebase

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.11.0
    • C++

    Description

      I helped develop a small C++/Python library for interacting with databases like Hive and Impala via the HiveServer2 Thrift protocol and making them accessible to Python / pandas:

      https://github.com/cloudera/hs2client

      Internally this interfaces with HS2's own columnar representation. Arrow is a natural partner for this project, much of which could be discarded. I think Arrow would make as much sense as any place to develop this codebase further. It could be later split off into a new project if a large enough community develops

      cc twmarshall mjacobs for thoughts

      If we did this, do we need to do a software grant (essentially what I'm proposing is to fork)? Can we just attribute the original Cloudera authors in LICENSE.txt?

      Attachments

        Issue Links

          Activity

            People

              wesm Wes McKinney
              wesm Wes McKinney
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 2h 50m
                  2h 50m