  1. Hive
  2. HIVE-26222

Native GeoSpatial Support in Hive

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.0.0
    • Component/s: Hive, HiveServer2
    • Labels: None

    Description

      This is an epic Jira to support geospatial datatypes natively in Hive. It caters to applications that query large volumes of spatial data. Support will be added in phases. To start with, we plan to use the framework developed by ESRI (https://github.com/Esri/spatial-framework-for-hadoop), which is available under the Apache 2.0 license. That project is not very active and has no release on Maven Central, so it is not easy to pull the jars directly through a pom dependency. Its UDFs are also based on an older version of Hive. We have therefore decided to copy the repo and maintain it inside Hive, which makes it easier to make improvements and manage dependencies. As of now, data loading works only on a binary datatype; this needs to be enhanced to be more user friendly. In the next phase, a native Geometry/Geography datatype will be supported, so that users can directly create a geometry type and operate on it. Beyond that, we can start adding support for different indices such as quad tree and R-tree, and for ORC/Parquet/Iceberg.

          People

            Assignee: Ayush Saxena (ayushtkn)
            Reporter: mahesh kumar behera (maheshk114)
            Votes: 0
            Watchers: 4

              Time Tracking

              Estimated: Not Specified
              Remaining: 0h
              Logged: 2h 40m
