Details

    • Type: Improvement
    • Status: In Progress
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Catalog, Storage
    • Labels: None

      Description

      Currently, Tajo uses HDFS as its primary storage. As a data warehouse system, however, Tajo should easily support a variety of data sources.

      To this end, I propose a generic storage handler interface that provides the following common storage operations:

      • splitting input data
      • reporting data locality
      • accessing the storage's own catalog (if it provides one)
      • creating a table
      • removing a table
      • adding default table properties and validating properties
      • committing, rolling back, and cleaning up output tables
      • getting physical table information, such as table volume
      • managing a connection pool for connection-based storages
      • adding storage-specific rewrite rules
      • adding hooks for query phases
      • exposing physical properties, such as instant random access, indexability, read throughput, and write throughput
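
To make the proposal more concrete, here is a minimal sketch of what such an interface could look like, together with a toy in-memory implementation. Every name in it (`StorageHandler`, `Split`, `StorageProperties`, `InMemoryStorageHandler`, the `format` property, the `write` helper) is hypothetical and chosen only for illustration; none of it is existing Tajo API. The sketch covers splitting, locality, table creation and removal, default properties and validation, commit and rollback, table volume, and physical properties. It leaves out catalog access, connection pooling, rewrite rules, and query hooks.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical physical properties a storage could advertise to the planner. */
final class StorageProperties {
    final boolean randomAccess;
    final boolean indexable;
    StorageProperties(boolean randomAccess, boolean indexable) {
        this.randomAccess = randomAccess;
        this.indexable = indexable;
    }
}

/** Hypothetical unit of input work: a byte range plus the hosts that hold it. */
final class Split {
    final String table;
    final long offset;
    final long length;
    final List<String> hosts; // locality hint for the scheduler
    Split(String table, long offset, long length, List<String> hosts) {
        this.table = table;
        this.offset = offset;
        this.length = length;
        this.hosts = hosts;
    }
}

/** Hypothetical generic storage handler covering a subset of the listed operations. */
interface StorageHandler {
    List<Split> getSplits(String table, long splitSize);
    void createTable(String table, Map<String, String> props);
    void dropTable(String table);
    Map<String, String> withDefaults(Map<String, String> props);
    void commitOutput(String stagingTable, String finalTable);
    void rollbackOutput(String stagingTable);
    long getTableVolume(String table);
    StorageProperties getProperties();
}

/** Toy implementation that tracks only table sizes in memory (assumption: one local host). */
final class InMemoryStorageHandler implements StorageHandler {
    private final Map<String, Long> volumes = new HashMap<>();

    public void createTable(String table, Map<String, String> props) {
        withDefaults(props); // validate before creating anything
        if (volumes.putIfAbsent(table, 0L) != null) {
            throw new IllegalStateException("table already exists: " + table);
        }
    }

    public void dropTable(String table) { volumes.remove(table); }

    public Map<String, String> withDefaults(Map<String, String> props) {
        Map<String, String> merged = new HashMap<>();
        merged.put("format", "text"); // hypothetical default property
        merged.putAll(props);
        String format = merged.get("format");
        if (format == null || format.isEmpty()) {
            throw new IllegalArgumentException("'format' must not be empty");
        }
        return merged;
    }

    /** Illustration-only helper that simulates writing bytes to a table. */
    void write(String table, long bytes) { volumes.merge(table, bytes, Long::sum); }

    public void commitOutput(String stagingTable, String finalTable) {
        Long bytes = volumes.remove(stagingTable);
        if (bytes == null) {
            throw new IllegalStateException("no staged output: " + stagingTable);
        }
        volumes.put(finalTable, bytes); // publish the staged data under the final name
    }

    public void rollbackOutput(String stagingTable) { volumes.remove(stagingTable); }

    public long getTableVolume(String table) {
        Long bytes = volumes.get(table);
        if (bytes == null) {
            throw new IllegalArgumentException("unknown table: " + table);
        }
        return bytes;
    }

    public List<Split> getSplits(String table, long splitSize) {
        if (splitSize <= 0) {
            throw new IllegalArgumentException("splitSize must be positive");
        }
        long total = getTableVolume(table);
        List<Split> splits = new ArrayList<>();
        for (long off = 0; off < total; off += splitSize) {
            long len = Math.min(splitSize, total - off);
            splits.add(new Split(table, off, len, Collections.singletonList("localhost")));
        }
        return splits;
    }

    public StorageProperties getProperties() { return new StorageProperties(true, false); }
}
```

In this sketch, a query would write into a staging table and call `commitOutput` on success or `rollbackOutput` on failure, so that a partially written result never becomes visible under the final table name. A real handler for HDFS or a connection-based store would likely need richer types for splits and table metadata than the strings and longs assumed here.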

              People

              • Assignee: Hyunsik Choi (hyunsik)
              • Reporter: Hyunsik Choi (hyunsik)
              • Votes: 0
              • Watchers: 4