TAJO-337: Tajo Generic Storage Support

Details

    • Type: Improvement
    • Status: In Progress
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Catalog, Storage
    • Labels: None

    Description

      Currently, Tajo uses HDFS as its primary storage. As a data warehouse system, however, Tajo should be able to support a variety of data sources easily.

      To that end, I propose a generic storage handler interface that provides the following common storage methods:

      • splitting input data
      • locality
      • accessing the storage's own catalog (if it provides one)
      • creating a table
      • removing a table
      • adding default table properties and validating properties
      • committing, rolling back, and cleaning up output tables
      • getting physical table information, such as table volume
      • managing a connection pool for connection-based storage systems
      • adding storage-specific rewrite rules
      • adding hooks for query phases
      • exposing physical properties such as instant random access, indexability, read throughput, and write throughput
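
      To make the proposal concrete, here is a minimal Java sketch of what such a handler interface might look like. It is not Tajo's actual API: StorageHandler, InMemoryStorageHandler, every method name, and the "format" default property are hypothetical and chosen only for illustration. The sketch covers a subset of the list above (table creation and removal, default properties and validation, commit and rollback, table volume, splitting, and one physical property) and leaves out locality, catalog access, connection pooling, rewrite rules, and query-phase hooks.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical handler interface; every name is illustrative, not Tajo's API. */
interface StorageHandler {
    void createTable(String name, Map<String, String> properties); // validates and applies defaults
    void dropTable(String name);
    Map<String, String> getTableProperties(String name);
    void write(String name, String row);   // stages output until commit
    void commit(String name);              // publishes staged output
    void rollback(String name);            // discards staged output
    long getTableVolume(String name);      // physical size (here: character count)
    List<List<String>> getSplits(String name, int splitSize);
    boolean supportsRandomAccess();        // one example of a physical property
}

/** Toy in-memory implementation, present only to make the sketch executable. */
final class InMemoryStorageHandler implements StorageHandler {
    private final Map<String, Map<String, String>> props = new HashMap<>();
    private final Map<String, List<String>> committed = new HashMap<>();
    private final Map<String, List<String>> staged = new HashMap<>();

    // Looks up per-table state and fails loudly for unknown tables.
    private static <V> V lookup(Map<String, V> map, String name) {
        V value = map.get(name);
        if (value == null) {
            throw new IllegalArgumentException("no such table: " + name);
        }
        return value;
    }

    @Override
    public void createTable(String name, Map<String, String> properties) {
        if (props.containsKey(name)) {
            throw new IllegalStateException("table already exists: " + name);
        }
        Map<String, String> merged = new HashMap<>();
        merged.put("format", "text"); // assumed default property
        merged.putAll(properties);    // caller-supplied values override defaults
        String format = merged.get("format");
        if (format == null || format.isEmpty()) {
            throw new IllegalArgumentException("format must not be empty");
        }
        props.put(name, merged);
        committed.put(name, new ArrayList<>());
        staged.put(name, new ArrayList<>());
    }

    @Override
    public void dropTable(String name) {
        lookup(props, name);
        props.remove(name);
        committed.remove(name);
        staged.remove(name);
    }

    @Override
    public Map<String, String> getTableProperties(String name) {
        return lookup(props, name);
    }

    @Override
    public void write(String name, String row) {
        lookup(staged, name).add(row);
    }

    @Override
    public void commit(String name) {
        List<String> pending = lookup(staged, name);
        lookup(committed, name).addAll(pending);
        pending.clear();
    }

    @Override
    public void rollback(String name) {
        lookup(staged, name).clear();
    }

    @Override
    public long getTableVolume(String name) {
        return lookup(committed, name).stream().mapToLong(String::length).sum();
    }

    @Override
    public List<List<String>> getSplits(String name, int splitSize) {
        if (splitSize <= 0) {
            throw new IllegalArgumentException("splitSize must be positive");
        }
        List<String> rows = lookup(committed, name);
        List<List<String>> splits = new ArrayList<>();
        for (int i = 0; i < rows.size(); i += splitSize) {
            splits.add(new ArrayList<>(rows.subList(i, Math.min(i + splitSize, rows.size()))));
        }
        return splits;
    }

    @Override
    public boolean supportsRandomAccess() {
        return true;
    }
}
```

      In this sketch a caller would create a table, stage rows with write, and then call either commit or rollback. Only committed rows count toward getTableVolume and appear in getSplits, which mirrors the commit, rollback, and clean-up responsibilities listed above.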

            People

              Assignee: Hyunsik Choi (hyunsik)
              Reporter: Hyunsik Choi (hyunsik)
