Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-3621 Parquet as Cube storage
  3. KYLIN-3626

Allow customization for Parquet path

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Resolved
    • None
    • None
    • None

    Description

      Due to the difference of HDFS and object store like S3/blob store, Kylin needs to allow different path patterns for the files when putting to the underlying storage. 

       

      For example, to HDFS, this pattern is good:

      /kylin/metadata_name/cube_name/segment_name/file1.parquet

       

      But, the above pattern is not good for S3, since S3 uses the first couple bytes of the path to do the hashing, so we'd better add salt ahead of the original pathname:

      /salt/kylin/metadata_name/cube_name/segment_name/file1.parquet

      or do a reverse order:

      /segment_name/cube_name/metadata_name/kylin/file1.parquet

       

      To get the flexibility, we should make this extensible.

      Attachments

        Activity

          People

            Wayne0101 Chao Long
            shaofengshi Shao Feng Shi
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: