Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-603

Add mem store for seconds data latency

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • Backlog
    • None
    • None

    Description

      For seconds level streaming data latency

      • Data is separated into Young and Aged.
      • Add mem store to hold young generation micro batch loaded from streaming (e.g. every 15 seconds). Mem store is good for high throughput and frequent access.
      • Aged generation is batch loaded from mem store to disk store daily or hourly.
      • Data is persisted in two copies, one as Inverted Index, one as Cube, for both young and aged.
      • Inverted Index is for analysis at raw record or low aggregation levels (cardinality > 10K), optimized for parallel random read.
      • Cube is for analysis at high aggregation levels (cardinality < 10K), optimized for sequential scan.
      • Hybrid bring aged from cube and young from inverted index when young cube is omitted.

      Attachments

        Issue Links

          Activity

            People

              liyang.gmt8@gmail.com liyang
              liyang.gmt8@gmail.com liyang
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: