[PHOENIX-716] Intra-row secondary indexing - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Task
Status: Open
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

old issue number:
585

Description

Use cases that leverage nested child rows may be able to benefit from a simpler index implementation that has better atomicity guarantees and better performance. If the row is identified through another index (whether in Phoenix or via another mechanism, such as a lucene-based search), the data stored in the columns of the row can be indexed by other columns in the row.

Below is a simple example implementation. Given row:
```

{salt}

person_id:123
```
You could have columns:
```

{leading byte for index 1}

attribute_1:value

{delimiter}event:123 -> (no value) {leading byte for index 2}attribute_2:value{delimiter}

event:123 -> (no value)

{leading byte for data}

event:123 ->

{nested data with attributes 1 & 2}

```
Given the structure above, as long as we know the person we are working with we benefit from a few things.

1. We can leverage row-level atomicity for index updates.
2. When data is written to the row, the index can be updated very efficiently, as its in the same region.
3. When reading from the row, the index can be used very efficiently. A filter determines which index(es) to hit based on the query conditions, then skips "down" to the data for the read. Though the common case would only require a single scan, multiple scans would be possible in the event of a query that joins back to itself or must use multiple indexes in an order different than they are written to the row.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Jason Bray

Votes:: 1 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 10/Mar/14 23:48

Updated:: 10/Mar/14 23:48