[PHOENIX-1598] Encode column names to save space and improve performance - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.10.0
Component/s: None
Labels: None

Description

When a table is created with Phoenix DDL, replace the column names the user supplies with shorter names to save space. The user will still use the full names in SELECT statements and will get them back in the result set, but under the hood the infrastructure will translate the names to their shorter versions.

Example:
When a table is created with my_column_1, my_column_2, ..., it will be stored with a as the first column, b as the second one, and so on.
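
The mapping described above can be sketched in a few lines of Python. This is only an illustration of the idea in the description, not the code from the attached patches: the names `short_name` and `ColumnNameMapper` are hypothetical, and continuing with aa, ab, ... after z is an assumption the description does not spell out. The sub-tasks suggest that the final design stores encoded column qualifiers using BYTE, SHORT, or INTEGER schemes rather than letters.

```python
import string


def short_name(index: int) -> str:
    """Return a short name for the column at 0-based position `index`.

    0 -> 'a', 1 -> 'b', ..., 25 -> 'z', 26 -> 'aa', 27 -> 'ab', ...
    (continuing past 'z' is an assumption made for this sketch).
    """
    if index < 0:
        raise ValueError("index must be non-negative")
    letters = string.ascii_lowercase
    name = ""
    n = index + 1  # bijective base-26 is easiest on 1-based numbers
    while n > 0:
        n, rem = divmod(n - 1, 26)
        name = letters[rem] + name
    return name


class ColumnNameMapper:
    """Hypothetical two-way map between user column names and short names."""

    def __init__(self, column_names):
        self._to_short = {}
        self._to_full = {}
        for position, full in enumerate(column_names):
            if full in self._to_short:
                raise ValueError(f"duplicate column name: {full}")
            short = short_name(position)
            self._to_short[full] = short
            self._to_full[short] = full

    def encode(self, full_name: str) -> str:
        """Translate a user-facing name to the name that would be stored."""
        return self._to_short[full_name]

    def decode_row(self, stored_row: dict) -> dict:
        """Translate a stored row back to user-facing column names."""
        return {self._to_full[short]: value for short, value in stored_row.items()}


mapper = ColumnNameMapper(["my_column_1", "my_column_2"])
stored = {mapper.encode("my_column_1"): 10, mapper.encode("my_column_2"): 20}
print(stored)
print(mapper.decode_row(stored))
```

In this sketch the user only ever sees the full names: queries go through `encode` on the way in, and results go through `decode_row` on the way out.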

Attachments

PHOENIX-1598_master.patch	27/Feb/17 06:14	1.42 MB	Samarth Jain
PHOENIX-1598-4.x-HBase-0.98.patch	27/Feb/17 06:11	1.44 MB	Samarth Jain

Sub-Tasks

1.	Support encoded column qualifiers per column family	Resolved	Samarth Jain
2.	Make joins work with encoded column names	Resolved	Samarth Jain
3.	Add support for setting a storage scheme at table creation time	Resolved	Samarth Jain
4.	Support null when columns have default values for immutable tables with encoding scheme COLUMNS_STORED_IN_SINGLE_CELL	Resolved	Thomas D'Silva
5.	Support different encoding schemes (BYTE, SHORT, INTEGER) for storing encoded column qualifiers	Resolved	Samarth Jain
6.	Make changes to IndexMaintainer backward compatible	Resolved	Samarth Jain
7.	Add a CREATE IMMUTABLE TABLE construct to make immutable tables more explicit	Resolved	Thomas D'Silva
8.	Parameterize tests for different encoding and storage schemes	Resolved	Thomas D'Silva
9.	Add upgrade code to add the required metadata columns for supporting column encoding	Resolved	Samarth Jain
10.	Add COLUMN_ENCODED_BYTES table property	Resolved	Thomas D'Silva
11.	Fix bulkload for StorageScheme - ONE_CELL_PER_KEYVALUE_COLUMN	Resolved	Ankit Singhal
12.	Data load gets 5-7X slower with mutable sparse columns	Resolved	Samarth Jain
13.	Filter on value column for mutable encoded table is > 3X slower compared to non encoded table	Resolved	Samarth Jain
14.	Upgrading from 4.8 or before to encodecolumns2 branch fails	Resolved	Samarth Jain
15.	Make use of EncodedColumnQualifierCellsList for all column name mapping schemes	Resolved	Samarth Jain
16.	Add a test case to test out CREATE TABLE IF NOT EXISTS code path	Resolved	Samarth Jain
17.	Change tests extending BaseQueryIT to use unique table names	Resolved	Samarth Jain
18.	Optimize BooleanExpressionFilter and ColumnProjectionFilter for tables with encoded columns	Resolved	Samarth Jain
19.	Backward compatibility fails for immutable tables after column encoding patch	Resolved	Samarth Jain
20.	Backward compatibility fails for joins	Resolved	Samarth Jain
21.	Remove testUnfoundSingleColumnCaseStatement from CaseStatementIT	Resolved	Samarth Jain

People

Assignee: Samarth Jain

Reporter: noam bulvik

Votes: 0

Watchers: 13

Dates

Created: 20/Jan/15 07:06

Updated: 06/Mar/17 22:22

Resolved: 06/Mar/17 22:22