================================================================================================================ The first batch experiments are based on 10 and 25 columns. All row count is 1000000(one million). ================================================================================================================ 100000 random string rows: 10 columns: Write RCFile with 10 random string columns and 100000 rows cost 5298 milliseconds. And the file's on disk size is 50557188 Read all columns of a RCFile with 10 random string columns and 100000 rows cost 1200 milliseconds. Write SequenceFile with 10 random string columns and 100000 rows cost 18338 milliseconds. And the file's on disk size is 52701138 Read SequenceFile with 10 random string columns and 100000 rows cost 8968 milliseconds. 25 columns Write RCFile with 25 random string columns and 100000 rows cost 11732 milliseconds. And the file's on disk size is 126410809 Read all columns of a RCFile with 25 random string columns and 100000 rows cost 2076 milliseconds. Write SequenceFile with 25 random string columns and 100000 rows cost 45179 milliseconds. And the file's on disk size is 131299430 Read SequenceFile with 25 random string columns and 100000 rows cost 20030 milliseconds. 1000000 random string rows: 10 columns Write RCFile with 10 random string columns and 1000000 rows cost 51763 milliseconds. And the file's on disk size is 505226638 Read all columns of a RCFile with 10 random string columns and 1000000 rows cost 7855 milliseconds. Write SequenceFile with 10 random string columns and 1000000 rows cost 182081 milliseconds. And the file's on disk size is 527075676 Read SequenceFile with 10 random string columns and 1000000 rows cost 91953 milliseconds. 25 columns Write RCFile with 25 random string columns and 1000000 rows cost 121355 milliseconds. And the file's on disk size is 1263114852 Read all columns of a RCFile with 25 random string columns and 1000000 rows cost 18510 milliseconds. Write SequenceFile with 25 random string columns and 1000000 rows cost 454995 milliseconds. And the file's on disk size is 1313843492 Read SequenceFile with 25 random string columns and 1000000 rows cost 205802 milliseconds. ================================================================================================================ The first batch experiments are based on 10,25,40,50,and 80 columns. All row count is 100000(one hundred thousand). ================================================================================================================ Write RCFile with 10 random string columns and 100000 rows cost 6693 milliseconds. And the file's on disk size is 50505213 Read only one column of a RCFile with 10 random string columns and 100000 rows cost 838 milliseconds. Read only first and last columns of a RCFile with 10 random string columns and 100000 rows cost 261 milliseconds. Read all columns of a RCFile with 10 random string columns and 100000 rows cost 865 milliseconds. Write SequenceFile with 10 random string columns and 100000 rows cost 19138 milliseconds. And the file's on disk size is 52749300 Read SequenceFile with 10 random string columns and 100000 rows cost 9282 milliseconds. Write RCFile with 25 random string columns and 100000 rows cost 11965 milliseconds. And the file's on disk size is 126316949 Read only one column of a RCFile with 25 random string columns and 100000 rows cost 393 milliseconds. Read only first and last columns of a RCFile with 25 random string columns and 100000 rows cost 291 milliseconds. Read all columns of a RCFile with 25 random string columns and 100000 rows cost 1951 milliseconds. Write SequenceFile with 25 random string columns and 100000 rows cost 45623 milliseconds. And the file's on disk size is 131428716 Read SequenceFile with 25 random string columns and 100000 rows cost 20568 milliseconds. Write RCFile with 40 random string columns and 100000 rows cost 19129 milliseconds. And the file's on disk size is 202143232 Read only one column of a RCFile with 40 random string columns and 100000 rows cost 466 milliseconds. Read only first and last columns of a RCFile with 40 random string columns and 100000 rows cost 386 milliseconds. Read all columns of a RCFile with 40 random string columns and 100000 rows cost 2991 milliseconds. Write SequenceFile with 40 random string columns and 100000 rows cost 71973 milliseconds. And the file's on disk size is 209988338 Read SequenceFile with 40 random string columns and 100000 rows cost 32193 milliseconds. Write RCFile with 50 random string columns and 100000 rows cost 23083 milliseconds. And the file's on disk size is 252566469 Read only one column of a RCFile with 50 random string columns and 100000 rows cost 333 milliseconds. Read only first and last columns of a RCFile with 50 random string columns and 100000 rows cost 395 milliseconds. Read all columns of a RCFile with 50 random string columns and 100000 rows cost 3571 milliseconds. Write SequenceFile with 50 random string columns and 100000 rows cost 90024 milliseconds. And the file's on disk size is 262471976 Read SequenceFile with 50 random string columns and 100000 rows cost 39705 milliseconds. Write RCFile with 80 random string columns and 100000 rows cost 36214 milliseconds. And the file's on disk size is 404189730 Read only one column of a RCFile with 80 random string columns and 100000 rows cost 477 milliseconds. Read only first and last columns of a RCFile with 80 random string columns and 100000 rows cost 544 milliseconds. Read all columns of a RCFile with 80 random string columns and 100000 rows cost 5489 milliseconds. Write SequenceFile with 80 random string columns and 100000 rows cost 144141 milliseconds. And the file's on disk size is 419713124 Read SequenceFile with 80 random string columns and 100000 rows cost 62703 milliseconds. Table Format: column number | RCFile size | RCFile read 1 column | RCFile read 2 columns | RCFile read all columns | Sequence file size | sequence file full read 10|50505213|838| 261| 865| 52749300| 9282 25|126316949|393| 291| 1951| 131428716| 20568 40|202143232|466| 386| 2991| 209988338| 32193 50|252566469|333| 395| 3571| 262471976| 39705 80|404189730|477| 544| 5489| 419713124| 62703