Details
Description
Currently, sqoop-1 only supports merging of two Parquet format data sets but it doesn't support to do incremental merge, so I have written a Sqoop Incremental Merge MR for Parquet File Format and I have tested with million records of data with N number of iterations.
blocked by issue https://issues.apache.org/jira/browse/SQOOP-3192
Attachments
Issue Links
- Blocked
-
SQOOP-3182 Sqoop1 (import + --incremental + --merge-key + --as-parquetfile) fails with (Can't parse input data: 'PAR1')
- Resolved
- is blocked by
-
SQOOP-3192 upgrade parquet
- Resolved
- is related to
-
SQOOP-3181 Sqoop1 (import + --incremental + --merge-key + --as-parquetfile) fails with (Could not find class <CLASS>.)
- Resolved
-
SQOOP-3182 Sqoop1 (import + --incremental + --merge-key + --as-parquetfile) fails with (Can't parse input data: 'PAR1')
- Resolved
- relates to
-
PARQUET-1010 SKIP FILES STARTS WITH "." in parquet.hadoop.ParquetFileReader.readAllFootersInParallel
- Resolved
- links to