Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
trunk
None
Description
Background
While profiling entity creation during zip-file-based migration import, it was observed that over 50% of the time is spent assigning values to attributes of primitive types, viz. string, date, etc.
Methodology Used
Key steps of the entity creation flow are already instrumented with MetricRecorder. Additional methods (such as mapPrimitiveValue) were profiled to get a complete breakdown.
The data set contained a large number of entities of various types. The types used were hdfs_path, hive_db, hive_table, hive_column, hive_process, hive_process_execution, hive_column_lineage.
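For illustration, a per-step breakdown of this kind can be approximated with a small timer that accumulates elapsed time per named step. This is only a sketch: StepTimer and its methods are hypothetical names and are not the MetricRecorder API. The step names passed in are whatever the caller chooses.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Supplier;

// Hypothetical stand-in for a per-step metrics recorder (not the Atlas MetricRecorder API).
public class StepTimer {
    private final Map<String, Long> totalNanos = new LinkedHashMap<>();
    private final Map<String, Integer> calls = new LinkedHashMap<>();

    // Runs the body, adds its elapsed time to the named step, and returns the body's result.
    public <T> T time(String step, Supplier<T> body) {
        long start = System.nanoTime();
        try {
            return body.get();
        } finally {
            totalNanos.merge(step, System.nanoTime() - start, Long::sum);
            calls.merge(step, 1, Integer::sum);
        }
    }

    public int callCount(String step) {
        return calls.getOrDefault(step, 0);
    }

    public long totalNanos(String step) {
        return totalNanos.getOrDefault(step, 0L);
    }

    // Fraction of the enclosing step's total time that was spent in the given step.
    public double shareOf(String step, String enclosingStep) {
        long outer = totalNanos(enclosingStep);
        return outer == 0 ? 0.0 : (double) totalNanos(step) / outer;
    }
}
```

Wrapping the whole entity creation call in one step and the primitive mapping call in a nested step lets shareOf report the kind of ratio quoted in the Background section.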
Solution
Within the migration request context, use setEncodedProperty instead of the current route of creating an AttributeMutationContext.
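The shape of this change can be sketched roughly as follows. Every name here (PrimitiveFastPath, Vertex, MutationContext, mapAttribute, the migrationImport flag) is a hypothetical stand-in invented for illustration. None of it is the actual Atlas code, and the direct property write only plays the role that setEncodedProperty plays in the proposal.

```java
import java.util.Date;
import java.util.HashMap;
import java.util.Map;

public class PrimitiveFastPath {
    // Hypothetical stand-in for a graph vertex that stores properties by name.
    static class Vertex {
        final Map<String, Object> props = new HashMap<>();

        void setProperty(String key, Object value) {
            props.put(key, value);
        }
    }

    // Hypothetical stand-in for the per-attribute context object built on the regular path.
    static class MutationContext {
        final Vertex vertex;
        final String name;
        final Object value;

        MutationContext(Vertex vertex, String name, Object value) {
            this.vertex = vertex;
            this.name = name;
            this.value = value;
        }

        void apply() {
            vertex.setProperty(name, value);
        }
    }

    // Counts how many context objects were allocated, to make the saving visible.
    int contextsCreated = 0;

    static boolean isPrimitive(Object value) {
        return value instanceof String || value instanceof Number
                || value instanceof Boolean || value instanceof Date;
    }

    void mapAttribute(Vertex vertex, String name, Object value, boolean migrationImport) {
        if (migrationImport && isPrimitive(value)) {
            // Fast path: write the value directly and skip the context object.
            vertex.setProperty(name, value);
            return;
        }
        // Regular path: build a context object for every attribute.
        MutationContext ctx = new MutationContext(vertex, name, value);
        contextsCreated++;
        ctx.apply();
    }
}
```

Both paths leave the same value on the vertex. The difference is that the fast path avoids one object allocation and the associated bookkeeping per primitive attribute, which adds up when an import touches a large number of entities.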