Uploaded image for project: 'Atlas'
  1. Atlas
  2. ATLAS-1995

Performance of Entity Creation Can Be Improved By Using Index Query to Fetch Entity Using Unique Attributes

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8-incubating
    • Fix Version/s: 0.8.1, 1.0.0
    • Component/s: atlas-core
    • Labels:
      None

      Description

      Background
      On profiling entity creation flow, it was observed that several calls are made to AtlasGraphUtilsV1.getVertexByUniqueAttributes.

      These calls result in querying database using graph query. There is a potential for improving this if index query was used.

      Analysis
      Upon experimentation, it was found that there is a 50% improvement in performance of entity creation if this method was replaced with equivalent that uses indexQuery.

      Also, when large number of entities are created (typically using import_hive.sh), the CPU usage on Atlas was reduced, as the Solr was being used for doing some of the work.

      Implementation Guidance

      • Add new method to AtlasGraphUtilsV1.getAtlasVertexFromIndexQuery that will use AtlasGraphProvider.indexQuery to fetch vertices.
      • Ensure that query created is 'escaped' appropriately.
      • Include logic to fallback to graph query if the property being queried for is not indexed.

      Since this is a high-impact change, it will be worth while to verify other dependent modules.

        Attachments

        Issue Links

          Activity

            People

            • Assignee:
              amestry Ashutosh Mestry
              Reporter:
              amestry Ashutosh Mestry

              Dates

              • Due:
                Created:
                Updated:
                Resolved:

                Issue deployment