Atlas / ATLAS-4922

Atlas Parallel Import Functionality using Kafka


Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: atlas-core
    • Labels: None

    Description

      Develop an import flow in Apache Atlas that uses Kafka to manage parallel import requests. Import requests will be published to a dedicated Kafka topic (ATLAS_IMPORT) so that they are processed efficiently and without conflicts. The ImportService will be enhanced to consume and process each message from the topic, so that imports run in the background.
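
The consume-and-process flow described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the proposed implementation: a standard-library `queue.Queue` stands in for the ATLAS_IMPORT topic (no Kafka client is used), and `import_worker`, `run_demo`, and the message fields are hypothetical names chosen for the example.

```python
import queue
import threading


def import_worker(topic, handler):
    """Consume messages one at a time until a None sentinel arrives."""
    while True:
        message = topic.get()
        if message is None:  # sentinel: no more import requests
            break
        handler(message)


def run_demo():
    # In-memory stand-in for the ATLAS_IMPORT topic (assumption: not real Kafka).
    atlas_import = queue.Queue()
    processed = []

    def handle_import(message):
        # A real handler would run the import; here we only record the request ID.
        processed.append(message["request_id"])

    worker = threading.Thread(target=import_worker, args=(atlas_import, handle_import))
    worker.start()

    # Several requests can be submitted while the worker is busy.
    for request_id in ("req-1", "req-2", "req-3"):
        atlas_import.put({"request_id": request_id, "payload": b"..."})
    atlas_import.put(None)
    worker.join()
    return processed
```

Because a single worker drains the queue, requests submitted in parallel are still handled one at a time, which is one way to read "without conflicts" in the description.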

      Once the staging process, which prepares the file stream for Kafka, is complete, users will be able to submit new import requests. This approach supports concurrent imports and improves how Apache Atlas handles import operations.
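
One possible shape for the staging step is sketched below. The description does not say how the file stream is prepared, so splitting it into fixed-size chunks is an assumption, and `stage_import`, `chunk_size`, and the message fields are hypothetical.

```python
import io


def stage_import(stream, request_id, chunk_size=1024):
    """Read a binary stream and return a list of topic-ready messages.

    Assumption: staging means splitting the stream into ordered chunks,
    each tagged with the request ID and a sequence number.
    """
    messages = []
    sequence = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:  # end of stream
            break
        messages.append(
            {"request_id": request_id, "sequence": sequence, "data": chunk}
        )
        sequence += 1
    return messages
```

For example, `stage_import(io.BytesIO(data), "req-1")` returns the staged messages for one request. Once it returns, the caller is free to submit the next request.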

      Additionally, this feature will introduce two new API endpoints for tracking import status. Given the unique request ID, these endpoints will let users check both the overall progress and the detailed status of an import request at any stage of the import lifecycle (e.g., "PROCESSING", "PROCESSED", "COMPLETE"). Users will also be able to retrieve request metadata.
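
The status tracking described above might be backed by something like the following in-memory store. This is a sketch only: the class and method names, the field names, and the rules for moving between states are assumptions. Only the three status strings come from the description.

```python
from dataclasses import dataclass, field


@dataclass
class ImportRequest:
    request_id: str
    total_entities: int
    imported_entities: int = 0
    status: str = "PROCESSING"
    metadata: dict = field(default_factory=dict)


class ImportStatusStore:
    """Hypothetical store keyed by the unique request ID."""

    def __init__(self):
        self._requests = {}

    def track(self, request_id, total_entities, **metadata):
        self._requests[request_id] = ImportRequest(
            request_id, total_entities, metadata=metadata
        )

    def record_progress(self, request_id, count):
        req = self._requests[request_id]
        req.imported_entities = min(req.total_entities, req.imported_entities + count)
        # Assumption: PROCESSED means every entity has been handled.
        if req.imported_entities == req.total_entities:
            req.status = "PROCESSED"

    def mark_complete(self, request_id):
        self._requests[request_id].status = "COMPLETE"

    def summary(self, request_id):
        """Overall status, as the first endpoint might return it."""
        req = self._requests[request_id]
        return {"requestId": req.request_id, "status": req.status}

    def details(self, request_id):
        """Detailed status plus request metadata, as the second endpoint might."""
        req = self._requests[request_id]
        return {
            **self.summary(request_id),
            "imported": req.imported_entities,
            "total": req.total_entities,
            "metadata": dict(req.metadata),
        }
```

The two lookup methods mirror the two proposed endpoints: `summary` for a quick status check and `details` for progress counters and metadata.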

          People

            dishatalreja Disha Talreja