Ranger / RANGER-4298

Multi-threaded tagsync needed

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Ranger, tagsync
    • Labels: None

    Description

      We are migrating our large data lakes (S3, MinIO) from resource-based to tag-based access control. The migration is phased and is estimated to continue through Q3. In the current phase we need to import 250K tags into Ranger from Apache Atlas. At the current single-threaded rate of 500 ms/tag, this will take 6 to 7 days.
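A back-of-the-envelope sketch of how import time might scale with worker count, using the 250K tag count and 500 ms/tag rate quoted above. The function name `estimated_hours` and the worker counts are illustrative assumptions, and the model assumes perfect parallel scaling with no contention, which a real deployment is unlikely to achieve. It covers only raw per-tag processing (about 35 hours single-threaded at the quoted rate); an end-to-end figure also depends on overheads that are not modeled here.

```python
def estimated_hours(tag_count: int, seconds_per_tag: float, workers: int = 1) -> float:
    """Ideal wall-clock hours, assuming perfect parallel scaling and no other overhead."""
    if workers < 1:
        raise ValueError("workers must be at least 1")
    return tag_count * seconds_per_tag / workers / 3600


TAG_COUNT = 250_000      # from the description
SECONDS_PER_TAG = 0.5    # 500 ms/tag, from the description

# Worker counts below are hypothetical values chosen for illustration.
for workers in (1, 4, 8, 16):
    hours = estimated_hours(TAG_COUNT, SECONDS_PER_TAG, workers)
    print(f"{workers:>2} worker(s): {hours:6.1f} h ({hours / 24:.2f} days)")
```

Even a modest worker count would shorten a recurring import of this size considerably, provided the Ranger side can absorb the concurrent load.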

      Beyond this migration, we expect to incorporate multiple external datasets into our data lakes at regular intervals, which will require importing roughly the same number of tags.

      So this is not a one-time need; it is expected to recur. It would be great if tagsync could be made multi-threaded.
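To illustrate the general shape of the request, here is a minimal worker-pool sketch in Python. It is not tagsync code: `upload_tag` is a hypothetical stand-in that sleeps to mimic per-tag latency, `sync_tags` is an illustrative name, and the sketch assumes that tags can be uploaded independently and in any order, which would need to be confirmed for tagsync.

```python
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def upload_tag(tag: str) -> str:
    """Hypothetical stand-in for one tag upload; sleeps to mimic I/O latency."""
    time.sleep(0.01)
    return tag


def sync_tags(tags, workers=8):
    """Upload tags concurrently and return (succeeded, failed) lists."""
    succeeded, failed = [], []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Map each future back to its tag so failures can be reported by name.
        futures = {pool.submit(upload_tag, tag): tag for tag in tags}
        for future in as_completed(futures):
            tag = futures[future]
            try:
                future.result()
                succeeded.append(tag)
            except Exception:
                # Collect failures instead of aborting the whole batch.
                failed.append(tag)
    return succeeded, failed


tags = [f"tag-{i}" for i in range(200)]
start = time.perf_counter()
ok, bad = sync_tags(tags, workers=8)
elapsed = time.perf_counter() - start
print(f"uploaded {len(ok)} tags, {len(bad)} failed, in {elapsed:.2f} s")
```

Because the simulated work is I/O-bound waiting, threads overlap it well. If some tag updates must be applied in order, for example several changes to the same entity, the work would need to be partitioned so that related updates stay on one worker.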

          People

            Assignee: Unassigned
            Reporter: Barbara Eckman
            Votes: 0
            Watchers: 1