Uploaded image for project: 'Atlas'
  1. Atlas
  2. ATLAS-3321

Introduce atlas metadata namespace

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.0.0
    • 2.1.0, 3.0.0
    • atlas-core
    • None

    Description

      Atlas has the notion of clusterName which is used as a global namespace in which you can define the context in which entity instances are defined and grouped. A cluster name defines a scope of operation in atlas and is specified in parent entity types - hive_db (Hive), hbase_namespace (HBase), fs_path (File System), storm_topology (Storm), spark_db (Spark)

      cluster name is also used to construct qualifiedName attribute in atlas -

      • hive_db - db_name@cluster_name
      • hive_table - db_name.table_name@cluster_name
      • hive_column - db_name.table_name.column_name@cluster_name

      Cluster name works for simple environments wherein we have a single cluster and single service instances running (single Hive or HBase service). In complex deployments - a single cluster may host multiple instances of the same service (two Hive services - Hive-1 and Hive-2), for such cases cluster name doesn't help to provide context to individual service instances and metadata groupings.

      So we introduce - "metadata namespace" where a single cluster can contain multiple metadata namespaces. The qualified name for atlas will be generated using metadata namespace name instead of cluster name.

      For each hook enabled we specify the property - "atlas.metadata.namespace" which defines the scope/context of the metadata.

      Attachments

        Issue Links

          Activity

            People

              sarath Sarath Subramanian
              sarath Sarath Subramanian
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: