Uploaded image for project: 'Atlas'
  1. Atlas
  2. ATLAS-409

Atlas will not import avro tables with schema read from a file

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.6-incubating
    • 0.7-incubating
    • None
    • None

    Description

      Atlas won't import a Hive table with no columns (see below for an example of a valid hive table with no explicit columns). This is because the Atlas Hive Storage Descriptor class REQUIRES columns, but the Hive Table class allows them to be OPTIONAL.

      CREATE TABLE example
        ROW FORMAT SERDE
        'org.apache.hadoop.hive.serde2.avro.AvroSerDe'
        STORED AS INPUTFORMAT
        'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat'
        OUTPUTFORMAT
        'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat'
        TBLPROPERTIES (
          'avro.schema.url'='file:///path/to/the/schema/test_serializer.avsc');
      

      Attachments

        1. ATLAS-409.patch
          1 kB
          Aaron Dossett
        2. ATLAS-409-2.patch
          2 kB
          Aaron Dossett
        3. ATLAS-409-3.patch
          1 kB
          Aaron Dossett
        4. example.ddl
          0.3 kB
          Aaron Dossett
        5. foo.avsc
          0.6 kB
          Aaron Dossett
        6. hive-avro-entity.json
          9 kB
          Hemanth Yamijala

        Activity

          People

            dossett@gmail.com Aaron Dossett
            dossett@gmail.com Aaron Dossett
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: