Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-6190

redundant columns in metastore schema for stats

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Duplicate
    • None
    • 4.1.0
    • Metastore
    • None

    Description

      package.jdo has:

            <field name ="dbName">
              <column name="DB_NAME" length="128" jdbc-type="VARCHAR" allows-null="false"/>
            </field>
            <field name="tableName">
              <column name="TABLE_NAME" length="128" jdbc-type="VARCHAR" allows-null="false"/>
            </field>
            <field name="partitionName">
              <column name="PARTITION_NAME" length="767" jdbc-type="VARCHAR" allows-null="false"/>
            </field>
            <field name="partition">
              <column name="PART_ID"/>
            </field>
      

      Only PART_ID is enough, the other fields are unnecessary and may potentially cause bugs; similarly for table stats. One could argue that they were intended for perf (denormalization), but stats retrieval currently is very slow on much deeper level so it's not really justified.

      Attachments

        Issue Links

          Activity

            People

              sershe Sergey Shelukhin
              sershe Sergey Shelukhin
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: