Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-20109

get rid of COLUMN_STATS_ACCURATE

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Statistics
    • None

    Description

      I don't know why anyone would come up with an idea of storing a set of booleans in a database using JSON. This has caused various problems in the past (text field limitations, perf issues when parsing a giant string; also bugs because the way it is set is brittle).
      However, now that we are implementing transactional stats, it becomes especially problematic and error prone because the code in Hive sets C_S_A in random places with reckless abandon, whereas we want to change the state of the stats in well defined places where txn semantics can be verified.
      Currently in HIVE-19416, we are handling random things that touch it (from metastore itself to output committers, various stats tasks, commands like truncate, etc.) via a pile of hacks, but the best solution would be to remove it completely and replace with a DB table/columns in stats tables that would need to be set explicitly, not via generic alter_table.

      Attachments

        1. HIVE-20109.nogen.patch
          227 kB
          Sergey Shelukhin
        2. HIVE-20109.patch
          1.62 MB
          Sergey Shelukhin

        Activity

          People

            sershe Sergey Shelukhin
            sershe Sergey Shelukhin
            Votes:
            1 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: