Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-8746

ORC timestamp columns are sensitive to daylight savings time

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.11.0, 0.12.0, 0.13.0, 0.14.0, 1.0.0, 1.1.0, 1.2.0
    • Fix Version/s: 1.2.0
    • Component/s: None
    • Labels:
    • Release Note:
      Fixed ORC timestamp columns for daylight savings changes.

      Description

      Hive uses Java's Timestamp class to manipulate timestamp columns. Unfortunately the textual parsing in Timestamp is done in local time and the internal storage is in UTC.

      ORC mostly side steps this issue by storing the difference between the time and a base time also in local and storing that difference in the file. Reading the file between timezones will mostly work correctly "2014-01-01 12:34:56" will read correctly in every timezone.

      However, when moving between timezones with different daylight saving it creates trouble. In particular, moving from a computer in PST to UTC will read "2014-06-06 12:34:56" as "2014-06-06 11:34:56".

        Attachments

        1. HIVE-8746.4.patch
          264 kB
          Prasanth Jayachandran
        2. HIVE-8746.3.patch
          275 kB
          Prasanth Jayachandran
        3. HIVE-8746.2.patch
          274 kB
          Prasanth Jayachandran
        4. HIVE-8746.1.patch
          54 kB
          Prasanth Jayachandran

          Issue Links

            Activity

              People

              • Assignee:
                prasanth_j Prasanth Jayachandran
                Reporter:
                omalley Owen O'Malley
              • Votes:
                0 Vote for this issue
                Watchers:
                10 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: