Apache Arrow / ARROW-14448

[Python] Update pyarrow.array() docstring note on timestamp (timezone) conversion

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 7.0.0
    • Component/s: Python

    Description

      The current pa.array(..) docstring has the following note:

      Localized timestamps will currently be returned as UTC (pandas’s native representation). Timezone-naive data will be implicitly interpreted as UTC.

      IMO this is quite confusing. It is technically somewhat correct: under the hood we store UTC values for timezone-aware data, and values "as if" in UTC for timezone-naive data. But that is not what the user actually gets back. The type of the returned array preserves the timezone where possible, and has no timezone at all (not UTC either) when converted from naive timestamps.

            People

              Assignee: Sanjiban Sengupta (sanjibansg)
              Reporter: Joris Van den Bossche (jorisvandenbossche)
              Votes: 0
              Watchers: 2

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h 40m