Apache Arrow / ARROW-1436

PyArrow Timestamps written to Parquet as INT96 appear in Spark as 'bigint'


    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Cannot Reproduce
    • Affects Version/s: 0.6.0
    • Fix Version/s: 0.8.0
    • Component/s: Format, Python
    • Labels:
      None

      Description

      When the 'use_deprecated_int96_timestamps' option is used to write Parquet files compatible with Spark < 2.2.0 (which doesn't support INT64-backed timestamps), Spark identifies the timestamp columns as 'bigint' instead of timestamps. Some metadata may be missing from the written files.

            People

            • Assignee: Licht Takeuchi (Licht-T)
            • Reporter: Lucas Pickup (LucasPickup)
            • Votes: 0
            • Watchers: 3
