Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-3918

[Python] ParquetWriter.write_table doesn't support coerce_timestamps or allow_truncated_timestamps

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • 0.11.1
    • 0.12.0
    • Python
    • None

    Description

      Error: Table Schema does not match schema used to create file.

      The 0.11.1 release added these parameters to pyarrow.parquet.write_table(), but they are missing from pyarrow.parquet.ParquetWriter.write_table().. I'm seeing mismatches between the table schema and the file schema, but they are identical in the error message with modified: timestamp[ms] column types in both schemas. The only thing which looks odd is the Pandas metadata that has a modified column with a panda datatype of datetime and a numpy datatype of datetime64[ns]

       

      Attachments

        Issue Links

          Activity

            People

              kszucs Krisztian Szucs
              davlee1972@yahoo.com David Lee
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: