Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-9040

[Python][Parquet]"_ParquetDatasetV2" fail to read with columns and use_pandas_metadata=True

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.17.1
    • 1.0.0
    • Python
    • None

    Description

      Loading a parquet file write by pandas with default index.

      When call _ParquetDatasetV2.read(columns=['column'], use_pandas_metadata=True),

      "TypeError: unhashable type 'dict'"  were raised from 

      index_columns = set(_get_pandas_index_columns(metadata))

      Is it because of pandas default index?

       

       

      Attachments

        Activity

          People

            Unassigned Unassigned
            cmsxbc cmsxbc
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: