[ARROW-1599] [C++][Parquet] Unable to read Parquet files with list inside struct - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Duplicate
Affects Version/s: 0.7.0
Fix Version/s: None
Component/s: C++, Python
Labels: parquet
Environment: Ubuntu

External issue URL:
https://github.com/apache/arrow/issues/17612

Description

Is PyArrow currently unable to read Parquet files in which a column holds a vector (here, lists nested inside a struct)? The schema of one such file is shown below:

<pyarrow._parquet.ParquetSchema object at 0x7f2d42493c88>
mbc: FLOAT
deltae: FLOAT
labels: FLOAT
features.type: INT32 INT_8
features.size: INT32
features.indices.list.element: INT32
features.values.list.element: DOUBLE

Using either pq.read_table() or pq.ParquetDataset('/path/to/parquet').read() yields the following error: ArrowNotImplementedError: Currently only nesting with Lists is supported.
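
The error is consistent with `features` being a struct whose members include lists. The following is a minimal sketch, using only the dotted leaf paths printed in the schema above and no Parquet library, that labels each leaf by its nesting. The helper names `classify` and `group_by_top_level` are hypothetical, and the `.list.element` suffix check is a heuristic that only handles a single level of list nesting.

```python
from collections import defaultdict

# Leaf column paths copied from the schema printed in the description.
LEAF_PATHS = [
    "mbc",
    "deltae",
    "labels",
    "features.type",
    "features.size",
    "features.indices.list.element",
    "features.values.list.element",
]

# Suffix that marks a list leaf in the paths above (heuristic, one level only).
LIST_SUFFIX = ".list.element"


def classify(path):
    """Label one leaf path as 'flat', 'struct member', 'list', or 'list in struct'."""
    is_list = path.endswith(LIST_SUFFIX)
    base = path[: -len(LIST_SUFFIX)] if is_list else path
    in_struct = "." in base  # a remaining dot means the leaf has a parent field
    if is_list:
        return "list in struct" if in_struct else "list"
    return "struct member" if in_struct else "flat"


def group_by_top_level(paths):
    """Group leaf paths by their top-level field name, with a label per leaf."""
    groups = defaultdict(dict)
    for path in paths:
        top = path.split(".", 1)[0]
        groups[top][path] = classify(path)
    return dict(groups)


if __name__ == "__main__":
    for top, leaves in group_by_top_level(LEAF_PATHS).items():
        for path, kind in leaves.items():
            print(f"{top:10s} {path:35s} {kind}")
```

Under this labeling, `features.indices` and `features.values` are the two leaves that fall into the "list in struct" case named in the issue title.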

From the error message, I assume this may be implemented in a future release?

Issue Links

is related to

ARROW-4279 [C++] Rebase https://github.com/apache/parquet-cpp/pull/462# onto arrow repo (Closed)

ARROW-5799 [Python] Fail to write nested data to Parquet via BigQuery API (Closed)

People

Assignee: Micah Kornfield

Reporter: Jovann Kung

Votes: 3

Watchers: 7

Dates

Created: 22/Sep/17 17:59

Updated: 11/Jan/23 07:15

Resolved: 14/Mar/20 23:12