[Bug] parquet files with columns containing large list of byte arrays can not be read by pyarrow. #816
Closed
2 tasks done
Labels
bug
Something isn't working
Search before asking
Component
Library/core
What happened + What you expected to happen
I have a parquet file that has a column containing a list of images, as byte arrays. Under some circumstances, such files are not readable by pyarrow.
Reproduction script
grab https://ibm.ent.box.com/file/1684883605503?s=9qcne0iubeji6t6a77gh2sxgi29k1tjp as test.parquet
Anything else
OS
Ubuntu, MacOS (limited support)
Python
3.11.x
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: