[BUG] ArrowTypeError: "Could not convert" Error in inspect._files method #1477
Comments
Thanks for reporting this, @xsfa.
Could you provide more information so we can debug this?
{
    "content": DataFileContent.DATA,
    "file_path": "s3a://dataplatform/silver/iceberg/spark/dbname/tablename/data/00001-3933-e97b5082-3b9e-4c4e-b965-f290205bcf3a-0-00001.parquet",
    "file_format": "PARQUET",
    "spec_id": 0,
    "record_count": 16718742,
    "file_size_in_bytes": 474139920,
    "column_sizes": {
        "1": 44833933,
        "2": 39592909,
        "3": 26025570,
        "4": 21604711,
        "5": 27511454,
        "6": 930995,
        "7": 5173236,
        "8": 4051761,
        "9": 4944629,
        "10": 24729094
    },
    "value_counts": {
        "1": 16718742,
        "2": 16718742,
        "3": 16718742,
        "4": 16718742,
        "5": 16718742,
        "6": 16718742,
        "7": 16718742,
        "8": 16718742,
        "9": 16718742,
        "10": 16718742
    },
    "null_value_counts": {
        "1": 0,
        "2": 0,
        "3": 0,
        "4": 3910423,
        "5": 7637,
        "6": 0,
        "7": 0,
        "8": 10289423,
        "9": 0,
        "10": 0
    },
    "split_offsets": [4, 138429859, 276834527, 415238280],
    "sort_order_id": 0,
    . . .
}
Here's an example entry from the [...]
I don't see anything out of the ordinary. Is there a particular reason you think it's due to [...]? It would be helpful to print out [...]. Can you also try [...]?
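One way to double-check an entry like the one above is to list the Python type of every field, so any value of an unexpected type stands out before the data is handed to PyArrow. This is only a minimal sketch using the standard library: `summarize_types` is a hypothetical helper (not part of PyIceberg or PyArrow), and the `entry` dict is a trimmed copy of the example above, assumed to be a plain Python dict.

```python
from __future__ import annotations

from typing import Any


def summarize_types(value: Any, path: str = "") -> dict[str, str]:
    """Map each leaf's dotted path to the name of its Python type.

    Hypothetical debugging helper; it only inspects plain Python objects.
    """
    if isinstance(value, dict):
        out: dict[str, str] = {}
        for key, item in value.items():
            child = f"{path}.{key}" if path else str(key)
            out.update(summarize_types(item, child))
        return out
    if isinstance(value, (list, tuple)):
        # Collapse a sequence to the sorted set of its element type names.
        names = sorted({type(item).__name__ for item in value})
        return {path: f"list[{'|'.join(names)}]"}
    return {path: type(value).__name__}


# Trimmed copy of the example entry (assumed to be a plain dict).
entry = {
    "file_format": "PARQUET",
    "record_count": 16718742,
    "column_sizes": {"1": 44833933, "2": 39592909},
    "split_offsets": [4, 138429859, 276834527, 415238280],
}

for field, type_name in summarize_types(entry).items():
    print(f"{field}: {type_name}")
```

A field whose reported type differs from its neighbours (for example a `NoneType` or `bytes` value inside a map of integers) would be a reasonable place to start looking, though this alone does not confirm the cause of the error.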
Apache Iceberg version
0.8.1 (latest release)
Please describe the bug 🐞
I think PyArrow is receiving malformed data from the file metadata, which prevents me from calling any of the file functions. Could this be caused by my Iceberg table format, or is it a genuine bug? I have confirmed that my table is a valid, readable Iceberg V2 table.
Code:
Full Stack Trace:
Willingness to contribute