You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think this is due to the underlying parquet write truncating the values it uses for the column index, but not for the column metadata statistics. I'm going to open a companion issue to arrow-rs to track that. (EDIT: apache/arrow-rs#5037)
The text was updated successfully, but these errors were encountered:
Environment
Delta-rs version: master
Binding: rust
Environment: local test
Bug
What happened:
When writing a file with large binary columns, the delta log json for the commit is very large due to a large statistics object.
What you expected to happen:
These are expected to receive truncated statistics due to a PR to arrow-rs (apache/arrow-rs#4389).
How to reproduce it:
A test case (to put in
stats.rs
):More details:
I think this is due to the underlying parquet write truncating the values it uses for the column index, but not for the column metadata statistics. I'm going to open a companion issue to arrow-rs to track that. (EDIT: apache/arrow-rs#5037)
The text was updated successfully, but these errors were encountered: