`UnwrapCastInComparison` produces incorrect results #14303

jonahgao · 2025-01-26T09:52:19Z

Describe the bug

I found that `UnwrapCastInComparison` always assumes the cast can succeed. When the cast cannot succeed, the rewritten expression produces incorrect results.

To Reproduce

Run the following queries in the CLI (compiled from the latest main: f775791):

DataFusion CLI v44.0.0
> with t as (select 1000000 as a) select try_cast(a as smallint) > 1 from t;
+----------------+
| t.a > Int64(1) |
+----------------+
| true           |
+----------------+
1 row(s) fetched.
Elapsed 0.008 seconds.

> with t as (select 1000000 as a) select cast(a as smallint) > 1 from t;
+----------------+
| t.a > Int64(1) |
+----------------+
| true           |
+----------------+
1 row(s) fetched.
Elapsed 0.007 seconds.

Expected behavior

With optimizations disabled, the queries above behave differently, and correctly: the `try_cast` query returns NULL and the `cast` query fails with a cast error.

> set datafusion.optimizer.max_passes=0;
0 row(s) fetched.
Elapsed 0.003 seconds.

> with t as (select 1000000 as a) select try_cast(a as smallint) > 1 from t;
+----------------+
| t.a > Int64(1) |
+----------------+
| NULL           |
+----------------+
1 row(s) fetched.
Elapsed 0.006 seconds.

> with t as (select 1000000 as a) select cast(a as smallint) > 1 from t;
Arrow error: Cast error: Can't cast value 1000000 to type Int16
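
To make the mismatch concrete, here is a minimal sketch in plain Rust that models the three evaluation orders for a single `Int64` value. It is a toy model, not DataFusion code: the function names are made up for illustration, and `i16::try_from` stands in for the SMALLINT cast.

```rust
// Toy model of the three evaluation orders; none of this is DataFusion code.

/// Models `try_cast(a AS SMALLINT) > 1`: an out-of-range value becomes
/// NULL (`None`), so the comparison is NULL as well.
pub fn try_cast_then_compare(a: i64) -> Option<bool> {
    i16::try_from(a).ok().map(|v| v > 1)
}

/// Models `cast(a AS SMALLINT) > 1`: an out-of-range value is an error.
pub fn cast_then_compare(a: i64) -> Result<bool, String> {
    i16::try_from(a)
        .map(|v| v > 1)
        .map_err(|_| format!("Can't cast value {a} to type Int16"))
}

/// Models the expression left after the cast is unwrapped: `a > 1`.
/// The range check on `a` is gone, so this always yields a boolean.
pub fn unwrapped_compare(a: i64) -> bool {
    a > 1
}
```

For `a = 1000000`, the first function returns `None`, the second returns an `Err`, and the third returns `true`, which matches the three behaviors shown in the CLI output above. For a value that fits in SMALLINT, such as `5`, all three agree.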

Additional context

I don't think this is a very urgent bug because both Spark and DuckDB have similar issues.


Spaarsh · 2025-01-26T14:04:01Z

I have analyzed the code in unwrap_cast_in_comparison.rs. I think adding a conditional check to the `OptimizerRule` implementation of `UnwrapCastInComparison` should fix this.
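
One possible shape for such a check, sketched under assumptions: only unwrap the cast when every value of the column's type fits in the cast's target type, so the cast can never fail. The `IntType` enum and `cast_is_infallible` function below are hypothetical stand-ins, not DataFusion's `DataType` or any existing API, and this is not necessarily the condition the actual fix uses.

```rust
/// Hypothetical stand-in for the signed integer types involved;
/// this is not DataFusion's `DataType`.
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum IntType {
    Int8,
    Int16,
    Int32,
    Int64,
}

impl IntType {
    /// Inclusive (min, max) value range of the type, widened to i64.
    pub fn range(self) -> (i64, i64) {
        match self {
            IntType::Int8 => (i8::MIN as i64, i8::MAX as i64),
            IntType::Int16 => (i16::MIN as i64, i16::MAX as i64),
            IntType::Int32 => (i32::MIN as i64, i32::MAX as i64),
            IntType::Int64 => (i64::MIN, i64::MAX),
        }
    }
}

/// Returns true when every value of `from` fits in `to`,
/// i.e. casting `from` to `to` can never fail.
pub fn cast_is_infallible(from: IntType, to: IntType) -> bool {
    let (from_min, from_max) = from.range();
    let (to_min, to_max) = to.range();
    to_min <= from_min && from_max <= to_max
}
```

Under this sketch, `cast_is_infallible(IntType::Int16, IntType::Int64)` is true, so a widening cast could still be unwrapped, while `cast_is_infallible(IntType::Int64, IntType::Int16)` is false, which would leave the queries from this issue untouched.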

Spaarsh · 2025-01-26T14:04:06Z

take

jonahgao added the bug label Jan 26, 2025

github-actions bot assigned Spaarsh Jan 26, 2025
