Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

meta: increase Flink's streaming backend coverage #8250

Closed
1 task done
chloeh13q opened this issue Feb 6, 2024 · 8 comments
Closed
1 task done

meta: increase Flink's streaming backend coverage #8250

chloeh13q opened this issue Feb 6, 2024 · 8 comments
Assignees
Labels
feature Features or general enhancements flink Issues or PRs related to Flink
Milestone

Comments

@chloeh13q
Copy link
Contributor

chloeh13q commented Feb 6, 2024

Is your feature request related to a problem?

While we have implemented support for the basic operators in the Flink backend, there are still some operators that are commonly seen in streaming workloads that we have not yet implemented support for. Some of these are pending the sqlglot refactoring work.

Describe the solution you'd like

  1. [DONE] [P1] Alternative syntax for top k (follow-up to feat(flink): implement windowed computations #7407)
  2. [P1] Deduplication using distinct() (pending feat: move the logic for deduplication into individual backends #7556)
  3. [P0] Array expansion (pending sqlglot refactoring)
  4. [P0] ops.ArrayCollect (pending UDF support)
  5. [P2] Window joins with more complex syntax (ANTI/SEMI) (follow-up to test(flink): add unit test for window join #7966)
  6. [P0] Temporal join (pending sqlglot refactoring) (follow-up to feat(flink): implement support for temporal join #7921)
  7. [P2] Time travel query (pending sqlglot refactoring) (feat(api): Support time travel query #8203)
  8. [P2] Pattern recognition (feat(api): Support pattern recognition #8252)
  9. [DONE] [P0] MAP support (covered by ibis/backends/tests/test_map.py)
  10. [P2] feat(flink): Support temporal join on Iceberg table #8254 (testing)

What version of ibis are you running?

8.0

What backend(s) are you using, if any?

Flink

Code of Conduct

  • I agree to follow this project's Code of Conduct
@chloeh13q chloeh13q added the feature Features or general enhancements label Feb 6, 2024
@deepyaman
Copy link
Contributor

Propose to include 9. MAP support in this issue; covered by ibis/backends/tests/test_map.py and useful to get UDFs with map working in #8142.

@jcrist jcrist added the flink Issues or PRs related to Flink label Feb 15, 2024
@jcrist jcrist changed the title feat(flink): increase Flink's streaming backend coverage meta: increase Flink's streaming backend coverage Feb 15, 2024
@jcrist jcrist added the roadmap label Feb 15, 2024
@lostmygithubaccount lostmygithubaccount added this to the Q1 2024 milestone Feb 15, 2024
@mfatihaktas
Copy link
Contributor

Temporal join issue: #8247

@chloeh13q
Copy link
Contributor Author

chloeh13q commented Feb 29, 2024

Weekly update [2/29/24]

[3-Array expansion]: issue opened (#8457), feature implementation WIP.

[6-Temporal join]: issue opened (#8247), draft PR #8412 in review.

[7-Time travel query]: issue opened (#8203), exploratory work on implementation (currently blocked by catalog not supporting time travel: https://issues.apache.org/jira/browse/FLINK-34553).

[9-MAP support]: Done (#8425).

[10-Temporal join on Iceberg table]: depends on #7712, draft PR #8343 to address #7712. This work is paused for now because of significant blockers in pyiceberg. See the discussion thread in #8343 for more context.No update on remaining items.

@chloeh13q
Copy link
Contributor Author

chloeh13q commented Mar 6, 2024

Weekly update [3/6/24]

P0

[3 - Array expansion] - PR in review #8511
[4 - ops.ArrayCollect] - issue opened #8555, pausing implementation until Flink 1.20 release
[6 - Temporal join] - issue opened #8247, implementation in progress #8412, which is currently blocked by #8537
[9 - MAP support] - DONE

P1

[1 - Alternative syntax for topk] - planned
[2 - Deduplication using distinct()] - waiting on #7556, which is currently blocked by #8509

P2

[5 - Complex window join] - not started
[7 - Time travel] - issue opened #8203, draft PR in review #8517
[8 - Pattern recognition] - not started
[10 - Temporal join on Iceberg table] - blocked and paused

Ad hoc

We have raised a few additional issues regarding

@chloeh13q
Copy link
Contributor Author

Weekly update [3/14/24]

P0

[3 - Array expansion] - PR #8511, blocked by #8516 (WIP)
[4 - ops.ArrayCollect] - paused
[6 - Temporal join] - issue opened #8247, implementation in progress #8412, which is currently blocked by #8537 (WIP)
[9 - MAP support] - DONE

P1

[1 - Alternative syntax for topk] - under investigation
[2 - Deduplication using distinct()] - blocked by #8509 (WIP)

P2

[5 - Complex window join] - not started
[7 - Time travel] - issue opened #8203, draft PR in review #8517
[8 - Pattern recognition] - WIP
[10 - Temporal join on Iceberg table] - blocked and paused

@chloeh13q
Copy link
Contributor Author

Weekly update [3/20/24]

P0

[3 - Array expansion] - PR #8511, blocked by #8516 (WIP)
[4 - ops.ArrayCollect] - paused
[6 - Temporal join] - issue opened #8247, implementation in progress #8412, which is currently blocked by #8537 (WIP)
[9 - MAP support] - DONE

P1

[1 - Alternative syntax for topk] - under investigation
[2 - Deduplication using distinct()] - blocked by #8509 (WIP)

P2

[5 - Complex window join] - work planned under issue #8710
[7 - Time travel] - issue opened #8203, draft PR in review #8517
[8 - Pattern recognition] - WIP #8692
[10 - Temporal join on Iceberg table] - blocked and paused

@chloeh13q
Copy link
Contributor Author

Weekly update [4/11/24]

P0

[3 - Array expansion] - PR in review #8511
[4 - ops.ArrayCollect] - paused
[6 - Temporal join] - no update
[9 - MAP support] - DONE

P1

[1 - Alternative syntax for topk] - DONE
[2 - Deduplication using distinct()] - no update

P2

[5 - Complex window join] - PR in review #8745
[7 - Time travel] - PR in review #8517
[8 - Pattern recognition] - WIP #8692
[10 - Temporal join on Iceberg table] - no update

@lostmygithubaccount
Copy link
Member

closing out from last quarter

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature Features or general enhancements flink Issues or PRs related to Flink
Projects
Archived in project
Development

No branches or pull requests

5 participants