Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DNM] persist: communicate projection pushdown via shard_source read schema #29539

Closed
wants to merge 1 commit into from

Conversation

danhhz
Copy link
Contributor

@danhhz danhhz commented Sep 13, 2024

Motivation

Tips for reviewer

Checklist

  • This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
  • This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
  • If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
  • If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
  • If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.

@danhhz
Copy link
Contributor Author

danhhz commented Sep 26, 2024

@ParkMyCar Here's the projection pushdown branch if you're interested

@ParkMyCar
Copy link
Member

Thanks @danhhz!

ParkMyCar added a commit that referenced this pull request Jan 7, 2025
…30764)

Continuation of #29539

Implements a limited form of projection pushdown by applying the
`demands` of a `MapFilterProject` onto the `RelationDesc` of a
collection, and using the resulting `RelationDesc` in `shard_source`.
All of the columns will still be fetched from S3, but when decoding
structured data we'll drop the unneeded columns, and decoding `ProtoRow`
data we'll skip the unneeded `Datum`s.

### Motivation

Progress towards:
MaterializeInc/database-issues#8402

Fixes MaterializeInc/database-issues#8825

### Tips for reviewers

The implementation for the feature is in the first commit, and can be
reviewed on its own. The later commits add testing and metrics and
theoretically could be their own PRs, so it might be nice to review one
commit at a time.

### Checklist

- [x] This PR has adequate test coverage / QA involvement has been duly
considered. ([trigger-ci for additional test/nightly
runs](https://trigger-ci.dev.materialize.com/))
- [x] This PR has an associated up-to-date [design
doc](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/design/README.md),
is a design doc
([template](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/design/00000000_template.md)),
or is sufficiently small to not require a design.
  <!-- Reference the design in the description. -->
- [x] If this PR evolves [an existing `$T ⇔ Proto$T`
mapping](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/command-and-response-binary-encoding.md)
(possibly in a backwards-incompatible way), then it is tagged with a
`T-proto` label.
- [x] If this PR will require changes to cloud orchestration or tests,
there is a companion cloud PR to account for those changes that is
tagged with the release-blocker label
([example](MaterializeInc/cloud#5021)).
<!-- Ask in #team-cloud on Slack if you need help preparing the cloud
PR. -->
- [x] If this PR includes major [user-facing behavior
changes](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/guide-changes.md#what-changes-require-a-release-note),
I have pinged the relevant PM to schedule a changelog post.
@danhhz
Copy link
Contributor Author

danhhz commented Jan 13, 2025

Cleaning up old PRs

@danhhz danhhz closed this Jan 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants