Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[JENKINS-55927] Hook event should not trigger Branch Indexing #908

Open
wants to merge 13 commits into
base: master
Choose a base branch
from

Conversation

Dohbedoh
Copy link
Contributor

@Dohbedoh Dohbedoh commented Nov 8, 2024

JENKINS-55927

Noticed this issue while troubleshooting an environment where Bitbucket event were triggering branch indexing. Haven't yet identified exactly what kind of push events has empty changes. Though per my understanding of the design of the Branch API / SCM API and Pipeline Multibranch, SCM events should not trigger branch indexing ? And this is rather unexpected. Unless I am missing something ?

cc @jenkinsci/bitbucket-branch-source-plugin-developers

Your checklist for this pull request

  • Make sure you are requesting to pull a topic/feature/bugfix branch (right side) and not your master branch!
  • Ensure that the pull request title represents the desired changelog entry
  • Please describe what you did
  • Link to relevant issues in GitHub or in Jenkins JIRA
  • Link to relevant pull requests, esp. upstream and downstream changes
  • Did you provide a test-case? That demonstrates feature works or fixes the issue.

@nfalco79
Copy link
Member

nfalco79 commented Nov 8, 2024

I think, but I need to check, Approved, Approved Removed, Declined event could have not changes. For example declined will trigger the job deletion in multibranch pipeline

@nfalco79 nfalco79 self-requested a review November 8, 2024 11:11
@Dohbedoh
Copy link
Contributor Author

Dohbedoh commented Nov 8, 2024

@nfalco79 Actually according to the code and the HookProcessor that do this, it should really be a push event (could also be a mirror synchronized) but not pull requests... Unless declining a pull requests somehow generate a push event on the base branch..

For example declined will trigger the job deletion in multibranch pipeline

Right, so per my understanding of the SCM API, a decline should just be a REMOVED event and mark the branch job as dead. Branch Indexing will be responsible later on for the cleanup of orphaned items.

Here the behavior of triggering a full scan on a SCMHead event (single branch) can be detrimental. A full scan can be long and expensive (resources and BranchIndexing thread pool on the server but also API requests against Bitbucket).

That is why I think this might be a mistake.

@@ -71,13 +70,6 @@ public void process(HookEventType hookEvent, String payload, BitbucketType insta
push = BitbucketCloudWebhookPayload.pushEventFromPayload(payload);
}
if (push != null) {
String owner = push.getRepository().getOwnerName();
Copy link
Member

@nfalco79 nfalco79 Nov 8, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As described in JIRA issue I think these changes will cause to the user that have setup "Discover pull requests" strategies different than "The current pull request revision" they never get a build for PRs which target branch is the same branch subject of the merged.
If I understand correct the configuration in the JIRA issue the reported would trigger manual reindex to update all PRs. Than this should be archived using a build strategies to ignore some specific index events (maybe already implemented in other plugins)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The JIRA mentions the Received hook from Bitbucket. Processing push event on log line. per my reading, this is really coming from the event subscriber / hook processors (entry point is BitbucketSCMSourcePushHookReceiver). It would happen regardless of the multibranch configuration as far as I can tell.

@nfalco79
Copy link
Member

nfalco79 commented Nov 8, 2024

Here the behavior of triggering a full scan on a SCMHead event (single branch) can be detrimental. A full scan can be long and expensive (resources and BranchIndexing thread pool on the server but also API requests against Bitbucket).

I agree but if you do not perform a BranchIndexing you do not know what are branches involved in changes.

@Dohbedoh
Copy link
Contributor Author

Looking at the history of that change, it was here from the start, before we implemented the SCM API Event mechanism and preserved when we added it https://github.com/jenkinsci/bitbucket-branch-source-plugin/pull/37/files#diff-a5efe0e74b0f75735db75b177d0b8da3426c13be4a855727e8eeb8193f74ff33. Though we were relying on a custom made plugin at that time that does not exist anymore: https://github.com/topicusfinan/bitbucket-webhooks-plugin/

I think we really need to figure out what scenario causes an empty changes. I am quite convinced triggering indexing instead of using a particular Event is wrong.. If we wanted to have indexing triggered, I would think we'd create a SCMSourceEvent instead. Maybe someone closer to the design of SCM API / Branch API could confirm cc @rsandell @amuniz

Per the JIRA issue, this could be a remote merge, that is we have an Open PR, but instead of merging it from Bitbucket, we merge it by hand. Hence we end up with a PR without changes. If that is the case, this really shouldn't trigger an indexing. I would be surprised if someone was relying on such a behavior.

@nfalco79
Copy link
Member

nfalco79 commented Nov 11, 2024

I would take time to test some supported webhook event (I'm collecting them) like:
new tag, that must automatically trigger the tag discover branch
PR merged trigger removal
New PR
Approved
Disapproved

The behaviour to trigger a new build for all the PRs with Merge strategy (build PR commit with merge into master) is by what who setup this expect to ensure the PR is good for merge and avoid trigger manul build. Who do not want -> build strategy to inhibit builds (but not indexing)

@amuniz
Copy link
Member

amuniz commented Nov 12, 2024

It's a really long time I worked on this plugin, so I don't really know. But looking at the change introducing the empty check I think they were interpreted as an empty changeset, so it corresponds to a PR creation/deletion which actually requires a re-index to create/remove the job in Jenkins. Whether that's still needed or not, I don't know, but I think if all the known cases are manually tested and they work, then we are ok.

@Dohbedoh
Copy link
Contributor Author

Dohbedoh commented Nov 13, 2024

But per the design of the API (and like all implementation do) those events do not run branch indexing... A CREATE SCMEvent would create the PR job. A DELETED SCMEvent would mark it as stale...
And those event with empty changes are actually push events, they originate from activity on branches.

Just need to figure out what a push event with empty changes originate from. I think I have a reproducer with Bitbucket Server with Native webhook and also with the add on. Will update this in the JIRA shortly.

@Dohbedoh
Copy link
Contributor Author

Dohbedoh commented Nov 13, 2024

Found 2 scenario where you get a push event with empty changes, in Bitbucket Server with the Native webhook as well as with the Bitbucket add-on. I found 2 scenarios where the empty changes may happen:

Rebase a target branch to a source branch while a PR is open

git checkout main
git checkout -B source-1
echo "test" >> source-1.md
git add . && git commit -m "source-1: $(date)" && git push --set-upstream origin source-1
git push --set-upstream origin source-1
# Open a PR for from the source branch `source-1` to the target branch `main`
git checkout main
git rebase -i source-1
git push -f

This will generate 2 events. The first one with empty changes and the next one with the correct changes made to the target branch. (The PR is automatically merged because there is no diff and that's a default behavior of Bitbucket server).. So the first event is useless. It does not give any clue about what just happened.

Auto-Sync of Fork when a branch cannot be synchronized

When the Multibranch Pipeline is a fork and Automatic Fork Syncing is enabled. If a branch on a fork cannot be synchronized, the attempt from Bitbucket to synchronize it generates a push event with empty changes. This seems also useless. The event does not contain information that allow us to know what just happened, and the refs is actually unchanged...


Both the add-on and the native webhook results behave the same. Per my understanding the add-on relies on the native mechanism but add some extra configuration..

So far, I don't see a valid scenario where a push event with empty changes should be considered. Or why it would trigger branch indexing. If we are skeptical about whether anybody expect this behavior, I propose to add a system property to allow use to re-enable this.

@Dohbedoh Dohbedoh requested a review from nfalco79 November 13, 2024 12:10
@nfalco79
Copy link
Member

The behaviour to trigger a new build for all the PRs with Merge strategy (build PR commit with merge into master) is by what who setup this expect to ensure the PR is good for merge and avoid trigger manual build.

Issue #316

@Dohbedoh
Copy link
Contributor Author

Actually this behavior is not what happens with Bitbucket Branch Source at the moment. Those events that trigger branch indexing are happening on very specific scenario where you would not expect it (see my previous comment).
What people are asking in #316 are IMO a feature for SCM API / Branch API. By default the API does not go and observe source branches when a target one is updated.

@@ -86,6 +85,7 @@ public void process(HookEventType type, String payload, BitbucketType instanceTy
/**
* To be called by implementations once the owner and the repository have been extracted from the payload.
*
* @deprecated SCM Event should not trigger Branch Indexing
Copy link
Member

@nfalco79 nfalco79 Jan 11, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it's not possible, this will have a worster impact on issues like #539

Copy link
Contributor Author

@Dohbedoh Dohbedoh Jan 20, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The impact really depend on a the environment. User that have large repositories with hundreds of branches / PRs cannot afford frequent branch indexing.
In case of #539, branch indexing is not a good solution IMO. A pull request decline event should not trigger branch indexing, but rather update the source and target based on discovery criteria through SCMHeadEvents.
Maybe the description could be improved.

Copy link
Member

@nfalco79 nfalco79 Jan 20, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

but rather update the source and target based on discovery criteria through SCMHeadEvents.

This is hard because you should apply any registered trait (also external to this plugin) and simulate if any existing job is related to the target branch and in case delete or create.
In master I start to add unit test collecting some bitbucket cloud events and I'm providing test on how I expect they should do

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants