[WIP] Revert PR #17160 #17224
[APPROVALNOTIFIER] This PR is NOT APPROVED. This pull-request has been approved by: The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
cc: @hakman
/test presubmit-kops-aws-small-scale-amazonvpc-using-cl2
/test presubmit-kops-aws-scale-amazonvpc-using-cl2
/test presubmit-kops-aws-scale-amazonvpc-using-cl2
Looking at the latest test failure on Jan 20: https://prow.k8s.io/view/gs/kubernetes-ci-logs/logs/ci-kubernetes-e2e-kops-aws-scale-amazonvpc-using-cl2/1879785938800873472
It is failing for two reasons ^^^:
As for why the APIServer availability SLOs are not being met, the relevant error logs from CL2:
So we need to understand what is going on with the APIServer. Was it overloaded with respect to resources like CPU/Mem/Network? Let's check the apiserver health logs:
Seems like
Inspecting the APIServer logs shows that all 3 instances failed for the same reason, i.e.
Correlating the apiserver healthz failures to the following:
^^^ It appears that
Also around the same min/sec
Next steps to understand:
evaluate ^^^ whether it is due to high CPU/Mem/Network utilization or something else
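The correlation step described above (matching apiserver healthz failures against other events seen around the same minute/second) could be scripted roughly as follows. This is only a minimal sketch under stated assumptions: the function name `correlate`, the 5-second window, the instance names, and the sample timestamps are all hypothetical and are not taken from the actual job logs.

```python
from datetime import datetime, timedelta


def correlate(failures, events, window_seconds=5):
    """Pair each health-check failure with events seen within the window.

    failures: list of (timestamp, instance) tuples
    events:   list of (timestamp, message) tuples
    Returns a list of (instance, failure_time, message) tuples.
    """
    window = timedelta(seconds=window_seconds)
    matches = []
    for f_time, instance in failures:
        for e_time, message in events:
            # Keep the pair when the two timestamps are close enough.
            if abs(f_time - e_time) <= window:
                matches.append((instance, f_time, message))
    return matches


# Hypothetical sample data, not real log entries from the failing run.
t0 = datetime(2025, 1, 1, 12, 0, 0)
failures = [
    (t0, "apiserver-a"),
    (t0 + timedelta(seconds=2), "apiserver-b"),
    (t0 + timedelta(minutes=10), "apiserver-c"),
]
events = [(t0 + timedelta(seconds=1), "hypothetical event of interest")]

matches = correlate(failures, events)
for instance, f_time, message in matches:
    print(f"{instance} failed at {f_time.isoformat()} near: {message}")
```

If most failures across all instances line up with the same kind of event, that points at a shared cause rather than per-instance resource pressure; failures with no nearby event would still need the CPU/Mem/Network check.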
This should likely help with the pre-submit job failure for the aws-scale tests: kubernetes/test-infra#34201
Given that kubernetes/test-infra#34203 is now merged:
/test presubmit-kops-aws-scale-amazonvpc-using-cl2
/test presubmit-kops-aws-scale-amazonvpc-using-cl2
@hakuna-matatah: The following tests failed, say
Testing a revert of #17160 to see if the scale tests succeed: https://testgrid.k8s.io/sig-scalability-aws#ec2-master-scale-performance
The tests started failing after the etcd version bump: 8063d83...581f363
To rule this out, I want to try the revert on a presubmit test.
Related to kubernetes/kubernetes#129593