Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore(eks): document failure conditions for albController #28419

Merged
merged 4 commits into from
Dec 19, 2023

Conversation

pahud
Copy link
Contributor

@pahud pahud commented Dec 19, 2023

albController would fail the deployment if there's no defaultCapacity or nodegroup. This PR improves the doc about it.

Documents issue from #22005 but further effort is needed to create a synth-time error.


By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license

@github-actions github-actions bot added bug This issue is a bug. p2 labels Dec 19, 2023
@aws-cdk-automation aws-cdk-automation requested a review from a team December 19, 2023 00:21
@mergify mergify bot added the contribution/core This is a PR that came from AWS. label Dec 19, 2023
@kaizencc kaizencc changed the title chore: improve the aws-eks doc for albController chore(eks): document failure conditions for albController Dec 19, 2023
Copy link
Contributor

@kaizencc kaizencc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Does AWS have any documentation of this requirement that we can link to? It seems quite weird.

  2. Can we account for this in code and throw a synth-time error?

@aws-cdk-automation aws-cdk-automation added the pr/needs-maintainer-review This PR needs a review from a Core Team Member label Dec 19, 2023
@pahud
Copy link
Contributor Author

pahud commented Dec 19, 2023

  1. Does AWS have any documentation of this requirement that we can link to? It seems quite weird.

We checked the doc, unfortunately it doesn't mention any about that and according to our test, the node availability is required for a node-based cluster.

  1. Can we account for this in code and throw a synth-time error?

Yes we probably can do that but this might take a while for further investigation and experiment. Before that, it would be helpful to add a quick note in the README like this to avoid potential errors.

Copy link
Contributor

@kaizencc kaizencc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@pahud ok, I've approved this as it moves us along. But I am stopping it from closing the linked issue as I think the synth-time error is really what we need. If you think the issue can be closed by this doc update, then go ahead and override my thoughts here, I'm not too integrated into this issue.

@aws-cdk-automation aws-cdk-automation removed the pr/needs-maintainer-review This PR needs a review from a Core Team Member label Dec 19, 2023
Copy link
Contributor

mergify bot commented Dec 19, 2023

Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork).

@aws-cdk-automation
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: AutoBuildv2Project1C6BFA3F-wQm2hXv2jqQv
  • Commit ID: cc06f50
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@mergify mergify bot merged commit 7c2b110 into aws:main Dec 19, 2023
9 checks passed
Copy link
Contributor

mergify bot commented Dec 19, 2023

Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork).

@pahud
Copy link
Contributor Author

pahud commented Dec 19, 2023

@kaizencc

Agree. But there is a case if the cluster and nodegroup are in different stacks, the cluster construct would not be able to check the nodegroup availability in another stack.

Another solution off the top of my head is to expose the wait property to the service which default is true that blocks the deployment. Making it optionally false should not block the stack deployment. Under the hood the lambda custom resource should immediate return after executing the helm command.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug This issue is a bug. contribution/core This is a PR that came from AWS. p2
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants