-
Notifications
You must be signed in to change notification settings - Fork 388
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ci system following two merged PRs get stuck #1237
Comments
Here is a test run from #1238 that provides more details about the state of things associated with new failure. Nothing seems wrong to me, but I'm not sure what causes the curl request to get stuck. https://github.com/jupyterhub/binderhub/pull/1238/checks?check_run_id=1657413955 |
I think it has happened before that the |
No because the echoed statement is executes |
Is there a potential race condition? Could the check in Line 22 in f76e37e
succeed before /health and/or the network routing is ready?
|
Hmmmm, well there are no readiness probe for the binder deployment apparently, so just attempting to start (having pulled the image etc) will make the rollout status work clear. So, yes, but only if it is slower than the hub and proxy pod, which it could be. Okay to add a readiness probe? PR in #1242. |
From some experimentation, I found enough evidence to support the idea that @manics guess was correct. Without a readinessProbe, the following only concludes the binder pod has started, not that it has become ready. Line 22 in f76e37e
Due to this, I marked #1242 to close this issue. |
I merged #1236 and #1235 which both seemed fine to me, but now our tests fail in a new way. Did the combination of these PRs introduce a new bug? I'm clueless.
Test failure: https://github.com/jupyterhub/binderhub/actions/runs/466459518
From the namespace report, it seems like the binder pod became ready and running though...
The actual test seem to be...
And the initial output of this test is...
It seems to me that the /health endpoint now fail to respond or perhaps streams a response or similar that is never completed?
The text was updated successfully, but these errors were encountered: