-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[5pt] Make sure ps-stacks can receive recommendation from Thoth #326
Comments
/priority important-soon |
any update on this? |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /lifecycle stale |
/remove-lifecycle stale |
In the last integration test runs for aws-prod there are errors in some of the ps-* tests: ps-cv-{pytorch,tensorflow} and ps-nlp-tensorflow due to tmieouts:
Other related integration tests succeeded. In the last run of integration tests for smaug-prod, ps-* tests failed with HTTP 400 codes (bad request), e.g.
|
/milestone OKR review Q2 2022 |
/remove-sig user-experience because there are issues resolving the stacks here |
The last integration-tests report (
All of them use These issues can be also supported with the following solving error described in thoth-station/integration-tests#266 (comment). Basically, jupyter-tensorboard expects jupyterlab to be already installed during the installation process (it registers itself). Our solver has no jupyterlab installed when it tries to install jupyter-tensorboard so it fails obtaining dependency information (it was observed for some versions). This behaviour is not very nice, but Python packaging supports it. This can support the first paragraph stated as adviser might be failing to find suitable versions when latest recommendation type is used. To introspect what is happening here, we might:
Also, we can try using user stack scoring and see how the resolver behaves with specific versions of libraries to narrow down to possible issue maker. |
Tested with
|
Tested with
|
Tested with
|
Possible fixes:
|
This means our solvers are not able to solve jupyter-tensorboard or other packages with such requirements, right? |
Generally, no - we are not able to solve libraries that have hard requirements on environment that are not met in our solvers. Ideally, jupyter-tensorboard should not depend on the environment and execute code during the installation process - at least not make it a hard requirement (if it fails, the installed package can still be present). This might get better over time as python packaging evolves (and provides static wheel metadata).
There can be found versions that were removed in the stack info provided to the user:
Thoth also suggested to use it, for example in the first successful resolution with
|
And for others, it looks like it failed as it did not find any resolution in the allocated time. |
ack, thanks for the explanation. |
/remove-label human_intervention_required |
@codificat: The label(s) In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
The integration tests in stage are suffering from cluster issues that have been going on for a while and are expected to take some more time to fix. Meanwhile, though, a current test of all the overlays using the production environment provided successful advice with the The recommendations that are failing fail with the following message:
Below is the current status with each stack. ps-nlp
ps-cv
ps-ip
|
Based on this, I believe we can We still need to ensure that integration tests, that include checks for successful advices on the predictable stacks, run successfully (e.g. thoth-station/integration-tests#324), and possibly review the justification related to the failures on some combination of stack/type. These are tracked in separate issues as appropriate. |
@codificat: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Describe the bug
As User of Thoth PS images,
I want to have continous updates on software stacks to be maintained by Thoth services.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
All ps-* stacks can be advised by Thoth (all integration tests are green for ps-stacks: thoth-station/integration-tests#204)
Screenshots
Additional context
ps-*:
The text was updated successfully, but these errors were encountered: