@ldoktor ldoktor commented Nov 7, 2025

When testing manually, we are hitting the 4h timeout. Let's increase the timeout to allow longer execution, and move cucushift-installer-wait to the test phase to avoid interrupting the cleanup.

As suggested on Slack, this probably needs to be merged before the timeout is used.

@openshift-ci openshift-ci bot requested review from pmores and tbuskey November 7, 2025 16:10
- chain: sandboxed-containers-operator-pre
post:
- ref: sandboxed-containers-operator-gather-must-gather
- ref: cucushift-installer-wait

Hi @ldoktor, I'm not sure about that change. It's not wrong with the current setup, but I'd like to discuss this rule of only failing the job on infra errors, because it gives false negatives in automated CI.

@tbuskey FYI


Let's try it

@tbuskey tbuskey left a comment


Can the changes be added to sandboxed-containers-operator-create-prowjob-commands.sh?

We should move to generating openshift-sandboxed-containers-operator-devel__downstream-candidate.yaml and openshift-sandboxed-containers-operator-devel__downstream-release.yaml with the script, and likewise the 4.17-4.20 versions.


We're hitting the 4h timeout when running the manual suite via
cucushift-installer-wait. Increase the default timeout to 24h.

Signed-off-by: Lukáš Doktor <[email protected]>
If cucushift-installer-wait fails, the subsequent cleanup steps are
not executed and the cloud cleanup is not performed, leaving resources
behind. Moving it to the test phase ensures the cleanup is executed.

There is one drawback: a failed "e2e" execution results in a skipped
cucushift-installer-wait. This shouldn't often be a problem, as we only
fail on infra errors, not on test failures.

Signed-off-by: Lukáš Doktor <[email protected]>
We only want to allow a 12h reservation.

Signed-off-by: Lukáš Doktor <[email protected]>
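
The phase move described in the commit messages above can be illustrated with a toy model. This is only a sketch, not ci-operator's implementation. It assumes that test-phase steps run in order and stop at the first failure, and that post-phase steps always run. The `run_job` helper and the `cloud-cleanup` step name are hypothetical; the other step names come from this PR.

```python
from typing import Dict, List, Set


def run_job(test_steps: List[str], post_steps: List[str], failing: Set[str]) -> Dict[str, str]:
    """Toy model (assumption): test steps stop at the first failure, post steps always run."""
    results: Dict[str, str] = {}
    failed = False
    for name in test_steps:
        if failed:
            # An earlier test step failed, so the rest of the test phase is skipped.
            results[name] = "skipped"
        elif name in failing:
            results[name] = "failed"
            failed = True
        else:
            results[name] = "passed"
    for name in post_steps:
        # Post steps run regardless of what happened in the test phase.
        results[name] = "failed" if name in failing else "passed"
    return results


# Layout after this PR: the wait step follows "e2e" in the test phase.
TEST_STEPS = ["e2e", "cucushift-installer-wait"]
# "cloud-cleanup" is a hypothetical stand-in for the real cleanup steps.
POST_STEPS = ["sandboxed-containers-operator-gather-must-gather", "cloud-cleanup"]

wait_fails = run_job(TEST_STEPS, POST_STEPS, {"cucushift-installer-wait"})
e2e_fails = run_job(TEST_STEPS, POST_STEPS, {"e2e"})

print(wait_fails)
print(e2e_fails)
```

Under these assumptions, a failing wait step no longer prevents the cleanup from running, while a failing "e2e" step causes the wait step to be skipped, which is the drawback the commit message mentions.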

ldoktor commented Nov 12, 2025

Changes:

  • rebased
  • added timeouts to the sandboxed-containers-operator-create-prowjob-commands.sh

@openshift-ci-robot

[REHEARSALNOTIFIER]
@ldoktor: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

| Test name | Repo | Type | Reason |
|---|---|---|---|
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-release-azure-ipi-kata | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-candidate-aws-ipi-coco | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-release-aws-ipi-coco | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-release-aws-ipi-peerpods | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-candidate-azure-ipi-kata | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-candidate-aws-ipi-peerpods | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-candidate-azure-ipi-coco | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-release-azure-ipi-coco | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-release-azure-ipi-peerpods | N/A | periodic | Ci-operator config changed |
| periodic-ci-openshift-sandboxed-containers-operator-devel-downstream-candidate-azure-ipi-peerpods | N/A | periodic | Ci-operator config changed |
Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@wainersm

@ldoktor it looks good to me. Do you want @tbuskey to review it too? Otherwise I can place the label.


tbuskey commented Nov 12, 2025

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Nov 12, 2025

openshift-ci bot commented Nov 12, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ldoktor, tbuskey


@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 12, 2025

ldoktor commented Nov 13, 2025

Since there is a consensus that it should be merged, let me add the rehearsal ack and update the dependent PR...

/pj-rehearse ack

@openshift-ci-robot

@ldoktor: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@openshift-ci-robot openshift-ci-robot added the rehearsals-ack Signifies that rehearsal jobs have been acknowledged label Nov 13, 2025

openshift-ci bot commented Nov 13, 2025

@ldoktor: all tests passed!

@openshift-merge-bot openshift-merge-bot bot merged commit 93eda0c into openshift:master Nov 13, 2025
15 checks passed
namansharma18899 pushed a commit to namansharma18899/release that referenced this pull request Nov 24, 2025
…penshift#71161)

* ci-operator.config.osc: Increase job timeout

We're hitting the 4h timeout when running the manual suite via
cucushift-installer-wait. Increase the default timeout to 24h.

Signed-off-by: Lukáš Doktor <[email protected]>

* ci-operator.step: Move cucushift-installer-wait to test phase

If cucushift-installer-wait fails, the subsequent cleanup steps are
not executed and the cloud cleanup is not performed, leaving resources
behind. Moving it to the test phase ensures the cleanup is executed.

There is one drawback: a failed "e2e" execution results in a skipped
cucushift-installer-wait. This shouldn't often be a problem, as we only
fail on infra errors, not on test failures.

Signed-off-by: Lukáš Doktor <[email protected]>

* ci-operator.step: Limit cucushift-installer-wait to 12h

We only want to allow a 12h reservation.

Signed-off-by: Lukáš Doktor <[email protected]>

---------

Signed-off-by: Lukáš Doktor <[email protected]>