Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Promoting operational errors to critical severity #13930

Open
wants to merge 2 commits into
base: dev
Choose a base branch
from

Conversation

ehellbar
Copy link
Collaborator

@ehellbar ehellbar commented Feb 3, 2025

This PR promotes Error messages which require some action from the shifter to the InfoLogger Ops level.

As a first step, "(core dumped)" messages are set to severity=Error and level=1 in the EPNstderrMonitor task. I will also check for other stderr messages which we might want to promote.

Now, in the second step, we should promote known DPL Erros (https://alice-qc-shifter.docs.cern.ch/pdp/infologgerMessages/ and https://alice-operations.docs.cern.ch/ECS/ECS_operational/#known-messages-in-the-guis) for which we ask the shifter to act (create BKE or call an expert) to level=1.

Copy link
Contributor

github-actions bot commented Feb 3, 2025

REQUEST FOR PRODUCTION RELEASES:
To request your PR to be included in production software, please add the corresponding labels called "async-" to your PR. Add the labels directly (if you have the permissions) or add a comment of the form (note that labels are separated by a ",")

+async-label <label1>, <label2>, !<label3> ...

This will add <label1> and <label2> and removes <label3>.

The following labels are available
async-2023-pbpb-apass4
async-2023-pp-apass4
async-2024-pp-apass1
async-2022-pp-apass7
async-2024-pp-cpass0
async-2024-PbPb-apass1
async-2024-ppRef-apass1

@davidrohr
Copy link
Collaborator

davidrohr commented Feb 3, 2025 via email

@ehellbar
Copy link
Collaborator Author

ehellbar commented Feb 3, 2025

They will be anyway filtered before by mFilters.

@ehellbar ehellbar changed the title Promoting critical Error logs to InfoLogger Ops level Promoting operational errors to critical severity Feb 20, 2025
@ehellbar ehellbar marked this pull request as ready for review March 7, 2025 13:25
@ehellbar ehellbar requested review from a team, wiechula and shahor02 as code owners March 7, 2025 13:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

2 participants