Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GPU_X] RelVals 141.044407, 141.044483 failing with NotFound #47152

Open
iarspider opened this issue Jan 21, 2025 · 6 comments
Open

[GPU_X] RelVals 141.044407, 141.044483 failing with NotFound #47152

iarspider opened this issue Jan 21, 2025 · 6 comments

Comments

@iarspider
Copy link
Contributor

In CMSSW_15_0_GPU_X_2025-01-20-2300, RelVals 141.044407, 141.044483 failed with NotFound exception:

----- Begin Fatal Exception 21-Jan-2025 03:36:16 CET-----------------------
An exception of category 'NotFound' occurred while
   [0] Processing  Event run: 369978 lumi: 219 event: 198765919 stream: 0
   [1] Running path 'dqmoffline_step'
   [2] Prefetching for module SiPixelCompareVertices/'siPixelCompareVertices'
   [3] Prefetching for module alpaka_serial_sync::PixelVertexProducerAlpakaPhase1/'pixelVerticesAlpakaSerial'
   [4] Prefetching for module alpaka_serial_sync::CAHitNtupletAlpakaPhase1/'pixelTracksAlpakaSerial'
   [5] Prefetching for module alpaka_serial_sync::SiPixelRecHitAlpakaPhase1/'siPixelRecHitsPreSplittingAlpakaSerial'
   [6] Calling method for module alpaka_serial_sync::SiPixelRawToClusterPhase1/'siPixelClustersPreSplittingAlpakaSerial'
Exception Message:
Service Request unable to find requested service with compiler type name ' alpaka_serial_sync::AlpakaService'.
----- End Fatal Exception -------------------------------------------------
@iarspider
Copy link
Contributor Author

assign heterogeneous

@cmsbuild
Copy link
Contributor

New categories assigned: heterogeneous

@fwyzard,@makortel you have been requested to review this Pull request/Issue and eventually sign? Thanks

@cmsbuild
Copy link
Contributor

cms-bot internal usage

@cmsbuild
Copy link
Contributor

A new Issue was created by @iarspider.

@Dr15Jones, @antoniovilela, @makortel, @mandrenguyen, @rappoccio, @sextonkennedy, @smuzaffar can you please review it and eventually sign/assign? Thanks.

cms-bot commands are listed here

@iarspider
Copy link
Contributor Author

The extra space in compiler type name seems suspicious

@makortel
Copy link
Contributor

On a quick look the error looks like a manifestation of #43780. I see the step3 of 141.044407 has --accelerators gpu*, and without hackery that should presently lead to alpaka_serial_sync modules to fail.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants