(on cpu) Example with sycl version: ``` mpirun -n 112 ./shallow_water --sycl -f -t -n 1024 shallow_water: [...]/distributed-ranges/include/./dr/detail/sycl_utils.hpp:124: sycl::event dr::__detail::parallel_for(sycl::queue &, sycl::range<2>, Fn &&) [Fn = (lambda at [...]/distributed-ranges/include/./dr/mhp/algorithms/md_for_each.hpp:142:29) &]: Assertion `global[0] < max && global[1] < max' failed. ``` For 1024 problem size 48 cores is sufficient to reproduce. happens with non sycl build as well