Migration stuck in CONNECTING state #4797
Comments
da-staging datastore artifacts dst_4or9o54g2 --download ./logs migration:migrations:
|
Seeing another issue where the target says the migration has state |
I20250319 09:45:21.473965 1720 scheduler.cc:480] ------------ Fiber outgoing_migration (suspended:1056085ms) ------------ |
It looks like we can't read from the socket at all.
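For anyone scanning the downloaded logs for the same symptom, here is a minimal sketch that pulls long-suspended fibers out of scheduler dump lines. It assumes the dump lines always look like the one quoted above (`Fiber <name> (suspended:<N>ms)`); the function name and the one-minute threshold are hypothetical, not part of the project. For the quoted line, 1056085 ms is roughly 17.6 minutes, which fits the 15m+ stall.

```python
import re
from datetime import timedelta

# Assumed shape, taken from the single dump line quoted in this thread:
# "... Fiber outgoing_migration (suspended:1056085ms) ..."
FIBER_RE = re.compile(r"Fiber\s+(?P<name>\S+)\s+\(suspended:(?P<ms>\d+)ms\)")


def suspended_fibers(lines, min_suspended=timedelta(minutes=1)):
    """Return (fiber_name, suspended_for) pairs at or above min_suspended."""
    found = []
    for line in lines:
        match = FIBER_RE.search(line)
        if match is None:
            continue  # not a fiber dump line
        suspended_for = timedelta(milliseconds=int(match.group("ms")))
        if suspended_for >= min_suspended:
            found.append((match.group("name"), suspended_for))
    return found


sample = [
    "I20250319 09:45:21.473965 1720 scheduler.cc:480] "
    "------------ Fiber outgoing_migration (suspended:1056085ms) ------------",
]

for name, duration in suspended_fibers(sample):
    print(name, duration)
```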
migrations:
|
I've tried to reproduce it locally in the following ways:
|
Since updating our system tests to `v1.28`, we're seeing some migrations get stuck in a `CONNECTING` state for 15m+, even though both the source and target nodes are healthy. ~50% of our test runs are hitting this issue.
We see `Migration initiating` and `Connecting to target node` in a busy loop for a few seconds on the source node (logged ~30k times in 7 seconds), then no further output, though `SLOT-MIGRATION-STATUS` still reports the state as `CONNECTING`.
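To put a number on the busy loop, a rough sketch like the following can count how often a message is logged per second. It assumes the source node's log uses the same glog-style prefix as the scheduler line quoted in the comments; the sample lines, their timestamps, and the `example.cc:1` location are synthetic placeholders, not real output.

```python
import re
from collections import Counter

# Assumed glog-style prefix: "I20250319 09:45:21.473965 1720 file.cc:480] msg"
GLOG_RE = re.compile(
    r"^[IWEF]\d{8} (?P<hms>\d{2}:\d{2}:\d{2})\.\d{6}\s+\d+\s+\S+:\d+\] (?P<msg>.*)$"
)


def per_second_counts(lines, needle):
    """Count log lines whose message contains needle, keyed by HH:MM:SS."""
    counts = Counter()
    for line in lines:
        match = GLOG_RE.match(line)
        if match and needle in match.group("msg"):
            counts[match.group("hms")] += 1
    return counts


# Synthetic sample lines for illustration only.
sample = [
    "I20250319 09:27:45.000001 1720 example.cc:1] Connecting to target node",
    "I20250319 09:27:45.000002 1720 example.cc:1] Connecting to target node",
    "I20250319 09:27:46.000001 1720 example.cc:1] Migration initiating",
]

for second, count in sorted(per_second_counts(sample, "Connecting to target node").items()):
    print(second, count)
```

A sustained count in the thousands per second followed by silence would match the pattern described above.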
There could be a regression on the control plane, though I don't see any related changes that could have caused this. As far as I can see, the cluster configuration looks valid.
Will keep looking and trying to reproduce, so will add more info...