Skip to content

Conversation

ianton-ru
Copy link

Changelog category (leave one):

  • New Feature

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

SYSTEM PRESHUTDOWN command for graceful shutdown swarm node

Documentation entry for user-facing changes

Solved #759
New command SYSTEM PRESHUTDOWN.
Scenario:
We want to scale down swarm cluster. On node which we want to shutdown we call SYSTEM PRESHUTDOWN, after that node stops to accept new distributed commands. It can still processed objects which is started processed before SYSTEM PRESHUTDOWN. When all that objects successfully processed, we can kill that node without any errors or lost data in responses on initiator.

After SYSTEM PRESHUTDOWN on swarm node:

  • unregister node from autodiscovery clusters if exists
  • stop getting new tasks for objectStorageCluster-family functions (s3Cluster/icebergCluster/etc.)

On initiator node:

  • for all distributed requests with setting skip_unavailable_shards=true unexpected closing of socket is legal if no data packets were accepted before. This allow to shutdown non-autodiscovery node too.

Exclude tests:

  • Fast test
  • Integration Tests
  • Stateless tests
  • Stateful tests
  • Performance tests
  • All with ASAN
  • All with TSAN
  • All with MSAN
  • All with UBSAN
  • All with Coverage
  • All with Aarch64
  • All Regression
  • Disable CI Cache

Copy link

Workflow [PR], commit [e74130f]

@Enmk
Copy link
Member

Enmk commented Sep 12, 2025

test_database_hms/test.py seems to be failing at random

@Enmk Enmk merged commit df3fd1a into antalya-25.6.5 Sep 12, 2025
123 of 136 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants