Improve logging of rebalancer and recovery #586

mrForza · 2025-08-12T17:01:12Z

Before this patch, "Finish bucket recovery step ..." logs were printed at
the end of recovery even if no buckets were successfully recovered, which
led to unnecessary log entries. This patch fixes the issue by adding a
check for the number of recovered buckets.

Closes #212

NO_DOC=bugfix

Serpentian

These are the comments for the first two commits; more comments are coming later) Thank you for working on this. Good logging is crucial and allows us to investigate what happened during incidents.

test/storage-luatest/storage_1_1_1_test.lua

vshard/storage/init.lua

Serpentian

Oh, shit. I forgot to send the last message of review, I'm very sorry

vshard/storage/init.lua

test/storage-luatest/storage_1_1_1_test.lua

Serpentian

These are the final comments I have, the patch is pretty clean now)

vshard/storage/init.lua

test/storage-luatest/storage_1_1_1_test.lua

Before this patch "Finish bucket recovery step ..." logs were printed at the end of recovery even if no buckets were successfully recovered. It led to unnecessary log records. This patch fixes the issue by adding an additional check for the number of recovered buckets.

Part of tarantool#212

NO_DOC=bugfix

This patch introduces logging of the ids of buckets that were recovered during the storage recovery stage.

Part of tarantool#212

NO_DOC=bugfix
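The two recovery commits above can be pictured with a minimal Lua sketch. It assumes a hypothetical helper, `report_recovered`, and a placeholder message text. Neither is taken from vshard, and the actual "Finish bucket recovery step ..." wording is not reproduced here.

```lua
-- Hypothetical helper, not part of vshard: builds the summary line for one
-- recovery step, or returns nil when nothing was recovered.
local function report_recovered(recovered_ids)
    if #recovered_ids == 0 then
        -- No buckets were recovered, so no log record is produced.
        return nil
    end
    -- Placeholder wording; the real vshard message differs.
    return string.format('recovered %d bucket(s): %s',
                         #recovered_ids, table.concat(recovered_ids, ', '))
end

-- A caller would log only when there is something to report.
local msg = report_recovered({3, 7, 12})
if msg ~= nil then
    print(msg)
end
```

Returning nil for the empty case keeps the decision to log in one place, which is roughly the effect the extra check in the patch is described as having.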

Serpentian

Final nits

Serpentian · 2025-10-08T08:29:14Z

test/storage-luatest/storage_1_1_1_test.lua

+        end)
+        t.assert(g.replica_1_a:grep_log(
+            'Apply rebalancer routes with 1 workers'))
+        end)


Nit: the indent is not correct here

Fixed. I also changed the indentation on lines 211-214.

Serpentian · 2025-10-08T08:32:19Z

vshard/storage/init.lua

        end
-        log.info('Rebalance routes are sent. Schedule next wakeup after '..
-                 '%f seconds', consts.REBALANCER_WORK_INTERVAL)
+        log.info('Next rebalancer routes were sent: %s. Schedule next ' ..


Nit: sounds incorrect grammatically) Let's better say "The following rebalancer routes were sent", or you can just leave it as it was in order not to change the existing tests)

Serpentian · 2025-10-08T08:40:59Z

test/storage-luatest/storage_1_1_1_test.lua

+                                             g.replica_2_a:replicaset_uuid())
+    t.assert(g.replica_1_a:grep_log(rebalancer_routes_msg))
+    start_bucket_move(g.replica_1_a, g.replica_2_a, moved_bucket_from_2)
+    start_bucket_move(g.replica_1_a, g.replica_3_a, moved_bucket_from_3)


You're moving the buckets with rebalancer, why do you need to manually move them then?

This patch adds logging of rebalancer routes. The log now includes the source storage, the number of buckets, and the destination storage to which the buckets will be moved. Since the rebalancer service logs the sent routes differently now, the `rebalancer/rebalancer.test.lua` and `rebalancer/stress_add_remove_several_rs.test.lua` tests are updated accordingly.

Part of tarantool#212

NO_DOC=bugfix
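As an illustration of the kind of record described in this commit, here is a small Lua sketch. The `format_routes` function, the shape of the `routes` table, and the output format are all assumptions made for the example and are not taken from vshard.

```lua
-- Hypothetical formatter, not vshard code. `routes` is assumed to map a
-- source replicaset id to a table of {destination id = bucket count}.
local function format_routes(routes)
    local parts = {}
    for src, dsts in pairs(routes) do
        for dst, count in pairs(dsts) do
            table.insert(parts,
                         string.format('%s -> %s: %d', src, dst, count))
        end
    end
    -- pairs() has no defined order, so sort for stable log output.
    table.sort(parts)
    return table.concat(parts, '; ')
end

print(format_routes({rs1 = {rs2 = 10, rs3 = 5}}))
```

Sorting the entries makes the line deterministic, which matters when tests match the log with `grep_log`.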

Before this patch the function `rebalancer_download_states` didn't return information about the replicaset from which the states could not be downloaded. As a result, the log "Some buckets are not active ..." lacked valuable information about the unhealthy replicaset. Now it returns `(replicaset.id, nil)` instead of `nil` when the rebalancer can't download the state from a replicaset, and replicaset.id is added to the "Some buckets are not active ..." log. The `rebalancer/rebalancer.test.lua` test, which expected the old "Some buckets are not active" log without replicaset.id, is changed accordingly.

Closes tarantool#212

NO_DOC=bugfix
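The idea of returning the failing replicaset's id can be sketched as follows. This is not the vshard implementation: `download_states`, `fetch_state`, and the success-case return values are hypothetical and chosen only for the illustration. The source states only that `(replicaset.id, nil)` is returned on failure.

```lua
-- Hypothetical sketch, not the vshard implementation. `fetch_state` stands
-- in for whatever call downloads the state of one replicaset.
local function download_states(replicasets, fetch_state)
    local states = {}
    for _, rs in ipairs(replicasets) do
        local state = fetch_state(rs)
        if state == nil then
            -- Report which replicaset failed instead of a bare nil.
            return rs.id, nil
        end
        states[rs.id] = state
    end
    -- Success convention assumed for this sketch: no failed id, all states.
    return nil, states
end

local failed_id, states = download_states(
    {{id = 'rs1'}, {id = 'rs2'}},
    function(rs) return rs.id ~= 'rs2' and {active = 1} or nil end)
if states == nil then
    print(('state of replicaset %s could not be downloaded'):format(failed_id))
end
```

With the id available at the call site, the caller can name the unhealthy replicaset in its log message rather than reporting a generic failure.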

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch from 6b1057d to 64cc837 on August 15, 2025 09:27

mrForza requested a review from Serpentian August 15, 2025 09:42

mrForza assigned Serpentian Aug 15, 2025

Serpentian reviewed Aug 20, 2025

Serpentian assigned mrForza and unassigned Serpentian Aug 20, 2025

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch 3 times, most recently from 5a8b3f8 to f5c25f7 on August 22, 2025 15:52

mrForza assigned Serpentian and unassigned mrForza Aug 23, 2025

mrForza requested a review from Serpentian August 23, 2025 13:17

Serpentian reviewed Aug 25, 2025

Serpentian reviewed Aug 25, 2025

Serpentian assigned mrForza and unassigned Serpentian Aug 25, 2025

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch 2 times, most recently from 04c506f to ccff54f on September 10, 2025 07:47

mrForza assigned Serpentian and unassigned mrForza Sep 10, 2025

mrForza requested a review from Serpentian September 10, 2025 08:07

Serpentian requested changes Sep 15, 2025

Serpentian assigned mrForza and unassigned Serpentian Sep 15, 2025

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch 3 times, most recently from 46add65 to a1c095b on September 17, 2025 13:22

mrForza assigned Serpentian and unassigned mrForza Sep 17, 2025

mrForza requested a review from Serpentian September 17, 2025 13:23

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch from a1c095b to 1da8c2c on September 17, 2025 13:48

Serpentian reviewed Sep 19, 2025

Serpentian assigned mrForza and unassigned Serpentian Sep 19, 2025

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch from 1da8c2c to f07abe5 on September 19, 2025 16:18

mrForza assigned Serpentian and unassigned mrForza Sep 19, 2025

mrForza requested a review from Serpentian September 19, 2025 18:23

Serpentian requested changes Sep 30, 2025

Serpentian assigned mrForza and unassigned Serpentian Sep 30, 2025

mrForza added 2 commits October 2, 2025 13:59

recovery: add logging of recovered buckets

a7603bc

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch 2 times, most recently from 871197d to 489b425 on October 3, 2025 08:18

mrForza requested a review from Serpentian October 3, 2025 09:46

mrForza assigned Serpentian and unassigned mrForza Oct 3, 2025

Serpentian reviewed Oct 8, 2025

Serpentian assigned mrForza and unassigned Serpentian Oct 8, 2025

mrForza added 2 commits October 9, 2025 12:10

mrForza force-pushed the mrforza/gh-212-improvement-of-rebalancer-logging branch from 489b425 to fce8f28 on October 9, 2025 09:21

mrForza removed their assignment Oct 10, 2025

mrForza requested a review from Serpentian October 10, 2025 12:40

mrForza assigned Serpentian Oct 10, 2025
