[BUG] Discovery request queue is full #1257

@benbierens

Description

Storage nodes in the testnet have begun logging:
Exception queueing discovery request topics="codex discoveryengine" tid=1 exc="AsyncQueue is full!"
Occurrences are increasing over time.

This is very probably causing failure to discover blocks -> download slots -> start contracts.

I suspected the discovery workers had stopped processing the queue. But neither of these messages appears in the logs:
trace "Discovery task cancelled"
info "Exiting discovery task runner"

However, looking at proc discoveryTaskLoop, it seems possible for any exception other than CancelledError to stop the worker loop without leaving a log entry. (Shouldn't compiler-checked exceptions make this impossible?)
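
To illustrate the failure mode I have in mind, here is a minimal sketch in Python's asyncio. It is only an analogy: it is not the Codex/chronos code, and `worker_loop`, `failing_handle`, and `demo` are hypothetical names. It assumes a worker loop that handles only cancellation, so any other exception ends the task without a log line, after which the bounded queue fills and the producer sees a "queue full" error.

```python
import asyncio


async def worker_loop(queue, handle):
    # Assumed shape of the loop: only cancellation is caught and logged.
    try:
        while True:
            item = await queue.get()
            await handle(item)
    except asyncio.CancelledError:
        print("Discovery task cancelled")
        raise
    # Any other exception propagates out of the coroutine and ends the
    # task. Nothing is logged unless someone inspects or awaits the task.


async def failing_handle(item):
    # Hypothetical handler that fails with a non-cancellation error.
    raise ValueError(f"cannot handle {item}")


async def demo(queue_size=3, workers=2):
    queue = asyncio.Queue(maxsize=queue_size)
    tasks = [
        asyncio.create_task(worker_loop(queue, failing_handle))
        for _ in range(workers)
    ]
    for i in range(workers):
        queue.put_nowait(i)  # one failing request per worker
    await asyncio.wait(tasks)  # returns once every worker has exited

    dead = sum(t.done() for t in tasks)
    _ = [t.exception() for t in tasks]  # retrieve errors to avoid warnings

    # With no workers left, the producer fills the queue and then fails.
    accepted = 0
    try:
        while True:
            queue.put_nowait(accepted)
            accepted += 1
    except asyncio.QueueFull:
        pass
    return dead, accepted


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

In this sketch the "cancelled" message is never printed, which matches what we see in the logs, yet all workers are gone. Whether the real loop can actually fail this way depends on what the compiler's exception tracking allows.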

Can we confirm somehow whether the discovery worker has been stopped? (There should be 10 of them by default.) Or is there a likely explanation for why the workers can't keep up with the queue?
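
One way to answer the first question would be a periodic liveness report that counts how many workers are still running and how deep the queue is. The sketch below shows the idea in Python's asyncio, again as an analogy rather than chronos code; `worker_report`, `worker`, and the report fields are hypothetical, and it uses 3 workers instead of the default 10 to keep the output small.

```python
import asyncio


def worker_report(tasks, queue):
    # Summarise worker liveness and queue depth for a log line or metric.
    alive = [t for t in tasks if not t.done()]
    failures = [
        repr(t.exception())
        for t in tasks
        if t.done() and not t.cancelled() and t.exception() is not None
    ]
    return {
        "alive": len(alive),
        "stopped": len(tasks) - len(alive),
        "failures": failures,
        "queued": queue.qsize(),
        "capacity": queue.maxsize,
    }


async def worker(queue, crash_on):
    # Hypothetical worker that dies on one specific request.
    while True:
        item = await queue.get()
        if item == crash_on:
            raise RuntimeError(f"bad request {item}")


async def demo():
    queue = asyncio.Queue(maxsize=4)
    tasks = [asyncio.create_task(worker(queue, crash_on=0)) for _ in range(3)]
    queue.put_nowait(0)  # exactly one worker picks this up and crashes
    await asyncio.sleep(0.01)  # give the workers time to run
    report = worker_report(tasks, queue)

    # Clean shutdown of the remaining workers.
    for t in tasks:
        t.cancel()
    await asyncio.gather(*tasks, return_exceptions=True)
    return report


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

If a report like this showed all workers alive while the queue stayed at capacity, that would point to throughput (workers too slow) rather than stopped workers.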

Labels: Client (see https://miro.com/app/board/uXjVNZ03E-c=/ for details), bug (Something isn't working)
