Skip to content

Jormungandr v0.8.7 is not propagating blocks to the network #1625

@lovelypool

Description

@lovelypool

Describe the bug
I have had numerous blocks on epoch 40 not propagate to the network. My leader logs show the block was signed and sent.

3 examples of 3 consecutively missed blocks:

- created_at_time: "2020-01-23T00:44:54.320114806+00:00" enclave_leader_id: 1 finished_at_time: "2020-01-23T02:26:51.002096232+00:00" scheduled_at_date: "40.12997" scheduled_at_time: "2020-01-23T02:26:51+00:00" status: Block: block: 71bc2c42bfb03ef6c689929d143a6c4c478673b5399fbab48ed63e7d8f718595 chain_length: 122860 wake_at_time: "2020-01-23T02:26:51.001334781+00:00"

- created_at_time: "2020-01-23T00:44:54.320117912+00:00" enclave_leader_id: 1 finished_at_time: "2020-01-23T02:55:29.005382635+00:00" scheduled_at_date: "40.13856" scheduled_at_time: "2020-01-23T02:55:29+00:00" status: Block: block: aa833ac0709d7d2119167e73b8ea815e82fb93add0f85ba9edd5dd5ea67b01e3 chain_length: 122912 wake_at_time: "2020-01-23T02:55:29.001757208+00:00"
- created_at_time: "2020-01-23T00:44:54.320132781+00:00" enclave_leader_id: 1 finished_at_time: "2020-01-23T02:59:41.242523233+00:00" scheduled_at_date: "40.13982" scheduled_at_time: "2020-01-23T02:59:41+00:00" status: Block: block: bbed794f976a44dbfe323e102aa91cfcb8ada9cc2c57ba285f1b4f63fa4077ab chain_length: 122925 wake_at_time: "2020-01-23T02:59:41.241818067+00:00"

I monitor my node for slot latency (how fast did a block propagate to me) and block height delta (did my node miss a block). The blue arrows mark my leadership events. You can see one leadership event may have been missed due to my node skipping a block, but the other 2 do not have any explanation. Other operators in telegram confirmed that they saw no mention of the block hashes above in their logs.

image

My node is set to run with 768 connections, and hovers around 350 established TCP connections. It usually becomes unstable (more missed blocks, more slot latency) within 4-6 hours.

I am also using a gossip interval of 60sec to improve uptime and stability.

Mandatory Information

  1. jcli --full-version output;
    jcli 0.8.7 (HEAD-364cd84e+, release, linux [x86_64]) - [rustc 1.40.0 (73528e339 2019-12-16)]
  2. jormungandr --full-version output;
    jormungandr 0.8.7 (HEAD-364cd84e+, release, linux [x86_64]) - [rustc 1.40.0 (73528e339 2019-12-16)]

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingsubsys-networknetwork related

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions