Verifiable payer #544

mkysel · 2025-02-24T15:53:26Z

This adds a new field to the payer envelope that prevents a node from stealing payloads #474

Additionally it gets rid of old AAD validation #463

The bulk of the changeset allows the payer to pick a different node if the connection to the ideal node has failed. This is obviously not ideal, because it will keep retrying even if the node is down long term, but in the short/medium term this is sufficient.

Fixes #474 #463

Summary by CodeRabbit

New Features
- Introduced a retry mechanism for publishing messages that dynamically selects alternative nodes upon failure.
- Enhanced node selection to automatically exclude unavailable nodes, improving message routing reliability.
- Added a new field for TargetOriginator in payer envelopes to improve message association.
Improvements
- Strengthened message validation and signature verification for better envelope integrity.
- Streamlined error handling, resulting in clearer feedback in messaging workflows.
- Enhanced test coverage for scenarios involving banned nodes.
Refactoring
- Unified parameter usage in envelope creation to ensure consistency across message processing.

coderabbitai · 2025-02-24T15:53:37Z

Walkthrough

This pull request makes extensive changes to envelope creation, validation, and publishing. Several tests are updated to pass a new node identifier parameter and adjust expectations around mismatching originators. The service layer now performs additional checks on envelope originators and recovers signers with the new parameter. A retry mechanism is introduced for publishing envelopes with node selection that respects a ban list. Function signatures in utility packages are modified to incorporate the originator ID, and a new constant is added. There are also routine updates to mock version comments.

Changes

File(s)	Change Summary
`pkg/api/message/publish_test.go`	Updated test calls for `CreatePayerEnvelope` to include `DefaultClientEnvelopeNodeId`; renamed and added test functions to differentiate originator validation.
`pkg/api/message/service.go`	Enhanced `validatePayerEnvelope` with new originator checks and removed redundant check from `validateClientInfo`.
`pkg/api/payer/nodeSelector.go`	Modified `GetNode` signature to include a variadic `banlist` and added logic to skip banned nodes.
`pkg/api/payer/nodeSelector_test.go`	Added `TestGetNode_NodeGetNextIfBanned` to verify proper node selection when nodes are banned.
`pkg/api/payer/publish_test.go`	Adjusted mock expectations: removed hardcoded originator values and now assert the `TargetOriginator` post publish.
`pkg/api/payer/service.go`	Introduced `publishToNodeWithRetry` with retry logic; updated signing methods (`signAllClientEnvelopes` and `signClientEnvelope`) to accept an `originatorID`; refined error logging in blockchain publishing.
`pkg/constants/constants.go`	Added new constant `TARGET_ORIGINATOR_SEPARATION_LABEL = "target
`pkg/envelopes/envelopes_test.go`	Updated tests to pass an additional `originatorNodeId` argument when calling `CreatePayerEnvelope`.
`pkg/envelopes/payer.go`	Added a new field `TargetOriginator` to the `PayerEnvelope` struct and updated envelope creation and signer recovery logic accordingly.
`pkg/indexer/e2e_test.go`, `pkg/indexer/storer/groupMessage_test.go`	Removed an extra parameter from `CreateGroupMessageClientEnvelope` calls and renamed `topic` to `msgTopic` for consistency.
`pkg/server/server_test.go`	Modified tests to include specific node ID parameters (`nodeId1`, `nodeId2`) in envelope creation; reformatted `AuthenticatedData` structure.
`pkg/testutils/envelopes/envelopes.go`	Added constant `DefaultClientEnvelopeNodeId` and updated function signatures for `CreatePayerEnvelope` and `CreateGroupMessageClientEnvelope` (removed the target originator parameter).
`pkg/utils/hash.go`	Updated `HashPayerSignatureInput` to accept an additional `originatorID` parameter and incorporate it into the hash computation.
`pkg/utils/signature.go`	Updated `SignClientEnvelope` to include `originatorID` in its signature process by calling the updated hash function.
Various files under `pkg/mocks/...`	Updated mock version comments from v2.51.1 to v2.52.2 without altering functionality.

Sequence Diagram(s)

sequenceDiagram
    participant S as Service
    participant NS as NodeSelector
    participant N as Node
    
    S->>NS: GetNode(topic, banlist...)
    NS-->>S: Return available nodeID
    S->>N: publishEnvelope(envelopes)
    alt Publish fails
        N-->>S: Error response
        S->>NS: Update banlist & retry GetNode
        S->>N: publishEnvelope(envelopes) (Retry)
    else Publish succeeds
        N-->>S: Success response
    end

Possibly related PRs

Stable hashing for node selection in the payer #459: Updates in the publish_test.go regarding the handling of the TargetOriginator field align with changes in this PR.
Print error on failed blockchain publish #327: Modifications in the service layer’s envelope validation and processing in service.go have strong code-level parallels.
Guarantee your own commits via a blocking payer #441: Changes involving the CreatePayerEnvelope function and the addition of TargetOriginator in PayerEnvelope closely relate to this PR’s envelope parameter enhancements.

Suggested reviewers

fbac
neekolas

Warning

There were issues while running some tools. Please review the errors and either fix the tool’s configuration or disable the tool if it’s a critical failure.

🔧 golangci-lint (1.62.2)

Error: can't load config: the Go language version (go1.23) used to build golangci-lint is lower than the targeted Go version (1.24)
Failed executing command with error: can't load config: the Go language version (go1.23) used to build golangci-lint is lower than the targeted Go version (1.24)

📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 6914cf6 and 16f0c05.

📒 Files selected for processing (1)

pkg/api/payer/nodeSelector_test.go (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

pkg/api/payer/nodeSelector_test.go

⏰ Context from checks skipped due to timeout of 90000ms (2)

GitHub Check: Test (Node)
GitHub Check: Build pre-baked anvil-xmtpd

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR. (Beta)
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (5)

pkg/envelopes/payer.go (1)
14-18: Add documentation for the new TargetOriginator field.

While the implementation is correct, adding documentation would help explain the field's purpose in preventing payload stealing.
 type PayerEnvelope struct {
 	proto            *envelopesProto.PayerEnvelope
 	ClientEnvelope   ClientEnvelope
+	// TargetOriginator is the ID of the node this envelope is intended for,
+	// preventing other nodes from stealing and republishing the payload
 	TargetOriginator uint32
 }
pkg/api/message/service.go (1)
410-416: Improve error messages for better debugging.

While the validation checks are correct and align with preventing node payload stealing, the error messages could be more descriptive.

Apply this diff to improve error messages:
 if payerEnv.TargetOriginator != s.registrant.NodeID() {
-    return nil, status.Errorf(codes.InvalidArgument, "invalid target originator")
+    return nil, status.Errorf(codes.InvalidArgument, "target originator %d does not match node ID %d", payerEnv.TargetOriginator, s.registrant.NodeID())
 }

 if _, err = payerEnv.RecoverSigner(); err != nil {
-    return nil, err
+    return nil, status.Errorf(codes.InvalidArgument, "failed to recover signer: %v", err)
 }
pkg/api/payer/service.go (2)
168-202: Consider implementing a brief backoff period in the retry loop.
Currently, the loop attempts each retry immediately in quick succession. Introducing an exponential or fixed delay helps avoid potential tight loops and reduces load or noise if a node is persistently down.
 for retries := 0; retries < 5; retries++ {
   result, err = s.publishToNodes(ctx, nodeID, indexedEnvelopes)
   if err == nil {
     return result, nil
   }
+  // Example of a simple backoff
+  time.Sleep(time.Duration(retries+1) * time.Second)
   ...
 }
380-399: Validate the consistency of originator information.
Consider verifying that the originatorID parameter matches the data in clientEnvelope.Aad() to prevent potential spoofing attempts where these might differ.
pkg/api/payer/nodeSelector.go (1)

33-85: Implementation of banlist logic works but could benefit from further efficiency consideration.
Flattening the banlist into a map and iterating is clear and correct. If banlists become large or concurrency is introduced, more sophisticated coordination or data structures might be beneficial, but this is sufficient for many use cases.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 1f8192f and 6914cf6.

⛔ Files ignored due to path filters (3)

pkg/proto/openapi/xmtpv4/message_api/message_api.swagger.json is excluded by !pkg/proto/**
pkg/proto/openapi/xmtpv4/message_api/misbehavior_api.swagger.json is excluded by !pkg/proto/**
pkg/proto/xmtpv4/envelopes/envelopes.pb.go is excluded by !**/*.pb.go, !pkg/proto/**

📒 Files selected for processing (26)

pkg/api/message/publish_test.go (11 hunks)
pkg/api/message/service.go (1 hunks)
pkg/api/payer/nodeSelector.go (3 hunks)
pkg/api/payer/nodeSelector_test.go (1 hunks)
pkg/api/payer/publish_test.go (2 hunks)
pkg/api/payer/service.go (8 hunks)
pkg/constants/constants.go (1 hunks)
pkg/envelopes/envelopes_test.go (4 hunks)
pkg/envelopes/payer.go (3 hunks)
pkg/indexer/e2e_test.go (3 hunks)
pkg/indexer/storer/groupMessage_test.go (2 hunks)
pkg/mocks/authn/mock_JWTVerifier.go (1 hunks)
pkg/mocks/blockchain/mock_ChainClient.go (1 hunks)
pkg/mocks/blockchain/mock_IBlockchainPublisher.go (1 hunks)
pkg/mocks/indexer/mock_ChainReorgHandler.go (1 hunks)
pkg/mocks/indexer/mock_IBlockTracker.go (1 hunks)
pkg/mocks/metadata_api/mock_MetadataApiClient.go (1 hunks)
pkg/mocks/mls_validationv1/mock_ValidationApiClient.go (1 hunks)
pkg/mocks/mlsvalidate/mock_MLSValidationService.go (1 hunks)
pkg/mocks/registry/mock_NodeRegistry.go (1 hunks)
pkg/mocks/registry/mock_NodesContract.go (1 hunks)
pkg/mocks/storer/mock_LogStorer.go (1 hunks)
pkg/server/server_test.go (3 hunks)
pkg/testutils/envelopes/envelopes.go (7 hunks)
pkg/utils/hash.go (1 hunks)
pkg/utils/signature.go (1 hunks)

✅ Files skipped from review due to trivial changes (11)

pkg/mocks/registry/mock_NodesContract.go
pkg/mocks/blockchain/mock_ChainClient.go
pkg/mocks/indexer/mock_IBlockTracker.go
pkg/mocks/mlsvalidate/mock_MLSValidationService.go
pkg/mocks/indexer/mock_ChainReorgHandler.go
pkg/mocks/authn/mock_JWTVerifier.go
pkg/mocks/storer/mock_LogStorer.go
pkg/mocks/mls_validationv1/mock_ValidationApiClient.go
pkg/mocks/registry/mock_NodeRegistry.go
pkg/mocks/blockchain/mock_IBlockchainPublisher.go
pkg/mocks/metadata_api/mock_MetadataApiClient.go

🧰 Additional context used

🪛 GitHub Check: Lint-Go

pkg/api/payer/nodeSelector_test.go

[failure] 262-262:
ineffectual assignment to reselectedNode (ineffassign)

⏰ Context from checks skipped due to timeout of 90000ms (2)

GitHub Check: Build pre-baked anvil-xmtpd
GitHub Check: Test (Node)

🔇 Additional comments (32)

pkg/constants/constants.go (1)

6-6: LGTM! The new constant follows the established naming pattern.

The new separation label is consistent with existing domain separation constants and supports the verifiable payer feature.

pkg/indexer/storer/groupMessage_test.go (1)

52-52: LGTM! Test updates align with function signature changes.

The removal of the targetOriginator parameter from CreateGroupMessageClientEnvelope calls reflects the updated function signature while maintaining test coverage.

Also applies to: 95-95

pkg/indexer/e2e_test.go (1)

65-67: LGTM! Variable rename improves code clarity.

The change from topic to msgTopic makes the variable's purpose clearer, and the function call updates align with the new signature.

Also applies to: 80-80, 94-94

pkg/envelopes/payer.go (2)

29-34: LGTM! Proper initialization of the TargetOriginator field.

The field is correctly initialized from the proto message, maintaining consistency between the struct and protobuf representation.

54-54: LGTM! Enhanced security with TargetOriginator in signature hash.

Including TargetOriginator in the hash computation ensures the signature binds the envelope to the intended node, preventing payload stealing as intended in issue #474.

pkg/api/payer/publish_test.go (2)

184-186: LGTM! Mock setup aligns with PR objectives.

The mock setup correctly returns a node with ID 100, which is used to verify the target originator in the payer envelope.

211-212: LGTM! Verification of target originator.

The test now properly verifies that the target originator in the payer envelope matches the expected node ID, which aligns with the PR objective of preventing node payload theft.

pkg/server/server_test.go (2)

163-164: LGTM! Node ID handling in test cases.

The test cases now correctly pass node IDs when creating payer envelopes, which is essential for the new security feature.

Also applies to: 178-179

272-273: LGTM! Simplified envelope structure.

The test case correctly removes the redundant target originator from the authenticated data structure while maintaining essential fields.

pkg/utils/hash.go (2)

9-12: LGTM! Enhanced security with originator ID in hash calculation.

The function now includes the originator ID in the hash calculation, which is crucial for preventing node payload theft. The use of binary.BigEndian for uint32 conversion is correct and consistent with network byte order standards.

13-15: LGTM! Proper domain separation in hash calculation.

The hash calculation correctly uses domain separation labels and includes the target originator bytes in the appropriate order.

pkg/utils/signature.go (1)

9-14: LGTM! Enhanced signature security with originator ID.

The signature function now correctly incorporates the originator ID in the hash calculation, which is essential for the security enhancement described in issue #474.

pkg/envelopes/envelopes_test.go (3)

22-22: LGTM!

The test correctly validates the originator node ID in the payer envelope, which aligns with the PR's objective of preventing node payload stealing.

51-51: LGTM!

The test ensures that the envelope serialization works correctly with the new originator node ID field.

141-146: LGTM!

The test thoroughly validates the signer recovery functionality with the new originator node ID field, including both valid and invalid signature scenarios.

Also applies to: 164-164

pkg/api/payer/nodeSelector_test.go (1)

230-264: LGTM!

The test comprehensively validates the node selection logic with banned nodes, ensuring that:

Initial node selection works correctly

Alternative nodes are selected when preferred nodes are banned

An error is returned when all nodes are banned

This aligns with the PR's objective of allowing the payer to select different nodes if the connection to the preferred node fails.

🧰 Tools

🪛 GitHub Check: Lint-Go

[failure] 262-262:
ineffectual assignment to reselectedNode (ineffassign)

pkg/testutils/envelopes/envelopes.go (4)

16-16: LGTM!

Good refactoring to extract the hardcoded value into a named constant.

32-32: LGTM!

Good use of the new constant to improve code maintainability.

84-107: LGTM!

The function has been properly enhanced to:

Accept a node ID parameter

Generate a valid signature for testing

Set the target originator field

This aligns with the PR's objective of preventing node payload stealing.

117-117: LGTM!

The functions correctly pass the originator node ID to maintain consistency in test envelope creation.

Also applies to: 142-142

pkg/api/payer/service.go (2)

79-79: No concerns about switching to the new retry method.
Invoking publishToNodeWithRetry here aligns with the updated retry logic and looks correct.

335-338: Confirm that ignoring the node cursor error is intended.
By only logging this error rather than returning it, the method could continue operating with incomplete data. Ensure this decision aligns with your reliability/availability requirements.

pkg/api/message/publish_test.go (9)

25-28: Creation of payer envelope with default node ID looks fine.
No issues spotted with this usage.

76-79: Payer envelope generation is consistent.
Properly matches the new signature usage.

90-112: Verify that ignoring mismatching AAD originator is acceptable.
This test confirms mismatching AAD originator is allowed. Double-check that your security model permits ignoring such mismatches without side effects.

114-130: Good coverage for mismatching originator scenario.
This test ensures the system properly rejects conflicting originator values.

142-146: Envelope creation for missing topic tests is correct.
No concerns with parameter usage here.

153-196: Key package validation success path appears consistent.
The changes comply with the new node ID handling.

202-237: Key package validation failure path is well-handled.
The updated usage of the node ID parameter is consistent with the rest of the code.

243-253: Usage of the helper with cursor validation is straightforward.
No issues noted with the updated node ID parameter.

284-293: Correctly tests scenario with an unknown originator.
Ensures the function returns an error as expected.

pkg/api/payer/nodeSelector.go (1)

13-14: Adding a banlist parameter to the interface is a sensible approach.
It provides flexibility to exclude undesirable nodes.

neekolas · 2025-02-24T21:31:33Z

pkg/api/payer/nodeSelector.go

@@ -10,7 +10,7 @@ import (
 )

 type NodeSelectorAlgorithm interface {
-	GetNode(topic topic.Topic) (uint32, error)
+	GetNode(topic topic.Topic, banlist ...[]uint32) (uint32, error)


Makes sense

mkysel added 5 commits February 20, 2025 14:38

Verifyiable payer

7ff3baf

lots of good stuff

59246af

verify signatures

eaf8c8d

more goodies

806793b

new protos

6914cf6

mkysel requested a review from a team as a code owner February 24, 2025 15:53

coderabbitai bot reviewed Feb 24, 2025

View reviewed changes

linter

16f0c05

neekolas reviewed Feb 24, 2025

View reviewed changes

neekolas approved these changes Feb 24, 2025

View reviewed changes

mkysel merged commit 63fe167 into main Feb 24, 2025
7 checks passed

mkysel deleted the mkysel/verifiable-payer branch February 24, 2025 21:36

coderabbitai bot mentioned this pull request Feb 25, 2025

Change payer envelope to bytes field #553

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verifiable payer #544

Verifiable payer #544

mkysel commented Feb 24, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 24, 2025 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

coderabbitai bot left a comment

neekolas Feb 24, 2025

Verifiable payer #544

Verifiable payer #544

Conversation

mkysel commented Feb 24, 2025 • edited by coderabbitai bot Loading

Summary by CodeRabbit

coderabbitai bot commented Feb 24, 2025 • edited Loading

Walkthrough

Changes

Sequence Diagram(s)

Possibly related PRs

Suggested reviewers

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

coderabbitai bot left a comment

Choose a reason for hiding this comment

neekolas Feb 24, 2025

Choose a reason for hiding this comment

mkysel commented Feb 24, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 24, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)