feat: Implement ReadN #276
base: main
Conversation
```go
case rec := <-i.records:
	recs = append(recs, rec)
}
```
I tried out a couple of things with channels in this gist that might help here too. The test sends a number of objects from one goroutine to another through a channel.
The sending goroutine sends 20M objects in batches of different sizes (1-100k). The channel shared by the sending and receiving goroutines is either buffered or unbuffered.
On my machine, sending those 20M objects in batches of 1 record over an unbuffered channel takes around 5.5 seconds.
With the batch size still set to 1 and a buffered channel of size 50, that drops to 1.8 seconds (about 3x faster).
The biggest improvement, though, came from setting the batch size to 1k or 10k (whether the channel was buffered didn't matter much): the result was around 80ms.
With that, I think it's worth trying to send the data from the CDC and snapshot iterators in batches.
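For reference, a rough, self-contained version of that kind of comparison (not the actual gist; the `record` type, counts, and buffer size here are illustrative):

```go
package main

import (
	"fmt"
	"time"
)

type record struct{ id int }

// sendBatched pushes total records through ch in slices of batchSize, so the
// receiver pays the channel synchronization cost once per batch instead of
// once per record.
func sendBatched(ch chan []record, total, batchSize int) {
	batch := make([]record, 0, batchSize)
	for i := 0; i < total; i++ {
		batch = append(batch, record{id: i})
		if len(batch) == batchSize {
			ch <- batch
			batch = make([]record, 0, batchSize)
		}
	}
	if len(batch) > 0 {
		ch <- batch
	}
	close(ch)
}

func main() {
	const total = 1_000_000 // smaller than the 20M in the gist, same idea

	for _, batchSize := range []int{1, 1_000, 10_000} {
		ch := make(chan []record, 50) // buffered channel shared by both goroutines
		start := time.Now()
		go sendBatched(ch, total, batchSize)

		received := 0
		for batch := range ch {
			received += len(batch)
		}
		fmt.Printf("batch=%-6d received=%d in %v\n", batchSize, received, time.Since(start))
	}
}
```

The point is only that the per-send synchronization cost dominates at batch size 1, which is why batching before the channel pays off.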
This is what I meant by iterators sending data in batches: https://github.com/ConduitIO/conduit-connector-postgres/compare/read-n...haris/read-n-batches?expand=1
Good point. We should collect records as early as we can, to cut down on the amount of code that processes records one by one.
> Good point. We should collect records as early as we can, to cut down on the amount of code that processes records one by one.
This may work OK without a timer. "Collecting" normally needs two parameters (a batch size and a time interval), and both requirements can be satisfied here. There's a side effect we can use: Postgres sends keep-alive messages so often that a dedicated timer may not be required, and we can measure elapsed time based on the keep-alive messages. Something like:
```
1. I want 3 messages
2. recv message -> record (n++)
3. n < 3
4. recv message -> keepalive (t < waitFor) -> continue
5. recv message -> record (n++)
6. n < 3
7. recv message -> keepalive (t >= waitFor) -> return 2 messages
```
This is mostly implementation detail.
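For illustration only, a rough sketch of that loop; the `message` type, its fields, and the channel are stand-ins I'm assuming here, not the iterator's actual types:

```go
import (
	"context"
	"time"

	"github.com/conduitio/conduit-commons/opencdc"
)

// message is a hypothetical stand-in for what the CDC iterator receives from
// the replication connection (records and keep-alives).
type message struct {
	keepAlive bool
	rec       opencdc.Record
}

// readBatch collects up to n records from msgs, using keep-alive messages as
// the clock: once waitFor has elapsed, whatever was collected so far is
// returned, even if it's fewer than n records.
func readBatch(ctx context.Context, msgs <-chan message, n int, waitFor time.Duration) ([]opencdc.Record, error) {
	deadline := time.Now().Add(waitFor)
	recs := make([]opencdc.Record, 0, n)

	for len(recs) < n {
		select {
		case <-ctx.Done():
			return recs, ctx.Err()
		case msg := <-msgs:
			if msg.keepAlive {
				// Keep-alives arrive often enough to act as a timer: if we've
				// waited long enough, return the partial batch.
				if time.Now().After(deadline) {
					return recs, nil
				}
				continue
			}
			recs = append(recs, msg.rec)
		}
	}
	return recs, nil
}
```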
```go
return nil, fmt.Errorf("n must be greater than 0, got %d", n)
}

var records []opencdc.Record
```
There's an optimization we can do here by pre-allocating the slice; otherwise we're creating more allocations than we should, especially for big batches.
Now, what size should we pick? An easy solution would be to just allocate `n` elements and expect that we'll fill the slice up. Of course, that will waste space if we don't actually fill the slice up. So I'm thinking that we could store the last batch size in the iterator and use that to allocate the slice, if it's smaller than `n`. The assumption here is that we will mostly see batches of the same size. For instance, if records are steadily streaming and batches are filled, we'll consistently create slices of the full batch size. Then, if the stream dies down and we don't have a lot to process, the batch size will decrease and we'll allocate smaller slices.
```diff
- var records []opencdc.Record
+ records := make([]opencdc.Record, 0, min(i.lastBatchSize, n))
+ ...
+ // before returning the records, store the batch size
+ i.lastBatchSize = len(records)
+ return records, nil
```
Or do you see a better strategy?
> Of course, that will waste space if we don't actually fill the slice up.
How much space would actually be wasted?
There's another thing that plays a role, and that's the `snapshot.fetchSize` parameter (which I think can probably be removed, because we have `sdk.batch.size`). While the iterator is active, we can always expect the database to return that number of records (except for the last fetch).
@hariso good point. I've actually been confused by the two whenever I used this connector as a source. This change would require deprecating the field and using only the SDK parameter, but I'll see how to approach it in a way that doesn't increase the scope too much.
This is a bit more nuanced than the CDC iterator, because records arrive in batches of the fetch size defined in the config. As it stands, each record from that cursor fetch is fed one by one into the channel. If there is caching at that level, you can essentially just take N and not have to pre-allocate.
> There's another thing that plays a role, and that's the `snapshot.fetchSize` parameter (which I think can probably be removed, because we have `sdk.batch.size`). While the iterator is active, we can always expect the database to return that number of records (except for the last fetch).
@hariso I'm unclear about this. The `fetchSize` is the cursor fetch size, i.e. how many rows to get from Postgres at a time; it's primarily used to optimize the retrieval of rows. Removing it would result in a plain `select * from table`, which may or may not perform better. That's something we should try, but it's somewhat tangential to this PR.
@lyuboxa what I meant is to keep the `FETCH`, but not have a `fetchSize` parameter in the connector. Instead, we should do `FETCH` with `sdk.batch.size`.
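For illustration, assuming the snapshot keeps its server-side cursor, sizing each fetch with the `n` passed to `ReadN` could look roughly like this (the helper names and transaction handling are hypothetical, not the connector's actual code):

```go
import (
	"context"
	"fmt"

	"github.com/jackc/pgx/v5"
)

// declareCursor opens a cursor over the snapshot query; FETCH calls then pull
// rows in batches of whatever size Conduit passes to ReadN (sdk.batch.size),
// instead of a dedicated fetchSize parameter.
func declareCursor(ctx context.Context, tx pgx.Tx, cursorName, table string) error {
	query := fmt.Sprintf(
		"DECLARE %s CURSOR FOR SELECT * FROM %s",
		pgx.Identifier{cursorName}.Sanitize(),
		pgx.Identifier{table}.Sanitize(),
	)
	_, err := tx.Exec(ctx, query)
	return err
}

// fetchBatch reads up to n rows from the already-declared cursor.
func fetchBatch(ctx context.Context, tx pgx.Tx, cursorName string, n int) (pgx.Rows, error) {
	return tx.Query(ctx, fmt.Sprintf("FETCH %d FROM %s", n, pgx.Identifier{cursorName}.Sanitize()))
}
```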
@hariso I think this will be a bit confusing, since `sdk.batch.size` is a general config, while the fetch size is specific to the snapshot.
Why does it need to be specific? When I find `sdk.batch.size: 100` in a source connector's configuration, I expect the connector to read 100 records at a time, which is the fetch size in snapshots, no?
Because it does not apply to CDC. It's specific to Postgres, and it may become irrelevant if we decide to move to `select * from table`. ¯\_(ツ)_/¯
@hariso and I had a chat. The TL;DR is that `sdk.batch.size` is what Conduit passes to `ReadN`, and that may change with the configuration, so allocating a slice with capacity `n` based on the `ReadN` argument makes sense.

```go
records := make([]opencdc.Record, 0, sdkBatchSize)
```

The compiler has no idea how to allocate this on the stack, so it will go on the heap anyway. Appending to it should be fairly efficient, and you can either return the full slice or something like `records[:nread]`, but that doesn't matter; the slice will have the same capacity. You need to get that value from the source batch configuration in the config.

What @lovromazgon is suggesting will be needed if we are returning `[]opencdc.Record`, but I don't think we need it in this implementation.
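To tie the thread together, a minimal sketch of a `ReadN` that pre-allocates with the `n` it receives; the `Iterator` fields and the records channel are assumptions for the sketch, not the PR's actual implementation:

```go
import (
	"context"
	"fmt"

	"github.com/conduitio/conduit-commons/opencdc"
)

// Iterator's fields are assumed for this sketch.
type Iterator struct {
	records chan opencdc.Record
}

// ReadN blocks until at least one record is available, then opportunistically
// drains up to n records that are already buffered. The slice is allocated
// with capacity n, since that's the batch size Conduit passes in
// (sdk.batch.size).
func (i *Iterator) ReadN(ctx context.Context, n int) ([]opencdc.Record, error) {
	if n <= 0 {
		return nil, fmt.Errorf("n must be greater than 0, got %d", n)
	}

	recs := make([]opencdc.Record, 0, n)

	// Block until the first record arrives or the context is canceled.
	select {
	case <-ctx.Done():
		return nil, ctx.Err()
	case rec := <-i.records:
		recs = append(recs, rec)
	}

	// Drain whatever else is already buffered, without blocking, up to n.
	for len(recs) < n {
		select {
		case rec := <-i.records:
			recs = append(recs, rec)
		default:
			return recs, nil
		}
	}
	return recs, nil
}
```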
Description
Since the introduction of end-to-end batching in the Connector SDK (part of Connector SDK 0.12.0), source connectors can add support for batching by implementing the `ReadN` method. This wasn't a breaking change, since the SDK falls back to the traditional `Read` method, as can be seen in the logs when starting Conduit. Example:

```
2025-04-24T19:21:21+00:00 INF source does not support batch reads, falling back to single reads component=plugin connector_id=postgres-to-kafka:postgres-source plugin_name=builtin:postgres plugin_type=source
```

This pull request implements `ReadN` while still maintaining support for the `Read` method, for those who wish to use this connector with an older version of Conduit. This is probably not necessary considering Postgres is a built-in connector.

To really take advantage of this, it is recommended to run Conduit with the flag that enables the new upcoming architecture, `--preview.pipeline-arch-v2`. More information in this blog post.

Running benchmarks
Inserting 20M records using Benchi with CDC:

- `Read`: 142824.47 msg/s
- `ReadN`: 153107.18 msg/s (previously )

This is 7.2% better on CDC.
Quick checks: