Fully configure frame processors when they are used directly on an audio stream by 1egoman · Pull Request #679 · livekit/python-sdks

1egoman · 2026-05-20T17:44:02Z

Updates the Python SDK so that FrameProcessor-based noise cancellation providers can be used directly on an AudioStream, without having to go through the agent's RoomIO to initialize themselves with credentials.

For example, with this change, something like the below becomes possible:

stream = rtc.AudioStream.from_track(
    track=track,
    sample_rate=SAMPLE_RATE,
    num_channels=CHANNELS,
    noise_cancellation=ai_coustics.audio_enhancement(model=ai_coustics.EnhancerModel.QUAIL_VF_L),
)

How this works: tracks now keep track of which room they are part of (holding a weakref to it). When the room a track is in changes, the track computes new frame processor options and sends them to any AudioStreams associated with it.
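The mechanism described above can be sketched roughly as follows. This is a minimal illustration and not the code from this PR: the `Room`, `Track` and `AudioStream` classes here are simplified stand-ins, and `attach_stream`, `_compute_options`, `_push_processor_options` and the option keys are hypothetical names chosen for the example. Only `_set_room` and the use of a weak reference come from the discussion.

```python
import weakref
from typing import List, Optional


class Room:
    """Hypothetical stand-in for a connected room."""

    def __init__(self, name: str, url: str, token: str) -> None:
        self.name = name
        self.url = url
        self.token = token


class AudioStream:
    """Hypothetical stream that records every options push it receives."""

    def __init__(self) -> None:
        self.received: List[Optional[dict]] = []

    def _push_processor_options(self, options: Optional[dict]) -> None:
        self.received.append(options)


class Track:
    def __init__(self) -> None:
        # Weak reference, so the track never keeps its room alive.
        self._room_ref: Optional[weakref.ref] = None
        self._streams: List[AudioStream] = []

    def _current_room(self) -> Optional[Room]:
        return self._room_ref() if self._room_ref is not None else None

    def _compute_options(self) -> Optional[dict]:
        room = self._current_room()
        if room is None:
            return None
        return {"room_name": room.name, "url": room.url, "token": room.token}

    def attach_stream(self, stream: AudioStream) -> None:
        self._streams.append(stream)
        options = self._compute_options()
        if options is not None:  # only push when a room is actually set
            stream._push_processor_options(options)

    def _set_room(self, room: Optional[Room]) -> None:
        if room is self._current_room():
            return  # unchanged room: nothing to recompute or push
        self._room_ref = weakref.ref(room) if room is not None else None
        options = self._compute_options()
        for stream in self._streams:
            stream._push_processor_options(options)


room = Room("demo", "wss://example.invalid", "tok")
track = Track()
stream = AudioStream()
track.attach_stream(stream)
track._set_room(room)   # stream receives the new options
track._set_room(None)   # stream receives None, i.e. "cleared"
```

Skipping the push when the room is unchanged is an assumption made here to keep repeated calls harmless; the later commits in this PR make `track._set_room(None)` idempotent in a similar spirit.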

The noise_cancellation_leave_open parameter allows the agents sdk to call this from_track method with a frame processor which remains open across the whole session, and won't be auto-closed when the track is closed.

This goes along with livekit/agents#5867, which removes the relevant event handling logic in the agents sdk. I will follow up with a node version of this once the python one is in a good state.

Todo

Add some tests for this newly added behavior

theomonnom · 2026-05-27T22:46:43Z

        num_channels: int = 1,
        frame_size_ms: int | None = None,
        noise_cancellation: Optional[NoiseCancellationOptions | FrameProcessor[AudioFrame]] = None,
+        noise_cancellation_leave_open: bool = False,


Suggested change

noise_cancellation_leave_open: bool = False,

Can we move that inside NoiseCancellationOptions?

Unfortunately, no - this is important to the FrameProcessor[AudioFrame] side of that noise_cancellation union. Open to putting it somewhere else but it needs to be settable in the FrameProcessor path.

hmm, not sure if it's a good idea, but could it be a field on the FrameProcessor interface instead?

Then we could add it to NoiseCancellationOptions and new FrameProcessors would be able to set it on the processor itself

It's not a setting that a frame processor would always want to have set or not have set, so I'm not sure that would really make sense either.

For context, this flag exists so the agents sdk can reuse a single FrameProcessor across multiple underlying tracks. Previously this wasn't a problem, because the agents sdk was responsible for closing the FrameProcessor and could easily do so at room disconnection time. But to support using FrameProcessors directly on an AudioStream, the close call needs to be pushed down deeper than the agents sdk layer. This flag allows the caller to explicitly tell AudioStream that they will manage cleaning up the FrameProcessor, so that both use cases can continue to work.
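To make the ownership question concrete, here is a small sketch of the pattern being described. It assumes a simplified `FrameProcessor` with a plain `close()` method and an `AudioStream` that closes its processor in `aclose()`; both classes are hypothetical stand-ins, not the SDK's real ones. Only the flag name `noise_cancellation_leave_open` is taken from the diff under review.

```python
import asyncio
from typing import Optional, Tuple


class FrameProcessor:
    """Hypothetical processor that only remembers whether it was closed."""

    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True


class AudioStream:
    """Hypothetical stream that may or may not own its processor."""

    def __init__(
        self,
        noise_cancellation: Optional[FrameProcessor] = None,
        noise_cancellation_leave_open: bool = False,
    ) -> None:
        self._processor = noise_cancellation
        self._leave_open = noise_cancellation_leave_open

    async def aclose(self) -> None:
        # Close the processor only when the stream is responsible for it.
        if self._processor is not None and not self._leave_open:
            self._processor.close()


async def demo() -> Tuple[bool, bool]:
    # Direct use: the stream owns the processor and closes it.
    owned = FrameProcessor()
    await AudioStream(noise_cancellation=owned).aclose()

    # Agents-style use: one processor is shared across several streams,
    # and the caller closes it later (for example at room disconnection).
    shared = FrameProcessor()
    for _ in range(2):
        stream = AudioStream(
            noise_cancellation=shared,
            noise_cancellation_leave_open=True,
        )
        await stream.aclose()
    return owned.closed, shared.closed


result = asyncio.run(demo())
```

With the flag left at its default the stream cleans up after itself, while a caller that sets it keeps the processor usable across streams.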

I think this flag is not really configuring the noise suppression behavior, but rather how AudioStream deals with its own noise suppression. Maybe the naming of noise_cancellation_leave_open is a bit confusing?

How about close_noise_cancellation_on_stream_close or manage_noise_cancellation_processor?

It's not a setting that a frame processor would always want to have set or not have set

it could stay undefined by default? 🤷
I understand however that it feels a bit weird for it to live on the processor if the processor itself doesn't really use the field.

We also briefly discussed the option of introducing a restart method on the processor. I think this could still be a viable alternative?

We also briefly discussed the option of introducing a restart method on the processor. I think this could still be a viable alternative?

It could, but the downside is that it's a breaking API change to FrameProcessor.

Just generally, I want to understand what folks' concerns are in more detail. Is it just the noise_cancellation_ prefix naming like shijing suggested (I think out of the two suggestions, I like manage_noise_cancellation_processor better)? Or is there something deeper behavior wise that is concerning?

FWIW, two fairly similar patterns I found:

LiveKitAPI conditionally controls aiohttp.ClientSession cleanup here based on whether the user passes a custom session or uses an inbuilt session.

The LocalAudioTrack has a userProvidedTrack parameter which is used to control whether the track is cleaned up or not here.

Talked to @lukasIO in a 1:1 and he confirmed his concern was mostly with the naming, not with the broad approach, which is helpful.

A few other name ideas, in addition to shijing's suggestions (close_noise_cancellation_on_stream_close / manage_noise_cancellation_processor) - some of these would involve flipping the flag:

shared_noise_cancellation

noise_cancellation_externally_managed

auto_close_noise_cancellation

owns_noise_cancellation

Out of the above, I think I like auto_close_noise_cancellation the best:

# Usage within agents sdk:
AudioStream.from_track(
    # ...
    noise_cancellation=frame_processor,
    auto_close_noise_cancellation=False,
)

I'm going to update the pull request to use it for now in 8d5e656.

Another possible idea: maybe something like the below could be a different way to package the same data which could better contain it. In a world like this, noise_cancellation would be of type Union[NoiseCancellationOptions, FrameProcessorOptions, FrameProcessor]:

AudioStream.from_track(
    # ...
    noise_cancellation=FrameProcessorOptions(frame_processor=self, leave_open=True)
)
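As a rough sketch of what that packaging might look like: `FrameProcessorOptions` with `frame_processor` and `leave_open` fields comes from the idea above, but the `resolve` helper and the stand-in `FrameProcessor` class are hypothetical, and the `NoiseCancellationOptions` branch of the union is left out for brevity.

```python
from dataclasses import dataclass
from typing import Tuple, Union


class FrameProcessor:
    """Hypothetical stand-in for a frame processor."""


@dataclass
class FrameProcessorOptions:
    # Bundles the processor with the lifecycle choice that applies to it.
    frame_processor: FrameProcessor
    leave_open: bool = False


def resolve(
    noise_cancellation: Union[FrameProcessorOptions, FrameProcessor],
) -> Tuple[FrameProcessor, bool]:
    """Return (processor, leave_open) for either accepted form."""
    if isinstance(noise_cancellation, FrameProcessorOptions):
        return noise_cancellation.frame_processor, noise_cancellation.leave_open
    # A bare processor keeps the default: the stream closes it.
    return noise_cancellation, False


processor = FrameProcessor()
bare = resolve(processor)
wrapped = resolve(FrameProcessorOptions(frame_processor=processor, leave_open=True))
```

The trade-off is one more public type in exchange for keeping the lifecycle setting next to the processor it applies to, rather than as a separate keyword argument on from_track.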

Do any of these ideas look better than the current state?

@theomonnom @xianshijing-lk Lukas gave a general 👍 to the rename to auto_close_noise_cancellation as addressing his concern. Do either of you have further concerns beyond what this rename addresses?

If not, or if I don't hear anything in the next few days, I think I am good to merge this.


lukasIO · 2026-06-24T14:19:37Z

+
    def _on_credentials_updated(self, *, token: str, url: str) -> None: ...

+    def _on_credentials_cleared(self) -> None: ...


question(non-blocking): do you foresee any scenario where on_credentials_cleared and on_stream_info_cleared would get called independently from one another?

I don't have a good suggestion for a unified method name, just raising this in case you can think of one that would be obvious and would allow fewer methods to be implemented for each processor.

Yes, one example (at least in theory) where they could be called at different times: a frame processor is detached from a track, kept around connected to the room for a period of time, then reattached to another track in that same room.
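A small sketch of that scenario, using a hypothetical processor that simply records its state. The `_on_credentials_updated` and `_on_credentials_cleared` signatures match the diff above; `_on_stream_info_cleared` is named in this thread, and the keyword arguments given to `_on_stream_info_updated` here are assumed purely for illustration.

```python
from typing import Optional, Tuple


class RecordingProcessor:
    """Hypothetical processor that stores whatever it was last told."""

    def __init__(self) -> None:
        self.credentials: Optional[Tuple[str, str]] = None
        self.stream_info: Optional[dict] = None

    def _on_credentials_updated(self, *, token: str, url: str) -> None:
        self.credentials = (token, url)

    def _on_credentials_cleared(self) -> None:
        self.credentials = None

    # The keyword arguments below are an assumption for this example.
    def _on_stream_info_updated(self, *, room_name: str, track_sid: str) -> None:
        self.stream_info = {"room_name": room_name, "track_sid": track_sid}

    def _on_stream_info_cleared(self) -> None:
        self.stream_info = None


processor = RecordingProcessor()
processor._on_credentials_updated(token="tok", url="wss://example.invalid")
processor._on_stream_info_updated(room_name="demo", track_sid="TR_a")

# Detached from its track but still connected to the room:
# only the stream info is cleared, the credentials stay valid.
processor._on_stream_info_cleared()
detached_state = (processor.credentials, processor.stream_info)

# Reattached to a different track in the same room.
processor._on_stream_info_updated(room_name="demo", track_sid="TR_b")
```

Keeping the two clear callbacks separate lets the processor hold on to its credentials while it waits for the next track.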

1egoman · 2026-06-24T16:28:20Z

Did a little bit of testing locally along with livekit/agents#5867, and this looks to work in all the easy-to-test happy path cases. That, along with the test coverage of the harder-to-test non-happy-path cases, makes me fairly confident in this now!

Also, in case anyone is curious: I have also tested the current main of the agents repo (451bae4f7) with this loaded and everything works. The only difference is that in some cases callbacks like _on_credentials_updated are called twice (once by the agents code, once by the code in this PR) instead of once.

1egoman force-pushed the frame-processor-on-audio-stream branch from 3e5a9ab to f62c247 Compare May 26, 2026 15:15

1egoman commented May 26, 2026

Comment thread livekit-rtc/livekit/rtc/track.py Outdated

1egoman marked this pull request as ready for review May 26, 2026 21:25

1egoman requested review from cloudwebrtc, lukasIO and xianshijing-lk as code owners May 26, 2026 21:25

1egoman force-pushed the frame-processor-on-audio-stream branch from 564b2c7 to 8d3f4fe Compare May 27, 2026 17:02

1egoman mentioned this pull request May 27, 2026

Move frame processor url/token/stream info to client sdk livekit/agents#5867

Merged

3 tasks

theomonnom reviewed May 27, 2026

1egoman mentioned this pull request May 29, 2026

Add initial support for frame processor usage directly on tracks livekit/node-sdks#671

Open

1 task

lukasIO approved these changes Jun 2, 2026

1egoman added 18 commits June 8, 2026 12:23

feat: add MVP of propagating room downwards from room -> track -> audio stream

27c841a

And extracting metadata from that room that can be fed into the frame processor.

feat: call _on_stream_info_updated with parent room reference on audio_stream

15fe1b4

feat: call _on_credentials_updated with token / server url extracted from room

56e4b95

fix: remove debugging logs

4fdef63

fix: address lint errors

5a55617

feat: only call frame processor handlers if room is set

83dada3

fix: properly intercept room refresh token events

2b96668

feat: add from __future__ import annotations to remove string types

46f11dd

fix: address incorrect docs

4fc15ea

refactor: centralize frame processor state logic into Track, not AudioStream

8bc8953

This makes it less complex.

feat: add auto cleanup of FrameProcessor as opt-out

dedf686

The agents sdk can pass this opt-out flag so that it can reuse the frame processor across many audio tracks

fix: disable no-op credentials push

1699ad4

Need to think about this a bit more, this pattern as written won't work, since the FrameProcessor today can't have a set of no-op credentials pushed.

fix: move processor close from __del__ to aclose

277b634

fix: proxy throgh noise_cancellation_leave_open into AudioStream.from_track

6bcdd5b

fix: include missed noise_cancellation_leave_open in from_track

a8d3879

fix: address type checker warning

4c7a95e

feat: add new _on_stream_info_cleared / _on_credentials_cleared FrameProcessor methods, and use them when moving a track out of a room

6a2c5f2

fix: apply devin suggestion

d93f881

1egoman added 3 commits June 8, 2026 12:24

feat: add new frame processor tests

2c9d8de

These tests exercise all the frame processor track reparenting under room / etc paths.

fix: address type errors in tests

ea1cde0

fix: rename noise_cancellation_leave_open -> auto_close_noise_cancellation

8e3a461

1egoman force-pushed the frame-processor-on-audio-stream branch from 8d5e656 to 8e3a461 Compare June 8, 2026 16:27

fix: address incorrect default value for auto_close_noise_cancellation

ea1de09

1egoman added 3 commits June 8, 2026 13:06

fix: ensure that audio stream metadata is updated properly on room full reconnect

cb47aec

When the room does a full reconnect, make sure the audiostream metadata gets a new push with the updated track sid

fix: ensure that audio stream room reference is cleared when track is unpublished

b9a9c15

fix: use proper ffi objects in test instead of types.SimpleNamespace mock

92e334e

1egoman commented Jun 8, 2026

Comment thread livekit-rtc/livekit/rtc/room.py

1egoman added 2 commits June 18, 2026 15:55

fix: ensure that track._set_room(None) is idempotent, and only calls handlers once

e8b12a4

Previously, calling track._set_room(None) twice would call the handlers twice.

fix: reset unpublished._track = None on track unpublish

4fe5018

fix: ensure unpublished event is backwards compatible

adc16d3

Set _track to None AFTER the unpublished event is fired to be 100% backwards compatible

1egoman requested a review from lukasIO June 23, 2026 15:45

lukasIO approved these changes Jun 24, 2026

1egoman merged commit 6893e13 into main Jun 24, 2026
34 checks passed

1egoman deleted the frame-processor-on-audio-stream branch June 24, 2026 16:28

1egoman added a commit to livekit/node-sdks that referenced this pull request Jun 24, 2026

feat: port in updates from livekit/python-sdks#679

4a5494c

1egoman added a commit to livekit/node-sdks that referenced this pull request Jun 24, 2026

feat: port in updates from livekit/python-sdks#679

479f8bc

rosetta-livekit-bot Bot mentioned this pull request Jun 25, 2026

fix(agents): stop forwarding frame processor info livekit/agents-js#1883

Closed

3 tasks


Conversation

1egoman commented May 20, 2026 (edited)
theomonnom May 27, 2026
1egoman May 28, 2026 (edited)
lukasIO May 29, 2026
1egoman May 29, 2026 (edited)
xianshijing-lk May 29, 2026
lukasIO Jun 1, 2026
1egoman Jun 1, 2026 (edited)
1egoman Jun 1, 2026 (edited)
1egoman Jun 2, 2026
lukasIO Jun 24, 2026
1egoman Jun 24, 2026 (edited)
1egoman commented Jun 24, 2026 (edited)

4 participants
