How can we use it for streaming ASR, say for models like nemo parakeet or openai whisper? #2
programindz
started this conversation in
General
Replies: 1 comment
-
|
That's a good idea, thanks for the suggestion. I believe ASR models are often designed with streaming in mind because they are typically used in live applications:
Nonetheless, it would be valuable insight to tackle either of these models with torchstream, to see how it handles a totally different use case with the challenge of an output format (i.e. the transcript with timestamps) that is not sliding-window based. I'll give it a shot in the upcoming weeks, hopefully leading to a demo |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I would be really interested to see TorchStream being used in current SOTA models for ASR.
Beta Was this translation helpful? Give feedback.
All reactions