modify encode_video() / encode_audio() API design #58

awkrail · 2025-05-18T10:04:55Z

This PR is a work in progress. It modifies the encode_video and encode_audio API design because the current code stores video_feats/mask on the Predictor, which causes bugs in the gradio demo. We make the models stateless, which makes them easier to use.
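
To illustrate what "stateless" means here, the following is a minimal sketch and not the actual lighthouse code. The names `VideoInputs`, `StatelessPredictor`, and the `predict()` signature are assumptions made up for this example, and the features are faked so the snippet runs on its own.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class VideoInputs:
    """Hypothetical container for what encode_video() returns."""
    video_feats: List[List[float]]
    video_mask: List[int]


class StatelessPredictor:
    """Illustrative stand-in; not the actual lighthouse Predictor."""

    def encode_video(self, video_path: str) -> VideoInputs:
        # A real implementation would run a feature extractor here.
        # Two fake clip features keep the sketch runnable.
        feats = [[0.1, 0.2], [0.3, 0.4]]
        return VideoInputs(video_feats=feats, video_mask=[1] * len(feats))

    def predict(self, query: str, inputs: VideoInputs) -> Dict[str, object]:
        # Everything needed for inference arrives as an argument,
        # so nothing is stored on self between calls.
        return {"query": query, "num_clips": sum(inputs.video_mask)}


predictor = StatelessPredictor()
inputs = predictor.encode_video("example.mp4")
result = predictor.predict("a person is cooking", inputs)
```

Because the caller holds `inputs`, two users of the same predictor object cannot overwrite each other's features, which is the kind of problem a shared demo process can run into.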

awkrail · 2025-05-19T04:57:56Z

@h-munakata Could you review this PR?

h-munakata · 2025-05-19T09:48:53Z

I found some unexpected behavior.
When we switch models and the new model is still extracting video features (the demo shows "Processing the video. Wait for a minute..."), pressing the retrieve moments button raises AttributeError: 'XXPredictor' object has no attribute 'inputs'.
I think this happens because the model object does not have inputs until model.encode_video() has run.
Therefore, error handling is required for the case where the button is pressed immediately after the model is loaded and the model does not yet hold inputs.
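
One possible shape for that guard, as a sketch only: the handler name `retrieve_moments` and the dictionary layout of `inputs` are hypothetical, and the gradio wiring is left out so the snippet stays self-contained.

```python
from typing import Any, Dict, Optional


def retrieve_moments(query: str, inputs: Optional[Dict[str, Any]]) -> str:
    """Hypothetical handler for the retrieve moments button."""
    if inputs is None:
        # Feature extraction has not finished (or never started),
        # so report that instead of raising AttributeError.
        return "Video features are not ready yet. Please wait and try again."
    num_clips = len(inputs["video_feats"])
    return f"Retrieving moments for '{query}' over {num_clips} clips"


message_before = retrieve_moments("a dog runs", None)
message_after = retrieve_moments("a dog runs", {"video_feats": [[0.0], [1.0]]})
```

Passing the encoded inputs in explicitly (rather than reading them from the model object) lets the handler check for the missing case up front.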

In addition, I found a line that should be fixed: [Link](https://github.com/awkrail/lighthouse/blob/5b5d14e5394e5615b63b7e8e3d5820fa3f7da757/lighthouse/common/tr_detr_transformer.py#L141).
This line requires CUDA, so inference does not work without a GPU.
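
A common way to avoid a hard CUDA dependency is to allocate new tensors on the device of an existing tensor. The sketch below assumes the linked line creates or moves a tensor with a hard-coded CUDA call, which is not confirmed here; `make_mask` and the tensor shapes are hypothetical and only show the pattern.

```python
import torch


def make_mask(feats: torch.Tensor) -> torch.Tensor:
    # Allocate on the same device as the input instead of calling .cuda(),
    # so the same code path works on CPU-only machines.
    return torch.ones(feats.shape[:2], dtype=torch.bool, device=feats.device)


# Fall back to CPU when no GPU is available.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
feats = torch.zeros(1, 4, 8, device=device)
mask = make_mask(feats)
```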

awkrail added 2 commits May 18, 2025 19:03

[WIP] modify encode_video API design

a76af56

fix APIs

0f893ea

awkrail changed the title ~~[WIP] modify encode_video API design~~ [WIP] modify encode_video() / encode_audio() API design May 19, 2025

awkrail added 2 commits May 19, 2025 12:06

fix mypy test

b67e050

fix gradio demo

5b5d14e

awkrail changed the title ~~[WIP] modify encode_video() / encode_audio() API design~~ modify encode_video() / encode_audio() API design May 19, 2025
