Skip to content

fix serve_multiplexed_model_id and mm data process#120

Merged
Yunnglin merged 13 commits intomainfrom
fix_request
Mar 24, 2026
Merged

fix serve_multiplexed_model_id and mm data process#120
Yunnglin merged 13 commits intomainfrom
fix_request

Conversation

@Yunnglin
Copy link
Collaborator

@Yunnglin Yunnglin commented Mar 19, 2026

PR type

  • Bug Fix
  • New Feature
  • Document Updates
  • More Models or Datasets Support

PR information

This pull request introduces a new client script for multimodal training with Tinker, updates existing client examples, and significantly refactors the Twinkle server's resource lifecycle management to be asynchronous. This refactoring involves converting numerous synchronous methods for session, model, adapter, and processor management to async/await patterns, consolidating common logic into a new SessionResourceMixin, and updating all corresponding calls throughout the server and client components. A patch for Ray Serve is also included to correctly handle HTTP header normalization for multiplexed model IDs. A review comment suggests refactoring duplicated logic for packing multimodal tensors into loss_fn_inputs within the input_feature_to_datum function for improved maintainability.

Experiment results

Paste your experiment result here(if needed).

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants