@HyperExtendedReality commented Jan 15, 2026

  • Add automatic detection and default to `bfloat16` (or `fp16` fallback) when no explicit dtype is provided, based on device capabilities (see the sketch after this comment)
  • Respect the provided `dtype_llama`/`dtype` consistently across the Gemma model, the projection layer, and the connectors
  • Remove the forced `out.float()` in `encode_token_weights` to prevent downgrading to fp32 after projection
  • This allows SageAttention's optimized kernel to run instead of falling back to PyTorch attention

Fixes the warning:
"Error running sage attention: Input tensors must be in dtype of torch.float16 or torch.bfloat16, using pytorch attention instead."

@comfy-pr-bot (Member)

Test Evidence Check

⚠️ Warning: Test Explanation Missing

If this PR modifies behavior that requires testing, a test explanation is required. PRs lacking an applicable test explanation may not be reviewed until one is added. Please add test explanations to ensure code quality and prevent regressions.

⚠️ Warning: Visual Documentation Missing

If this PR changes user-facing behavior, visual proof (a screen recording or screenshot) is required. PRs without applicable visual documentation may not be reviewed until it is provided.

You can add it by:

  • GitHub: Drag & drop media directly into the PR description
  • YouTube: Include a link to a short demo
