Fix missing property access for multimodal models #966

albertvillanova · 2025-12-04T14:28:01Z

Summary

This PR fixes access to missing attributes for multimodal models in src/liger_kernel/transformers/monkey_patch.py. The main change is to consistently access attributes (like language_model, vision_tower, and visual) through the submodel .model attribute of the parent model, rather than directly from the parent model itself.

This fixes AttributeError after this PR was merged in transformers:

🚨 Generalize get_decoder() for multimodal and delete redundant code 🔪 huggingface/transformers#42156

See associated issue in TRL:

CI fails with dev dependencies: AttributeError: 'Qwen2_5_VLForConditionalGeneration' object has no attribute 'language_model' huggingface/trl#4601

Fix #960.

Details

Fix: Consistent attribute access via .model

Updated all references to submodules such as language_model, vision_tower, and visual to use the .model attribute (e.g., model.model.language_model instead of model.language_model) across all kernel application functions for models including LLava, Mllama, Gemma3, PaliGemma, Qwen2 VL, Qwen2.5 VL, Qwen3 VL, Qwen3 VL MoE, GLM4V, GLM4V MoE, and InternVL.

Normalization and patching logic updates

Adjusted normalization and patching calls to operate on submodels accessed via .model, ensuring that layer normalization and RMS normalization are consistently applied to the correct components.

These changes make the codebase more maintainable and robust against future changes in model class implementations.

Testing Done

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

Tcc0403 · 2025-12-04T18:12:44Z

src/liger_kernel/transformers/monkey_patch.py

            # Note: language_model and visual properties can be accessed throught conditional class for BC.
            # Not sure if it is subject to changes in the future.
            # Reference: https://github.com/huggingface/transformers/blob/v4.52.4/src/transformers/models/qwen2_vl/modeling_qwen2_vl.py#L1698


Could you help me remove this comment? Thanks!

Tcc0403 · 2025-12-04T18:21:50Z

src/liger_kernel/transformers/monkey_patch.py

        # The model instance already exists, so we need to additionally patch the
        # instance variables that reference already-instantiated modules

        if isinstance(model, (Qwen2VLForConditionalGeneration, Qwen2VLModel)):


We also need to update this condition.

model.model.language for XXXForConditionalGeneration, model.language_model for XXXVLModel

Good catch!

src/liger_kernel/transformers/monkey_patch.py

Tcc0403 · 2025-12-05T15:59:35Z

There still exist some missing attribute error

MllamaForConditionalGeneration
Qwen2VLForConditionalGeneration
Qwen2_5_VLForConditionalGeneration
InternVLForConditionalGeneration
Glm4vForConditionalGeneration
Glm4vMoeForConditionalGeneration

FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_mllama_for_conditional_generation - AttributeError: 'MllamaForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_qwen2_vl_for_conditional_generation - AttributeError: 'Qwen2VLForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_qwen2_5_vl_for_conditional_generation - AttributeError: 'Qwen2_5_VLForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_internvl - AttributeError: 'InternVLForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_glm4v - AttributeError: 'Glm4vForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_glm4v_moe - AttributeError: 'Glm4vMoeForConditionalGeneration' object has no attribute 'language_model'

Similar errors are also listed in #960 (comment). It's just a reminder for myself, not necessarily have to fix all of them in this PR! We can focus on handling all language_model property in this PR.

Fix missing property access for multimodal models

d24880a

Tcc0403 requested changes Dec 4, 2025

View reviewed changes

albertvillanova added 8 commits December 5, 2025 09:42

Fix Qwen2VLModel

e976f4b

Fix Qwen2_5_VLModel

26f7c8d

Fix Glm4vModel

26843e0

Fix Glm4vMoeModel

1848e2f

Fix Qwen3VLModel

931575a

Fix Qwen3VLMoeModel

b85f91a

Fix InternVLModel

55f1235

Fix qwen2_5_vl vision_model

ea3f78c

albertvillanova commented Dec 5, 2025

View reviewed changes

src/liger_kernel/transformers/monkey_patch.py Show resolved Hide resolved

albertvillanova commented Dec 5, 2025

View reviewed changes

src/liger_kernel/transformers/monkey_patch.py Show resolved Hide resolved

albertvillanova commented Dec 5, 2025

View reviewed changes

src/liger_kernel/transformers/monkey_patch.py Show resolved Hide resolved

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix missing property access for multimodal models #966

Fix missing property access for multimodal models #966

Uh oh!

albertvillanova commented Dec 4, 2025

Uh oh!

Tcc0403 Dec 4, 2025

Uh oh!

albertvillanova Dec 5, 2025

Uh oh!

Tcc0403 Dec 4, 2025

Uh oh!

albertvillanova Dec 5, 2025

Uh oh!

albertvillanova Dec 5, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Tcc0403 commented Dec 5, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix missing property access for multimodal models #966

Are you sure you want to change the base?

Fix missing property access for multimodal models #966

Uh oh!

Conversation

albertvillanova commented Dec 4, 2025

Summary

Details

Testing Done

Uh oh!

Tcc0403 Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Tcc0403 Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Tcc0403 commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Tcc0403 commented Dec 5, 2025 •

edited

Loading