[megatron] megatron lora_llm support no_save_optim by Jintao-Huang · Pull Request #9269 · modelscope/ms-swift

Jintao-Huang · 2026-05-06T08:56:35Z

No description provided.

gemini-code-assist

Code Review

This pull request relaxes validation constraints for the lora_llm tuner type and updates the trainer to handle lora_llm similarly to lora during weight loading and checkpointing. Review feedback highlights a potential issue where non-LoRA trainable parameters in multimodal models might be excluded from checkpoints when peft_format is enabled. It is also suggested to default the tuner type to lora_llm for multimodal models when adapters are present.

gemini-code-assist · 2026-05-06T08:59:24Z

            self.opt_param_scheduler,
            iteration=iteration,
-            peft_format=args.tuner_type == 'lora',
+            peft_format=args.tuner_type in {'lora', 'lora_llm'},


Setting peft_format=True for lora_llm ensures that LoRA weights are saved, but it may exclude other trainable parameters in multimodal models. If freeze_vit or freeze_aligner are set to False, their weights will be filtered out of the Megatron checkpoint because they are not LoRA parameters and are not automatically added to modules_to_save. Consider ensuring that peft_format is only True when no other non-LoRA parameters are trainable, or that those modules are included in the filter.
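The first option in this comment (enable peft_format only when no non-LoRA parameters are trainable) could be sketched roughly as below. This is a minimal illustration, not code from ms-swift: `should_use_peft_format` is a hypothetical helper, and it assumes `freeze_vit` and `freeze_aligner` are plain booleans that fully describe which non-LoRA modules are trainable.

```python
def should_use_peft_format(tuner_type: str,
                           freeze_vit: bool = True,
                           freeze_aligner: bool = True) -> bool:
    """Hypothetical helper (not part of ms-swift).

    Returns True only when saving in PEFT format would not drop
    trainable non-LoRA weights, under the assumptions stated above.
    """
    if tuner_type == 'lora':
        # Behavior before this PR: plain LoRA always uses PEFT format.
        return True
    if tuner_type == 'lora_llm':
        # Assumption: an unfrozen ViT or aligner holds trainable non-LoRA
        # weights that a LoRA-only filter would exclude from the checkpoint.
        return freeze_vit and freeze_aligner
    # Any other tuner type (e.g. 'full') saves the regular checkpoint.
    return False
```

With this guard, a `lora_llm` run that unfreezes the ViT or the aligner would fall back to the regular checkpoint format. The other option raised in the comment, adding those modules to the save filter, would keep the PEFT format but requires changes to the filter itself.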

gemini-code-assist · 2026-05-06T08:59:24Z

            if self.tuner_type == 'full':
                self.tuner_type = 'lora'


For multimodal models, when adapters are provided but tuner_type is not specified (defaulting to 'full'), it might be more appropriate to set tuner_type to 'lora_llm' instead of 'lora'. This ensures that the specialized multimodal LoRA logic is used consistently for multimodal models.

Suggested change

-            if self.tuner_type == 'full':
-                self.tuner_type = 'lora'
+            if self.tuner_type == 'full':
+                self.tuner_type = 'lora_llm' if self.is_multimodal else 'lora'
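For reference, the effect of this suggestion can be checked in isolation. The sketch below is a standalone approximation: `resolve_tuner_type` and its `has_adapters` parameter are hypothetical names introduced for illustration, and the real logic lives on an arguments object whose surrounding conditions are not shown in this review.

```python
def resolve_tuner_type(tuner_type: str, has_adapters: bool, is_multimodal: bool) -> str:
    """Hypothetical standalone version of the suggested default (not ms-swift code)."""
    # Only the implicit default ('full') is rewritten, and only when
    # adapters were provided; explicit choices are left untouched.
    if has_adapters and tuner_type == 'full':
        return 'lora_llm' if is_multimodal else 'lora'
    return tuner_type


# Enumerate the cases the suggestion affects.
for multimodal in (False, True):
    resolved = resolve_tuner_type('full', has_adapters=True, is_multimodal=multimodal)
    print(f'is_multimodal={multimodal}: {resolved}')
```

Only the multimodal case changes relative to the current code; text-only models with adapters still resolve to 'lora'.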

megatron lora_llm support no_save_optim

6dab0f2

Jintao-Huang wants to merge 1 commit into modelscope:main from Jintao-Huang:megatron_lora_llm_support_no_save_optim
