set lora #506

JamesBrianD · 2025-12-01T11:30:13Z

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Please use English, otherwise it will be closed.
The purpose of the PR, or link existing issues this PR will resolve.
The test plan, such as providing test command.
(Optional) The necessary documentation update.

Co-authored-by: pathfinder-fp <[email protected]>

feat: lora manager

* feat: test lora * fix codes * repeat kv heads --------- Co-authored-by: JamesBrianD <[email protected]>

* feat: infer lora target modules from adapters * feat: add radix key with extra key

gemini-code-assist · 2025-12-01T11:30:33Z

Summary of Changes

Hello @JamesBrianD, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces performance optimizations for LoRA (Low-Rank Adaptation) weight management within the system. The primary goal is to reduce redundant computations by ensuring that LoRA layer buffer references are only updated, and model states are only re-captured, when new LoRA weights have actually been loaded into memory. This change enhances efficiency by avoiding unnecessary processing cycles when the active set of LoRA adapters remains unchanged.

Highlights

Optimized LoRA Weight Updates: The system now avoids unnecessary re-capturing of model states and updates to LoRA layer buffer references when no new LoRA weights are loaded, improving efficiency.
Conditional update_lora_info: The LoRAManager now only calls update_lora_info if new LoRA weights have actually been loaded into the memory pool, preventing redundant operations.
LoRAMemoryPool Return Value: The prepare_lora_batch method in LoRAMemoryPool now returns a boolean indicating whether new weights were loaded, enabling the conditional update logic.
Conditional Model State Re-capture: The ModelRunner now conditionally re-captures the model state for LoRA-enabled models, otherwise using a pre-captured state, further enhancing performance.
New Test Coverage: A new unit test file has been added to validate the conditional update logic for LoRA management, ensuring correctness of the optimization.
Gemini CLI Documentation: A new markdown file gemini.md was added, outlining testing constraints for the Gemini CLI.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

JamesBrianD and others added 16 commits December 1, 2025 17:44

feat: lora initial code (#432)

603c444

add adaptor (#443)

f1fafe8

Co-authored-by: pathfinder-fp <[email protected]>

[Feat] add bgmv backend for LoRA (#381)

fce946d

fix adapter (#451)

9093f42

Co-authored-by: pathfinder-fp <[email protected]>

feat: lora manager (#438)

66449f5

feat: lora manager

remove duplicate code

546b6b9

[SGL-452] add 96 e2e tests for BgmvBackend (#456)

7d73859

scheduler lora reqs (#465)

9290d29

add LoRALayer (#472)

7ddb7f5

can run without a little lora config (#491)

e96ca44

Add lora test (#492)

1679798

* feat: test lora * fix codes * repeat kv heads --------- Co-authored-by: JamesBrianD <[email protected]>

add output_sharding for lora_a and lora_b (#493)

9fa3018

fix lora test (#495)

83cfe5a

feat: infer lora target modules from adapters (#496)

b2c7a61

* feat: infer lora target modules from adapters * feat: add radix key with extra key

feat: add radix key with extra key

1de3a32

fix: update lora layers

3ccb667

JamesBrianD force-pushed the set-lora branch 7 times, most recently from 5a51e60 to 9839809 Compare December 2, 2025 05:25

set lora

c7b2afc

JamesBrianD force-pushed the set-lora branch from 9839809 to c7b2afc Compare December 2, 2025 06:32

aolemila force-pushed the epic/support-multi-lora branch from 074af9f to 8e22c18 Compare December 3, 2025 03:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

set lora #506

set lora #506

Uh oh!

JamesBrianD commented Dec 1, 2025

Uh oh!

gemini-code-assist bot commented Dec 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

set lora #506

Are you sure you want to change the base?

set lora #506

Uh oh!

Conversation

JamesBrianD commented Dec 1, 2025

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Uh oh!

gemini-code-assist bot commented Dec 1, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants