
Conversation


@ibm-peach-fish ibm-peach-fish commented Sep 28, 2023

  • Add LoRA configuration support to the toolkit functionality created in "Refactor peft module to take out common peft config functionality" (#163)

  • Add LoRA configuration and training to the text-generation module (.train).

  • Expose the parameters required for LoRA configuration via the .train function

  • Add support for saving LoRA models with "merged weights". This is to be done in the .train function itself, so that the model we configure in the __init__ function looks like any other transformers model (see the sketch after this list).

  • Unit tests cover new/changed code

  • Examples build against new/changed code

  • README updated here

  • Example for training lora added in example script
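
To make the merged-weights behavior concrete, here is a minimal sketch using the Hugging Face peft library. The base model name, LoRA hyperparameter values, and output path are illustrative assumptions, not values taken from this PR:

```python
# Minimal sketch of saving a LoRA model with merged weights via peft.
# ASSUMPTIONS: the base model, hyperparameters, and output path below
# are illustrative placeholders, not the values used in this PR.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")
lora_config = LoraConfig(
    task_type="CAUSAL_LM", r=8, lora_alpha=32, lora_dropout=0.05
)
peft_model = get_peft_model(base_model, lora_config)

# ... fine-tuning loop over the training data goes here ...

# Fold the trained LoRA deltas back into the base weights so the saved
# artifact loads like any other transformers model.
merged_model = peft_model.merge_and_unload()
merged_model.save_pretrained("outputs/lora_merged_model")
```

Saving the merged model (rather than the adapter alone) is what lets __init__ load it with a plain from_pretrained call, with no peft-specific handling at inference time.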

@ibm-peach-fish ibm-peach-fish self-assigned this Sep 29, 2023

Signed-off-by: Trevor Grant <[email protected]>
@ibm-peach-fish
Author

For the readme update:

From @gkumbhat on slack

Me: Could I get more specifics on which README.md you want updated? I'm having a hard time following the link in issue 169.
G: just this section: https://github.com/caikit/caikit-nlp/blob/main/README.md#:~:text=Salient%20Feature(s)-,Text%20Generation,-1.%20PeftPromptTuning
Particularly “Salient Feature(s)” part
Me: The link in the issue looked like that too / I think you mean the text generation row of the table in the introduction section / and from the links I think you're pointing to PeftPromptTuning, but all of that work was dismissed; the LoRA stuff lives in the text generation module / and hence I am lost about what you want me to put where
G: oh yes / I was thinking it would be a mention after "Fine-tuning: Both modules above provide optimized inference capability using Text Generation Inference Server" / that way it's in point 2 for TextGeneration, but we add a line stating this module can also do LoRA fine-tuning

Sometimes when I don't understand things, I'll walk away for a bit and then "get it" when I come back; that often works. This time it didn't. So I need more guidance on what to put where in the README.md, but other than that this PR is ready for review.

@ibm-peach-fish ibm-peach-fish marked this pull request as ready for review October 6, 2023 15:08
@ibm-peach-fish ibm-peach-fish changed the title [wip][caikitnlp-169] Add LoRA configuration support for fine-tuning module [caikitnlp-169] Add LoRA configuration support for fine-tuning module Oct 6, 2023
@ibm-peach-fish ibm-peach-fish linked an issue Oct 9, 2023 that may be closed by this pull request
@chakrn
Collaborator

chakrn commented Nov 1, 2023

cc: @Ssukriti

Would be great to get your eyes on this

Collaborator

@gkumbhat gkumbhat left a comment


Thanks for adding support for LoRA. Upon review, a couple of things need to be changed:

  1. The input to the train function needs to be a data model or built-in Python data types. This is so that we can automatically leverage those data types and type hints to generate a server spec servable via caikit.runtime. Under the hood, the parameters to the .train function get converted to a proto spec automatically and exposed to our gRPC and REST server. Currently it accepts LoRAConfig, which won't work with auto-generation of the server and thus won't get exposed in the API (a hedged sketch follows this list).
  2. The LoRA vector and the LoRA-based PEFT model get generated after the model is already trained. So it looks like the LoRA vectors / weights are not actually getting modified / trained.
  3. Once we move the LoRAConfig parameters to a data model type object, we would also need to filter and set "reasonable defaults" for those configuration parameters.
  4. It looks like the LoRA config related changes in peft_config.py aren't actually getting used.
  5. Looking at the implementation, and considering how LoRA is generally referenced as part of "prompt tuning", we decided it would probably be better to move this to the prompt tuning module and add a flag to save prompt vectors with merged weights or not. There will certainly be some things we would need to figure out to correctly hook it up for inferencing and its routing configuration, but that probably aligns better with how people are using it.
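
To illustrate points 1 through 3, here is a hedged sketch of what a primitives-only .train signature could look like. The class name, parameter names, and defaults are assumptions for illustration, not the actual caikit-nlp API:

```python
# Hypothetical sketch of review point 1: accept LoRA settings as
# built-in Python types (not a LoRAConfig object) so that caikit.runtime
# can convert the .train signature into a proto spec for gRPC/REST.
# ASSUMPTIONS: class name, parameter names, and defaults are illustrative.
from typing import List, Optional

from peft import LoraConfig


class TextGeneration:
    @classmethod
    def train(
        cls,
        base_model: str,
        train_stream,  # stream of training examples (caikit data model)
        lora_r: int = 8,  # built-in int instead of a LoRAConfig field
        lora_alpha: int = 32,
        lora_dropout: float = 0.05,
        target_modules: Optional[List[str]] = None,
    ) -> "TextGeneration":
        # Rebuild the peft config internally from the primitive inputs,
        # applying "reasonable defaults" (review point 3).
        lora_config = LoraConfig(
            r=lora_r,
            lora_alpha=lora_alpha,
            lora_dropout=lora_dropout,
            target_modules=target_modules,
        )
        # Per review point 2, the peft model must be created from this
        # config BEFORE the training loop runs, so the LoRA weights are
        # actually updated during training.
        ...
```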

@ibm-peach-fish
Author

Based on the requested changes, I think a complete refactor is in order, so I'm closing this PR.

@chakrn
Collaborator

chakrn commented Nov 9, 2023

I would like to keep this open while Sukriti reviews

@chakrn chakrn reopened this Nov 9, 2023
@ibm-peach-fish ibm-peach-fish removed their assignment Nov 10, 2023
@Ssukriti
Collaborator

@gkumbhat since I do not have write access to caikit-nlp, could you create a branch in the main repo and merge this PR to that branch? I can then continue to commit to that branch too, instead of having to pull from another fork.

@Ssukriti
Collaborator

Ssukriti commented Nov 20, 2023

Added a first PR to create the data model and utilities: #270

Remaining work is being paused until we have more direction on priorities. We can close this PR and continue to refer to it for any further refactoring in separate PRs.

