Use latest peft/transformers/accelerate/bitsandbytes for 4-bit (qlora) #166

arnocandel · 2023-05-25T05:10:34Z

https://arxiv.org/abs/2305.14314
https://huggingface.co/blog/4bit-transformers-bitsandbytes

update deps
enable 4-bit generation
enable 4-bit lora training
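
For readers unfamiliar with the 4-bit path described in the linked Hugging Face blog post, loading a model in 4-bit might look roughly like the sketch below. This is not the code from this PR: the helper names `build_4bit_load_kwargs` and `load_model_4bit` are hypothetical, and the NF4 / double-quantization / bfloat16 settings are assumptions taken from the blog post's examples rather than from `finetune.py` or `generate.py`.

```python
def build_4bit_load_kwargs(compute_dtype="bfloat16"):
    """Collect 4-bit quantization settings as a plain dict (hypothetical helper)."""
    return {
        "load_in_4bit": True,                  # quantize weights to 4-bit at load time
        "bnb_4bit_quant_type": "nf4",          # assumed: NormalFloat4 data type
        "bnb_4bit_use_double_quant": True,     # assumed: also quantize the quantization constants
        "bnb_4bit_compute_dtype": compute_dtype,
    }


def load_model_4bit(base_model):
    """Load a causal LM in 4-bit; needs torch, transformers, accelerate, bitsandbytes."""
    # Heavy imports are kept local so the settings helper above works without them.
    import torch
    from transformers import AutoModelForCausalLM, BitsAndBytesConfig

    kwargs = build_4bit_load_kwargs()
    # Convert the dtype name (e.g. "bfloat16") into the torch dtype object.
    kwargs["bnb_4bit_compute_dtype"] = getattr(torch, kwargs["bnb_4bit_compute_dtype"])
    quant_config = BitsAndBytesConfig(**kwargs)
    return AutoModelForCausalLM.from_pretrained(
        base_model,
        quantization_config=quant_config,
        device_map="auto",
    )
```

Calling `load_model_4bit("<base model name>")` requires a CUDA GPU and recent versions of the four libraries named in the PR title.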

Addresses #136 more directly than #107, but only at runtime.

finetune.py

generate.py

arnocandel · 2023-05-25T06:17:27Z

Testing 65B on 2x A6000Ada:
(env) arno@rippa:/nfs4/llm/h2ogpt(4bit)$ CUDA_VISIBLE_DEVICES=0,1 torchrun --nproc_per_node=2 finetune.py --base_model=decapoda-research/llama-65b-hf --train_4bit=True --micro_batch_size=1 --run_id=3 --data_path=h2oai/openassistant_oasst1_h2ogpt_graded
works:
0%|▋ | 1/237 [04:27<17:33:37, 267.87s/it]
1%|█▎ | 2/237 [09:01<17:41:35, 271.04s/it]
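
A back-of-the-envelope estimate suggests why 4-bit makes this run feasible. The sketch below counts only the base model weights and assumes a nominal 65e9 parameters, 48 GB of memory per GPU, and one full model copy per `torchrun` process; none of these figures are stated in the thread. It ignores activations, LoRA weights, optimizer state, and quantization constants, so actual usage will be higher.

```python
def weight_memory_gib(num_params, bits_per_param):
    """Approximate memory for model weights alone, in GiB."""
    return num_params * bits_per_param / 8 / 2**30


NUM_PARAMS_65B = 65e9  # assumed nominal parameter count

for bits in (16, 8, 4):
    gib = weight_memory_gib(NUM_PARAMS_65B, bits)
    print(f"{bits:>2}-bit weights: about {gib:.0f} GiB")
```

Under these assumptions the 4-bit weights come to roughly 30 GiB, which fits on a single 48 GB card, while the 16-bit weights would exceed the combined memory of both cards.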

pseudotensor

We will do fuller eval testing of 4-bit vs. 8-bit vs. 16-bit. According to the QLoRA paper, as long as LoRA is applied to all linear layers, 4-bit should be as good as 16-bit. I'm unclear on the speed of Tim's new code.
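
"LoRA on all linear layers" can be sketched as collecting the leaf names of every linear module and passing them as `target_modules`. The helpers below are hypothetical and are not taken from `finetune.py`; matching modules by class name, skipping `lm_head`, and the default `r`, `lora_alpha`, and `lora_dropout` values are all assumptions made for illustration.

```python
# Class names treated as linear layers: torch.nn.Linear plus the
# bitsandbytes 8-bit and 4-bit replacements.
LINEAR_CLASS_NAMES = {"Linear", "Linear4bit", "Linear8bitLt"}


def find_linear_module_names(named_modules, exclude=("lm_head",)):
    """Return sorted leaf names of linear modules, e.g. ['k_proj', 'q_proj']."""
    names = set()
    for full_name, module in named_modules:
        if type(module).__name__ in LINEAR_CLASS_NAMES:
            leaf = full_name.split(".")[-1]
            if leaf not in exclude:  # assumed: leave the output head untouched
                names.add(leaf)
    return sorted(names)


def build_lora_config(model, r=8, lora_alpha=16, lora_dropout=0.05):
    """Build a peft LoraConfig that targets every linear layer found in `model`."""
    from peft import LoraConfig  # local import: only needed for actual training

    return LoraConfig(
        r=r,
        lora_alpha=lora_alpha,
        lora_dropout=lora_dropout,
        target_modules=find_linear_module_names(model.named_modules()),
        bias="none",
        task_type="CAUSAL_LM",
    )
```

For a LLaMA-style model this would typically pick up the attention and MLP projections, though the exact names depend on the architecture.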

arnocandel added 2 commits May 24, 2023 22:09

Use latest peft/transformers/accelerate/bitsandbytes for 4-bit (qlora)

61aedd2

Update generation/fine-tuning to handle 4-bit

e403081

arnocandel commented May 25, 2023

finetune.py (review thread resolved)

arnocandel commented May 25, 2023

generate.py (review thread resolved)

arnocandel marked this pull request as ready for review May 25, 2023 06:17

arnocandel added 2 commits May 24, 2023 23:28

Update README.

34b5d15

Update readme for fine-tuning.

4a8cdf3

pseudotensor approved these changes May 25, 2023

arnocandel added 4 commits May 24, 2023 23:48

Add test coverage for cpu/gpu for 4/8/16/32 bit generation.

5d4e09f

Update test.

beb091e

Merge remote-tracking branch 'origin/main' into 4bit

cc88db0

Fix and speedup eval tests.

f7d04e0

arnocandel merged commit d160eb6 into main May 25, 2023
