
Cannot use int8 #9

Closed

RiverDong opened this issue Aug 14, 2022 · 1 comment

Comments

@RiverDong

I tried to run BLOOM on 8x A100, but I cannot load it with `load_in_8bit`. I followed the instructions here and loaded the model with `model = AutoModelForCausalLM.from_pretrained(model_name, device_map='auto', load_in_8bit=True, max_memory=max_memory)`. Basically, if I don't pass `max_memory=max_memory`, most of the memory goes to `gpu:0` and I get a CUDA out-of-memory error. If I do pass `max_memory=max_memory`, it throws "8-bit operations are not supported under CPU".
[Screenshot: error traceback, Aug 13, 2022]
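For reference, a minimal sketch of the load described above, assuming the `bigscience/bloom` checkpoint and the 3 GB-per-GPU budget mentioned in the reply below; it is illustrative, not the reporter's exact script:

```python
# Minimal sketch, assuming bigscience/bloom and a 3GB-per-GPU budget;
# not the reporter's exact script.
import torch
from transformers import AutoModelForCausalLM

model_name = "bigscience/bloom"  # assumed checkpoint

# One entry per visible GPU; a budget this small leaves no room for the
# weights, so accelerate offloads layers to CPU, triggering the 8-bit error.
max_memory = {i: "3GB" for i in range(torch.cuda.device_count())}

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",      # let accelerate shard the model across devices
    load_in_8bit=True,      # int8 weights via bitsandbytes
    max_memory=max_memory,  # per-device memory cap used by the device map
)
```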

@TimDettmers
Collaborator

Looking again at this error, I realize the problem is likely that you set the memory threshold too low in `max_memory`. You are currently allowing 3 GB per GPU, for a total of 24 GB across 8 GPUs, but BLOOM needs ~180 GB of GPU memory. You can set it to ~36 GB per GPU if you have A100s with 40 GB of memory (or higher if you have the 80 GB ones).

We will fix the error message to note that this error appears when not enough GPU memory is allocated.
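A sketch of the suggested fix, assuming 8x 40 GB A100s and the `bigscience/bloom` checkpoint; the exact per-GPU budget may need tuning to leave headroom for activations and CUDA overhead:

```python
# Suggested budget: ~36GB on each of eight 40GB A100s (~288GB total),
# enough to hold BLOOM's ~180GB of int8 weights entirely on GPU.
max_memory = {i: "36GB" for i in range(8)}

model = AutoModelForCausalLM.from_pretrained(
    "bigscience/bloom",     # assumed checkpoint
    device_map="auto",
    load_in_8bit=True,
    max_memory=max_memory,  # high enough that no layer falls back to CPU
)
```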

techthiyanes pushed a commit to techthiyanes/bitsandbytes-1 that referenced this issue Jul 7, 2023

TNTran92 pushed a commit to TNTran92/bitsandbytes that referenced this issue Mar 24, 2024
…ix_bfloat16: Enable hip_bfloat16 for optim tests