
load_in_8bit is not working for some huggingface model #14

Closed
sanyalsunny111 opened this issue Aug 18, 2022 · 9 comments
@sanyalsunny111

I have updated the transformers package and I am using the ViLT model: https://huggingface.co/docs/transformers/model_doc/vilt#transformers.ViltForQuestionAnswering

[screenshot]

I am getting this error. Is load_in_8bit not integrated with all Hugging Face models? Could you please let me know how to use load_in_8bit for any Hugging Face model, not just BLOOM and T5?

[screenshot]

@younesbelkada
Collaborator

younesbelkada commented Aug 18, 2022

Hi @sanyalsunny111
Thank you very much for your message
Your initial issue is related to the fact that you did not install the latest version of transformers. Since the new features of the library have not been released yet, you cannot retrieve them with pip install transformers. Therefore, you have to manually install the latest version by running:
pip install git+https://github.com/huggingface/transformers.git

However, this model does not support device_map="auto" yet. This should be addressed in the PR huggingface/transformers#18683 and will therefore be available as soon as the fix is merged.
If you want to use this feature right away, you can directly install the transformers version that contains the ViLT support. I made an example Colab that you can try out here.
Let me know if anything else is unclear!
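
For reference, a minimal sketch of what the working call looks like once a source build of transformers with ViLT support is installed (the model id and the two keyword arguments are the ones used later in this thread; treat the surrounding details as illustrative assumptions):

```python
# Minimal sketch, assuming a source install of transformers that supports
# device_map="auto" for ViLT, e.g.:
#   pip install git+https://github.com/huggingface/transformers.git
from transformers import ViltForQuestionAnswering

model = ViltForQuestionAnswering.from_pretrained(
    "dandelin/vilt-b32-finetuned-vqa",
    device_map="auto",    # let accelerate dispatch the weights across available devices
    load_in_8bit=True,    # quantize the linear layers to int8 via bitsandbytes
)
```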

@sanyalsunny111
Author

@younesbelkada Thank you for your previous response. You rightly mentioned that device_map="auto" is not supported yet, and without it we cannot run an 8-bit model. But my question is: how did you use device_map="auto" in the Colab link you shared in your previous comment?
[screenshot]

@younesbelkada
Collaborator

Hi @sanyalsunny111
If you follow the same installation guidelines as in the Google Colab I shared with you, you should be able to pass device_map="auto" without any problems.

@younesbelkada
Collaborator

Hi @sanyalsunny111 !
No worries, I think you still haven't installed the correct version, because your previous transformers installation probably was not removed.
Could you try this command? pip install --force-reinstall git+https://github.com/younesbelkada/transformers.git@eee3986ec37e3050c1ee94a63efb13090602eae5
Thanks!
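
A quick way to confirm which transformers build is actually being imported (a simple sanity check, not a step from the thread):

```python
# Sanity check: print the version string and the install path of the
# transformers package that Python actually picks up.
import transformers

print(transformers.__version__)  # should reflect the dev/fork build just installed
print(transformers.__file__)     # location of the active installation
```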

@sanyalsunny111
Author

Hey @younesbelkada Thank you very much sir. It is working fine.

@younesbelkada
Collaborator

younesbelkada commented Aug 19, 2022

Great! Very happy that you made it work! 💪 Do not hesitate to open an issue if you run into any new problem.

@sanyalsunny111
Author

Hey @younesbelkada, device_map="auto" is actually interfering with distributed data parallel (DDP). I am using 8 GPUs and trying to run faster inference. Here is the error I am getting with model = ViltForQuestionAnswering.from_pretrained("dandelin/vilt-b32-finetuned-vqa", device_map="auto", load_in_8bit=True)
[screenshot of the error]

Could you please suggest how to use load_in_8bit with DDP?
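
The thread does not resolve this, but one commonly suggested workaround (an assumption, not confirmed here) is to give each DDP process a full copy of the model on its own GPU by passing an explicit device map instead of "auto", which otherwise shards the model across all visible GPUs:

```python
# Hypothetical sketch, not confirmed in this thread: map the whole model to
# the local rank's GPU so each DDP process owns a full replica.
import os
import torch
from transformers import ViltForQuestionAnswering

local_rank = int(os.environ.get("LOCAL_RANK", 0))  # set by torchrun
torch.cuda.set_device(local_rank)

model = ViltForQuestionAnswering.from_pretrained(
    "dandelin/vilt-b32-finetuned-vqa",
    device_map={"": local_rank},  # "" means: place every module on this device
    load_in_8bit=True,
)
```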

@younesbelkada
Collaborator

Hi @sanyalsunny111
Thanks for your message!
Did the error also happen with load_in_8bit=False? Could you also share the full script to reproduce the issue?
Thanks

@sanyalsunny111
Author

sanyalsunny111 commented Aug 19, 2022

Hey @younesbelkada, sorry to bother you with more errors. Yes, this error also happened with load_in_8bit=False; the code is attached in screenshot 1.
[screenshot 1]
Now, when I am not using load_in_8bit at all, no error happens, so it's safe to assume that either device_map or load_in_8bit is causing the error. Here is my piece of code, and here is the Hugging Face tutorial my code is based on.
