Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bloom generation generated repeated characters #8

Closed
prajdabre opened this issue Aug 12, 2022 · 1 comment
Closed

Bloom generation generated repeated characters #8

prajdabre opened this issue Aug 12, 2022 · 1 comment

Comments

@prajdabre
Copy link
Collaborator

Firstly: Fantastic work! This is the way!

I followed the instructions in your doc file where instead of opt66b I used bloom and bloom-3b.

The models load properly on my 8 V100 32GB gpus (3b needs 1 gpu obviously).

Decoding also finishes but the output is problematic:

My input: text = """The translation of 'I am a boy' in French is"""
My output: The translation of 'I am a boy' in French is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is is

This happens for both models.

Some details about my settings:

  1. V100 gpus
  2. transformers-4.22.0.dev0
  3. CUDA 11.1
  4. CUDNN 8.x
  5. bitsandbytes (I am assuming its the latest version copatible with cuda 11.x)

Kindly let me know how this can be fixed.

Thanks and regards.

@prajdabre
Copy link
Collaborator Author

Works on A100-40GB GPUS! Thanks for your awesome work!

techthiyanes pushed a commit to techthiyanes/bitsandbytes-1 that referenced this issue Jul 7, 2023
TNTran92 pushed a commit to TNTran92/bitsandbytes that referenced this issue Mar 24, 2024
…rm_with_transpose

Enable transform with transpose
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant