Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: #3703 - Deepseek-Coder-33B-Instruct is incompatible #3732

Merged
merged 1 commit into from
Sep 26, 2024

Conversation

louis-jan
Copy link
Contributor

Describe Your Changes

This PR updates the DeepSeek Coder 33B Instruct download URL to a working quantized version. Stop tokens have also been added, allowing the model to outputs properly.

Screenshot 2024-09-25 at 11 38 36

Steps to Reproduce

  1. Download Deepseek Coder 33B Instruct
  2. Try to use it in threads
Deepseek Coder 33B
38183

Fixes Issues

Changes made

  1. package.json Updates:

    • The version number of the package has been incremented from 1.0.17 to 1.0.18.
  2. model.json Updates for deepseek-coder-1.3b:

    • Version number incremented from 1.3 to 1.4.
    • The array for stop is now explicitly set to ["<|EOT|>"], which defines the end-of-text token.
    • Tags have been modified to remove "Foundational Model", leaving only "Tiny".
  3. model.json Updates for deepseek-coder-34b:

    • The download URL for the model file deepseek-coder-33b-instruct.Q4_K_M.gguf has been changed from https://huggingface.co/TheBloke/deepseek-coder-33B-instruct-GGUF/resolve/main/deepseek-coder-33b-instruct.Q4_K_M.gguf to https://huggingface.co/mradermacher/deepseek-coder-33b-instruct-GGUF/resolve/main/deepseek-coder-33b-instruct.Q4_K_M.gguf.
    • Version number incremented from 1.3 to 1.4.
    • The array for stop is now explicitly set to ["<|EOT|>"].
    • Tags have been modified to remove "Foundational Model" and Updated "34B" to "33B".

@github-actions github-actions bot added the type: bug Something isn't working label Sep 25, 2024
Copy link
Contributor

Barecheck - Code coverage report

Total: 64.8%

Your code coverage diff: 0.00% ▴

✅ All code changes are covered

Copy link
Contributor

@dan-homebrew dan-homebrew left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @louis-jan

@louis-jan louis-jan merged commit 7f08f0f into dev Sep 26, 2024
11 checks passed
@louis-jan louis-jan deleted the fix/3703-deepseek-coder-33b-is-incompatible branch September 26, 2024 05:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

bug: Can't run model Deepseek Coder 33B Instruct - Check out what's wrong with VM 114
2 participants