Update runtime packages #8

Closed
wants to merge 21 commits into from

@klwetstone klwetstone commented Aug 27, 2024

Update the runtime environment ahead of competition launch.

Removing / adding packages

To change the starting set of packages, I:

  • incorporated recommendations from Monday's triage
  • went through each package that was included in the runtime, checked its use, and removed any that seemed irrelevant
  • briefly skimmed blog posts / past solutions to NLP challenges for commonly used packages

Packages removed:

  • aiofiles
  • aiohttp
  • diskcache
  • gensim
  • keras
  • lightgbm
  • ray-default
  • tensorflow

Packages added:

  • bitsandbytes
  • langchain
  • llama-cpp-python
  • nltk
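
A quick smoke test for the package changes above could look like the following. This is only a sketch: the import names are assumptions (they can differ from the package names, e.g. llama-cpp-python is imported as `llama_cpp` and ray-default as `ray`), and `check_environment` is a hypothetical helper, not something in this repo.

```python
from importlib.util import find_spec

# Import names are assumptions and may differ from the package names.
ADDED = ["bitsandbytes", "langchain", "llama_cpp", "nltk"]
REMOVED = [
    "aiofiles", "aiohttp", "diskcache", "gensim",
    "keras", "lightgbm", "ray", "tensorflow",
]


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found, without importing it."""
    return find_spec(module_name) is not None


def check_environment(added, removed):
    """Return (missing, unexpected): added modules that are absent and
    removed modules that are still present."""
    missing = [m for m in added if not is_installed(m)]
    unexpected = [m for m in removed if is_installed(m)]
    return missing, unexpected


if __name__ == "__main__":
    missing, unexpected = check_environment(ADDED, REMOVED)
    print("missing:", missing)
    print("still present:", unexpected)
```

A removed package showing up under "still present" is not necessarily a problem: it may have been pulled back in as a transitive dependency of something else in the environment.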

Versioning

Steps used to update package versions:

  • Removed all pinned versions (set to "*")
  • Added a PyPI dependency for vLLM using the wheel compatible with CUDA 11.8 (unfortunately it's not yet available on conda-forge):
    [feature.base.pypi-dependencies]
    vllm = { url = "https:/vllm-project/vllm/releases/download/v0.4.0/vllm-0.4.0+cu118-cp310-cp310-manylinux1_x86_64.whl" }
    
  • Ran pixi to solve dependencies
  • Updated the pinned package versions to those compatible with vLLM. We are generally using more recent versions of all packages. Changes of note:
    • xformers 0.0.23 to 0.0.23.post1
    • pytorch 2.1.1 to 2.1.2
  • Removed the vLLM dependency, so it will not be in the initial runtime, but the initial runtime will be compatible with it.
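
The wheel filename used in the steps above encodes the constraints that drive the rest of the environment (CUDA 11.8, CPython 3.10). As an illustration, here is a minimal sketch that splits a wheel filename into its tags. `parse_wheel_tags` is a hypothetical helper; it assumes a filename without a build tag and a project name without hyphens, which holds for this wheel.

```python
def parse_wheel_tags(filename: str) -> dict:
    """Split a wheel filename of the form
    name-version-python-abi-platform.whl into its parts.

    Assumes no optional build tag and no hyphen in the project name.
    """
    stem = filename.removesuffix(".whl")  # requires Python 3.9+
    parts = stem.split("-")
    name, version = parts[0], parts[1]
    python_tag, abi_tag, platform_tag = parts[-3:]
    # A "+" separates the public version from the local version label,
    # which this wheel uses to mark the CUDA build.
    public_version, _, local_label = version.partition("+")
    return {
        "name": name,
        "version": public_version,
        "local": local_label,
        "python": python_tag,
        "abi": abi_tag,
        "platform": platform_tag,
    }


if __name__ == "__main__":
    tags = parse_wheel_tags("vllm-0.4.0+cu118-cp310-cp310-manylinux1_x86_64.whl")
    print(tags["local"], tags["python"])  # cu118 cp310
```

Here `cu118` corresponds to CUDA 11.8 and `cp310` to CPython 3.10, so any pixi solve that includes this wheel has to pick a Python and PyTorch build compatible with both.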

Comparison to PyPI

Where do our versions differ from the most recent versions on PyPI?

  • numpy is 1.26.4; the most recent is 2.1.0. v2 was only released in June 2024, so I'm not too suspicious about this resolving to v1
  • We are using CUDA 11.8; CUDA 12 is the most recent version (we decided not to update). All other packages that are behind are likely a result of this:
    • We use Python 3.10, because there is a vLLM wheel for CUDA 11.8 + Python 3.10
    • We use torch 2.1.2; the most recent is 2.4

The rest of the packages are at their most recent versions.
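
For illustration, the comparison above can be expressed as a small check that classifies how far a pinned version is behind the latest release. This is a sketch, not the process actually used for this PR: the version numbers are the ones quoted above, hard-coded rather than fetched from PyPI, and `release_tuple` / `version_lag` are hypothetical helpers that only handle plain dotted numeric versions (not suffixes such as `.post1`).

```python
def release_tuple(version: str, width: int = 3) -> tuple:
    """Parse a plain dotted numeric version such as "2.4" into a tuple,
    padded with zeros to `width` components."""
    parts = [int(p) for p in version.split(".")]
    parts += [0] * (width - len(parts))
    return tuple(parts)


def version_lag(pinned: str, latest: str) -> str:
    """Classify the gap between a pinned and a latest version."""
    p, l = release_tuple(pinned), release_tuple(latest)
    if p >= l:
        return "current"
    if p[0] < l[0]:
        return "major"
    if p[1] < l[1]:
        return "minor"
    return "patch"


if __name__ == "__main__":
    # (pinned, latest) pairs taken from the comparison above.
    versions = {"numpy": ("1.26.4", "2.1.0"), "torch": ("2.1.2", "2.4")}
    for name, (pinned, latest) in versions.items():
        print(f"{name}: {pinned} -> {latest} ({version_lag(pinned, latest)})")
```

With the numbers above, numpy is a major version behind and torch is a minor version behind, which matches the explanation that numpy v2 is very new and torch is held back by the CUDA 11.8 constraint.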

@klwetstone klwetstone changed the title [WIP] Update runtime packages Update runtime packages Aug 28, 2024
@klwetstone klwetstone requested a review from ejm714 August 28, 2024 18:58
klwetstone and others added 12 commits August 28, 2024 17:47
* scoring script

* new requirements

* readme updates

* add to quickstart readme

* quickstart description

* remove template, doesnt add much on top of example

* update example

* one example_submission dir

* update pack-example command

* link checks

* copy editing
@klwetstone

This is being replaced by #10 because of rebasing issues.

@klwetstone klwetstone closed this Aug 30, 2024
@klwetstone klwetstone deleted the add-packages branch August 30, 2024 13:27