
Support for Open LLMs #1082

Merged — 28 commits into gpt-engineer-org:main, Apr 8, 2024
Conversation

zigabrencic (Collaborator) commented:

This PR resolves issue #943 by adding an explanation of how to use open LLMs.

Overview of changes:

  • Minor change in gpt_engineer/applications/cli/main.py so it works with open models.
  • Rewritten docs/open_models.md.
  • Added docs/examples to help users test whether their LLM setup works.

Feel free to let me know if anything is unclear or has to be modified.

Including the relevant reviewers: @ErikBjare @captivus @ATheorell @viborc
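For readers who want to sanity-check an open-model setup, here is a minimal sketch in the spirit of the added examples (the actual scripts in docs/examples may differ). It assumes the openai package before 1.0 and a llama-cpp-python server listening on localhost:8000; the model name is a placeholder.

```python
import os

import openai

# Point the client at the local OpenAI-compatible server instead of api.openai.com.
openai.api_base = os.getenv("OPENAI_API_BASE", "http://localhost:8000/v1")
# llama-cpp-python expects a dummy key; the server does not validate it.
openai.api_key = os.getenv("OPENAI_API_KEY", "sk-xxx")

response = openai.ChatCompletion.create(
    model="local-model",  # placeholder; the server answers for whichever model it was started with
    messages=[{"role": "user", "content": "Reply with a single short sentence."}],
)
print(response["choices"][0]["message"]["content"])
```

If this prints a response, the endpoint is wired up correctly and gpt-engineer should be able to use it.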

codecov bot commented Mar 22, 2024

Codecov Report

Attention: Patch coverage is 0%, with 2 lines in your changes missing coverage. Please review.

Project coverage is 84.18%. Comparing base (164730a) to head (4e7b072).
Report is 28 commits behind head on main.

| Files | Patch % | Lines |
| --- | --- | --- |
| gpt_engineer/applications/cli/main.py | 0.00% | 2 Missing ⚠️ |
Additional details and impacted files
```diff
@@            Coverage Diff             @@
##             main    #1082      +/-   ##
==========================================
- Coverage   84.29%   84.18%   -0.11%
==========================================
  Files          27       27
  Lines        1566     1568       +2
==========================================
  Hits         1320     1320
- Misses        246      248       +2
```


zigabrencic (Collaborator, Author) commented:

I managed to resolve the tox/pytest issues by renaming the example scripts to `openai_api_interface.py` and `langchain_interface.py`.

pytest thought that they were "tests" 😉 (by default it collects files named `test_*.py` or `*_test.py`).


Review thread on this snippet:

```bash
export OPENAI_API_BASE="http://localhost:8000/v1"
export OPENAI_API_KEY="sk-xxx"
```
Collaborator:
Is this required? Can it not just be null? It seems like an arbitrary magic string.

zigabrencic (Collaborator, Author) — Mar 24, 2024:

According to this: https://llama-cpp-python.readthedocs.io/en/latest/server/#multimodal-models yes.

Although I'm not sure why they require it; I didn't find an explanation.

Since we use llama-cpp-python for open model inference I decided to follow their documentation/convention.

Review thread on this snippet:

```python
openai.api_key = os.getenv("OPENAI_API_KEY")

if openai.api_key == "sk-xxx":
```
Collaborator:

This doesn't look clean to me. A boolean variable called `isLocalModel` or `LocalModel`, or something similar, would be better.

zigabrencic (Collaborator, Author):

I thought about this quite a lot and decided that the above is cleanest: it requires the fewest changes to the code base, at the price of some readability 🤷‍♂️

If we introduce a `LocalModel` variable or something similar, we also need a list of what qualifies as a local model. There are many options for open LLMs, so that list isn't as trivial as in the OpenAI case.

In principle we could require the user to set another environment variable, `local_model`, and decide based on that.

Not sure. I'm open to suggestions here.

Collaborator:

@TheoMcCabe, do you have any follow-up comments on @zigabrencic's explanation?

viborc (Collaborator) commented Mar 27, 2024:

@ErikBjare @captivus @ATheorell, do you guys want to weigh in on this PR, too?

ErikBjare (Collaborator) left a comment:

I haven't looked super closely, but it looks good!

One thing that struck me when reading Aider's blog is that openrouter.ai has an OpenAI-compatible API that lets users dispatch requests to a wide range of models: from OpenAI and Anthropic models to custom/open models via Together.ai and other providers.

This might be the fastest way to get started with open models, and it doesn't require any special hardware. Worth mentioning in the docs, IMO!
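For illustration, a hedged sketch of what Erik describes: the same openai<1.0 client pointed at openrouter.ai instead of a local server. The model id is only an example, and `OPENROUTER_API_KEY` is an assumed name for wherever you keep your key.

```python
import os

import openai

# OpenRouter exposes an OpenAI-compatible endpoint, so only the base URL and key change.
openai.api_base = "https://openrouter.ai/api/v1"
openai.api_key = os.environ["OPENROUTER_API_KEY"]  # assumed env var holding an OpenRouter key

response = openai.ChatCompletion.create(
    model="mistralai/mixtral-8x7b-instruct",  # example model id routed by OpenRouter
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response["choices"][0]["message"]["content"])
```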

zigabrencic (Collaborator, Author) commented:

As discussed in today's meeting, I switched to `export LOCAL_MODEL=true`.

I removed the rest, except for this line: `elif os.getenv("LOCAL_MODEL"):`, which is needed to properly compute local API costs.
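For context, a minimal sketch of the switch described above; the actual logic in gpt_engineer/applications/cli/main.py is more involved and is not quoted here.

```python
import os

# LOCAL_MODEL acts as a plain presence flag: if it is set to any non-empty
# value (the PR uses `export LOCAL_MODEL=true`), cost accounting treats the
# endpoint as local rather than as the hosted OpenAI API.
if os.getenv("LOCAL_MODEL"):
    print("Local model: compute API costs against the local endpoint.")
else:
    print("Hosted OpenAI model: use the built-in OpenAI cost table.")
```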

viborc merged commit d00be76 into gpt-engineer-org:main on Apr 8, 2024. 6 checks passed.

zigabrencic deleted the docs/open-llm-suport branch on May 16, 2024, then restored it the same day.