Set Leo default model to Llama 3 8b (25% rollout) #1225

nvonpentz · 2024-10-15T18:57:18Z

Related to brave/brave-core#21398
Blocked on deploying https:/brave/aichat-ops/pull/359 to prod

This sets the default model for 25% of free users to chat-basic, which corresponds to Llama 3 8b.

The intention is to start this to progressively roll out this change (25%, 50%, 75%, 100%), so we can ensure our single instance of Llama 3 can handle the increase in traffic. Based on these estimations, one instance of llama 3 (one gpu) should have higher token throughput than our single instance of mixtral (four gpus), so we expect this to work with two llama instances which will be added in https:/brave/aichat-ops/pull/359.

cc @petemill @LorenzoMinto

Note:

Chromium version 122.0.6261.57 was selected because that was the first chromium version when 1.63.x was released (see https://bravesoftware.slack.com/archives/C04PX1BUN/p1708629893634639), which is when AI Chat: introduce freemium model concept brave-core#21398 went in.
I have included all platforms because we want to make this change across all platforms, however this does differ from the BraveAIChatEnabledStudy which applies only to desktop

github-actions · 2024-10-15T18:57:33Z

✅ Test Seed Generated Successfully

To apply the test seed:

Desktop: Launch the browser with --variations-pr=1225.
Android: Set the command line to --variations-pr=1225 in debug menu, restart the browser.
iOS: Set Variations PR to 1225 in Brave Core Switches debug menu, restart the browser.
Wait 5-10 seconds to fetch the seed.
Restart the browser to apply the seed.
Ensure Active Variations section at brave://version starts with the expected seed version (see below).

Seed Details

Parameter	Value
Version	`pull/1225@02be91adb07839f2668690152f397b0a9ec4c0c2`
Uploaded	Thu, 17 Oct 2024 14:10:54 GMT
PR commit	`ed3f1df`
Base commit	`024f17f`
Merge commit	`02be91a`
Serial number	`2b5fac7fc29881a4c81bbafb69157d15`

…on't overlap

petemill · 2024-10-15T21:18:35Z

seed/seed.json

+ "BETA",
+ "NIGHTLY"
+ ],
+ "min_version": "122.0.6261.57",


looks like in this file the versioning is [chrome_major].[brave_version], e.g 122.1.63.0?

I updated to use 122.1.63.161 as the min version of the default model change. 1.63.161 seems to be the first 1.63 release. This also adds in a little buffer because brave/brave-browser#34721 was uplifted into 1.62.x.

I specified 122.1.63.160 as the new max version for the previous setting. This may not correlate to an actual release, it's just one patch version behind 122.1.63.161.

petemill · 2024-10-15T21:24:21Z

I tested running with --enable-features=AIChat:default_model\/chat-basic and all seems fine

Use [chrome_major].[brave_version] format, not just the chromium version.

Set Leo default model to Llama 3 8b (25% rollout)

6f85252

Specify a max_version for the older BraveAIChatEnabledStudy so they d…

71a9606

…on't overlap

petemill reviewed Oct 15, 2024

View reviewed changes

Update min / max versions

ed3f1df

Use [chrome_major].[brave_version] format, not just the chromium version.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set Leo default model to Llama 3 8b (25% rollout) #1225

Set Leo default model to Llama 3 8b (25% rollout) #1225

nvonpentz commented Oct 15, 2024 •

edited

Loading

github-actions bot commented Oct 15, 2024 •

edited

Loading

petemill Oct 15, 2024

nvonpentz Oct 17, 2024

petemill commented Oct 15, 2024

Set Leo default model to Llama 3 8b (25% rollout) #1225

Are you sure you want to change the base?

Set Leo default model to Llama 3 8b (25% rollout) #1225

Conversation

nvonpentz commented Oct 15, 2024 • edited Loading

github-actions bot commented Oct 15, 2024 • edited Loading

✅ Test Seed Generated Successfully

Seed Details

petemill Oct 15, 2024

Choose a reason for hiding this comment

nvonpentz Oct 17, 2024

Choose a reason for hiding this comment

petemill commented Oct 15, 2024

nvonpentz commented Oct 15, 2024 •

edited

Loading

github-actions bot commented Oct 15, 2024 •

edited

Loading