Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disabling adaptive batching leads to slower batched requests #1826

Open
jegork opened this issue Jun 19, 2024 · 0 comments
Open

Disabling adaptive batching leads to slower batched requests #1826

jegork opened this issue Jun 19, 2024 · 0 comments

Comments

@jegork
Copy link

jegork commented Jun 19, 2024

Hi!

I am sending 100 data samples in a single request (1 request with data containing 100 examples)

When I set adaptive batching to these values:

max_batch_time = 0.25
max_batch_size = 8

then request gets processed in 3.5 seconds, but if do not set these two parameters (i.e. disable adaptive batching), then the same request takes 8 seconds.
This looks strange, as in my opinion adaptive batching should not have impact when making a single request.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant