Features / Cleanup for MP Frontend #387

robertgshaw2-neuralmagic · 2024-07-31T13:41:16Z

SUMMARY:

refactor to use single socket
cleanup comments / logging
add do_log_stats
add abort

github-actions · 2024-07-31T13:41:30Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

joerunde · 2024-07-31T15:33:12Z

vllm/entrypoints/openai/api_server.py

- # task.add_done_callback(_running_tasks.remove)
+ if not engine_args.disable_log_stats:
+ task = asyncio.create_task(_force_log())
+ _running_tasks.add(task)


if this is here just to be a hard ref to the task so it doesn't get GC'ed, then I think it would work fine as a local variable inside lifespan instead of a module-scope variable

joerunde · 2024-07-31T15:33:41Z

vllm/entrypoints/openai/api_server.py

@@ -82,6 +82,7 @@ async def _force_log():
 async def build_backend(args) -> AsyncIterator[VLLMBackend]:
 # Context manager to handle backend lifecycle
 # Ensures everything is shutdown and cleaned up on error/exit
+ global engine_args


🌶️ , thanks!

joerunde · 2024-07-31T15:35:54Z

vllm/entrypoints/openai/api_server.py

-  # Wait for server process to join
-  rpc_server_process.join()
+ # Wait for server process to join
+ rpc_server_process.join()


I actually meant this to be in the finally statement so that it will run on exit when there's an unhandled exception, which can happen if there's an exception before we get to the guarded

try: await server_task

this was a typo, will fix

joerunde · 2024-07-31T15:39:41Z

vllm/entrypoints/openai/rpc/client.py

- self.get_data_socket.send(pickle.dumps(GetDataRequest.MODEL_CONFIG))
- model_config = await self.get_data_socket.recv()
- return pickle.loads(model_config)
+ # Await acknowledgement from RPCServer that it aborted.


copy/paste comment error?

robertgshaw2-neuralmagic added 7 commits July 30, 2024 14:52

stash

4f01472

updated to use DEALER<>ROUTER for abort socket

011db76

got abort working

85ef62b

stash

f5f96ee

Merge branch 'isolate-oai-server-process' into add-abort

5b39ada

refactored sockets, add do_log_stats and abort

cddf771

cleaning

01a8242

robertgshaw2-neuralmagic added 3 commits July 31, 2024 14:02

shared send_request function

b61fdd1

formatted

53207a0

format

04b4c9e

robertgshaw2-neuralmagic changed the title ~~Add abort~~ Features / Cleanup for MP Frontend Jul 31, 2024

robertgshaw2-neuralmagic added 3 commits July 31, 2024 14:33

update

7d92d50

fix

4e7efcd

code consistency nit

fa11533

joerunde reviewed Jul 31, 2024

View reviewed changes

robertgshaw2-neuralmagic added 3 commits July 31, 2024 16:02

format

9cae6fb

back inside finally

5b91f0f

simple

81fc45f

robertgshaw2-neuralmagic merged commit 1f33286 into isolate-oai-server-process Jul 31, 2024
1 check was pending

robertgshaw2-neuralmagic deleted the add-abort branch July 31, 2024 16:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Features / Cleanup for MP Frontend #387

Features / Cleanup for MP Frontend #387

robertgshaw2-neuralmagic commented Jul 31, 2024 •

edited

Loading

github-actions bot commented Jul 31, 2024

joerunde Jul 31, 2024

joerunde Jul 31, 2024

joerunde Jul 31, 2024 •

edited

Loading

robertgshaw2-neuralmagic Jul 31, 2024

joerunde Jul 31, 2024

Features / Cleanup for MP Frontend #387

Features / Cleanup for MP Frontend #387

Conversation

robertgshaw2-neuralmagic commented Jul 31, 2024 • edited Loading

github-actions bot commented Jul 31, 2024

joerunde Jul 31, 2024

Choose a reason for hiding this comment

joerunde Jul 31, 2024

Choose a reason for hiding this comment

joerunde Jul 31, 2024 • edited Loading

Choose a reason for hiding this comment

robertgshaw2-neuralmagic Jul 31, 2024

Choose a reason for hiding this comment

joerunde Jul 31, 2024

Choose a reason for hiding this comment

robertgshaw2-neuralmagic commented Jul 31, 2024 •

edited

Loading

joerunde Jul 31, 2024 •

edited

Loading