Does time-slicing or MPS GPU sharing support a mode in which a process has exclusive use of GPU DRAM? #966

Open
so2bin opened this issue Sep 25, 2024 · 0 comments

so2bin commented Sep 25, 2024

  • Currently, with time-slicing or MPS GPU sharing, multiple processes hold GPU memory at the same time, so no single process can use all of it. Is there any technology or configuration that lets these GPU-sharing modes swap a process's GPU memory out to host memory while that process is not using the GPU? The process that is running on the GPU could then use all of the memory.

  • I want N GPUs to be shared by the containers of M developers, generally with M >= N. The M developers will not use the GPUs simultaneously, only intermittently. I would like each developer to occupy GPU memory only while actually using the GPU: even if a debugging process has not exited, it should not hold GPU memory while it does not need the GPU, so that the memory is freed for other users. Can the current GPU-sharing technology support this?
