[RFC] Drop support for CUDA 10 #5789

Closed
shiyu1994 opened this issue Mar 17, 2023 · 5 comments · Fixed by #6099

Comments

@shiyu1994
Collaborator

shiyu1994 commented Mar 17, 2023

Summary

CUDA 10 is very old, and with it we have hit compilation problems that do not occur with CUDA 11; see (#5605 (comment)) for an example. Dropping support for CUDA 10 would reduce maintenance and CI testing costs. I would like to hear your thoughts on this, @guolinke @jameslamb @StrikerRUS @jmoralez.

@jameslamb
Collaborator

I'm +1 on moving LightGBM's minimum supported CUDA version to CUDA 11.x.

We have such a small team of maintainers here, and so few non-maintainer contributors around the project right now, that I think the reduction in maintenance burden is necessary to ensure that at least the CUDA 11.x support is high-quality.


Some references to help with this decision...

The last CUDA 10.x release, v10.2, was in November 2019.

The first CUDA 11.x release, v11.0.1, was in June 2020. (release history).

And it seems to me that many other machine learning projects supporting GPU acceleration have already done that.

XGBoost has been requiring CUDA 11.x since at least June 2022: dmlc/xgboost#8006 (comment) (cc @trivialfis @hcho3 could tell us if it's even further back than that)

RAPIDS announced that they considered CUDA 10.2 "deprecated" as of February 2021 (https://docs.rapids.ai/notices/rsn0005/)... not sure if / when they formally removed support for it.

PyTorch seems to be publishing precompiled binaries only for CUDA 11.x, as far as I can tell (not sure if they support building from source against older CUDA)


TensorFlow dropped support for CUDA 10 as of v2.4.0, December 2020 (support table, release history)


@hcho3
Contributor

hcho3 commented Mar 19, 2023

@jameslamb We dropped CUDA 10.x support in February 2022: dmlc/xgboost#7366

@jameslamb
Collaborator

Perfect, thanks for that @hcho3 !

@jameslamb
Collaborator

To add more evidence here... the default CUDA on Google Colab is v12.0.

Ran !nvidia-smi in a notebook with a T4 GPU attached tonight:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.105.17   Driver Version: 525.105.17   CUDA Version: 12.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:00:04.0 Off |                    0 |
| N/A   39C    P8     9W /  70W |      0MiB / 15360MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
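For anyone who wants to run the same check on their own machine, here is a minimal sketch that pulls the "CUDA Version" field out of an `nvidia-smi` header and compares it to a minimum of 11.0, the floor proposed in this thread. The helper names `parse_cuda_version` and `meets_minimum` are made up for illustration and are not part of LightGBM. Note that the value `nvidia-smi` prints is the highest CUDA version the installed driver supports, which is not necessarily the toolkit version a project was compiled against.

```python
import re
from typing import Optional, Tuple

# Assumption: 11.0 is the minimum proposed in this thread, not a value read from LightGBM.
MIN_CUDA: Tuple[int, int] = (11, 0)


def parse_cuda_version(smi_output: str) -> Optional[Tuple[int, int]]:
    """Return (major, minor) from the 'CUDA Version: X.Y' field, or None if absent."""
    match = re.search(r"CUDA Version:\s*(\d+)\.(\d+)", smi_output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version: Tuple[int, int], minimum: Tuple[int, int] = MIN_CUDA) -> bool:
    """Compare versions as integer tuples so that 10.2 < 11.0 < 12.0."""
    return version >= minimum


# Header line copied from the Colab output above.
header = "| NVIDIA-SMI 525.105.17   Driver Version: 525.105.17   CUDA Version: 12.0     |"
version = parse_cuda_version(header)
if version is not None:
    print(version, meets_minimum(version))
```

Comparing integer tuples rather than strings or floats avoids ordering mistakes with versions such as 11.10 versus 11.2.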


github-actions bot commented Jan 3, 2024

This issue has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Jan 3, 2024