release v4.2.0 #6191

jameslamb · 2023-11-14T03:03:07Z

Release checklist:

Copied from #6076, with a few changes.

before merge

PRs that should be merged before releasing:

[python-package] Allow to pass Arrow table for prediction #6168 (finishes first pass of pyarrow support)
[ci] [R-package] allow more possibly-lost warnings from valgrind #6233 (fixes valgrind check)

after merge

Notes for Reviewers

I believe this should be v4.2.0 instead of v4.1.0 because of the two breaking changes:

[CUDA] drop CUDA 10 support, start supporting CUDA 12 (fixes #5789) #6099
[python-package] fix access to Dataset metadata in scikit-learn custom metrics and objectives #6108

This release of the R package will not be published to CRAN, as #5987 has still not been resolved. I'm still working on that (and making good progress!), but let's not delay the critical fix for quantized training (#6108) waiting on that. #6191 (comment)

jameslamb · 2023-11-14T03:03:26Z

/gha run r-valgrind

Workflow R valgrind tests has been triggered! 🚀
https://github.com/microsoft/LightGBM/actions/runs/6858665668

Status: failure ❌.

jameslamb · 2023-11-14T03:05:27Z

Add release branch to RTD versions, trigger a new build, check docs

✅ successful build: https://readthedocs.org/projects/lightgbm/builds/22543919/

✅ docs look good: https://lightgbm.readthedocs.io/en/release-v4.2.0/

jmoralez · 2023-11-22T18:31:33Z

I think we should include a fix for #6195 in this release, I can work on it this week. Also given that the work for supporting arrow isn't complete yet I think we could wait for it as well, WDYT?

jameslamb · 2023-12-01T02:47:09Z

I think we should include a fix for #6195 in this release, I can work on it this week.

Yeah since you already have #6218 up, I'm good with waiting to officially release this until that's included.

Also given that the work for supporting arrow isn't complete yet I think we could wait for it as well, WDYT?

I still feel the way I did in #6034 (comment) ... we shouldn't delay releasing to wait for the Arrow stuff to be done. I want to get that fix for quantized training out soon.

jameslamb · 2023-12-01T02:47:29Z

/gha run r-valgrind

Workflow R valgrind tests has been triggered! 🚀
https://github.com/microsoft/LightGBM/actions/runs/7055105840

Status: failure ❌.

borchero · 2023-12-05T22:25:12Z

we shouldn't delay releasing to wait for the Arrow stuff to be done.

I think we're very close to being done now though 😄 given the release cycle of this package, it would be a pity to wait another couple months for it to arrive.

jameslamb · 2023-12-07T23:03:54Z

/gha run r-valgrind

Workflow R valgrind tests has been triggered! 🚀
https://github.com/microsoft/LightGBM/actions/runs/7135050470

Status: failure ❌.

jameslamb · 2023-12-08T03:40:22Z

The valgrind checks are failing with 3 errors, all reporting bytes "possibly" lost, and all on code paths involving pthread_create@@GLIBC_2.34.

valgrind output (click me)

==5666== 
==5666== HEAP SUMMARY:
==5666==     in use at exit: 356,517,358 bytes in 57,373 blocks
==5666==   total heap usage: 11,237,862 allocs, 11,180,489 frees, 9,435,448,587 bytes allocated
==5666== 
==5666== 352 bytes in 1 blocks are possibly lost in loss record 157 of 2,135
==5666==    at 0x484DA83: calloc (in /usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so)
==5666==    by 0x40147D9: calloc (rtld-malloc.h:44)
==5666==    by 0x40147D9: allocate_dtv (dl-tls.c:375)
==5666==    by 0x40147D9: _dl_allocate_tls (dl-tls.c:634)
==5666==    by 0x4DA37B4: allocate_stack (allocatestack.c:430)
==5666==    by 0x4DA37B4: pthread_create@@GLIBC_2.34 (pthread_create.c:647)
==5666==    by 0x572D25F: ??? (in /usr/lib/x86_64-linux-gnu/libgomp.so.1.0.0)
==5666==    by 0x5723A10: GOMP_parallel (in /usr/lib/x86_64-linux-gnu/libgomp.so.1.0.0)
==5666==    by 0x17BB2B04: LGBM_DatasetCreateFromCSC (c_api.cpp:1512)
==5666==    by 0x17BEA3CB: LGBM_DatasetCreateFromCSC_R (lightgbm_R.cpp:184)
==5666==    by 0x495AE00: R_doDotCall (dotcode.c:894)
==5666==    by 0x4965E41: do_dotcall (dotcode.c:1551)
==5666==    by 0x49A7662: Rf_eval (eval.c:1253)
==5666==    by 0x49AE2CF: do_set (eval.c:3556)
==5666==    by 0x49A7409: Rf_eval (eval.c:1225)
==5666== 
==5666== 352 bytes in 1 blocks are possibly lost in loss record 158 of 2,135
==5666==    at 0x484DA83: calloc (in /usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so)
==5666==    by 0x40147D9: calloc (rtld-malloc.h:44)
==5666==    by 0x40147D9: allocate_dtv (dl-tls.c:375)
==5666==    by 0x40147D9: _dl_allocate_tls (dl-tls.c:634)
==5666==    by 0x4DA37B4: allocate_stack (allocatestack.c:430)
==5666==    by 0x4DA37B4: pthread_create@@GLIBC_2.34 (pthread_create.c:647)
==5666==    by 0x74DB328: std::thread::_M_start_thread(std::unique_ptr<std::thread::_State, std::default_delete<std::thread::_State> >, void (*)()) (in /usr/lib/x86_64-linux-gnu/libstdc++.so.6.0.30)
==5666==    by 0x177F719E: std::thread::thread<LightGBM::PipelineReader::Read(char const*, int, std::function<unsigned long (char const*, unsigned long)> const&)::{lambda()#1}, , void>(LightGBM::PipelineReader::Read(char const*, int, std::function<unsigned long (char const*, unsigned long)> const&)::{lambda()#1}&&) (std_thread.h:163)
==5666==    by 0x177F65B7: LightGBM::PipelineReader::Read(char const*, int, std::function<unsigned long (char const*, unsigned long)> const&) (pipeline_reader.h:56)
==5666==    by 0x177F9EA1: LightGBM::TextReader<int>::ReadAllAndProcess(std::function<void (int, char const*, unsigned long)> const&) (text_reader.h:103)
==5666==    by 0x177F7A7F: LightGBM::TextReader<int>::ReadAllLines() (text_reader.h:160)
==5666==    by 0x177ED0E9: LightGBM::DatasetLoader::LoadTextDataToMemory[abi:cxx11](char const*, LightGBM::Metadata const&, int, int, int*, std::vector<int, std::allocator<int> >*) (dataset_loader.cpp:967)
==5666==    by 0x177E85FA: LightGBM::DatasetLoader::LoadFromFile(char const*, int, int) (dataset_loader.cpp:231)
==5666==    by 0x17BC0CB9: LightGBM::DatasetLoader::LoadFromFile(char const*) (dataset_loader.h:26)
==5666==    by 0x17BAEC75: LGBM_DatasetCreateFromFile (c_api.cpp:983)
==5666== 
==5666== 352 bytes in 1 blocks are possibly lost in loss record 159 of 2,135
==5666==    at 0x484DA83: calloc (in /usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so)
==5666==    by 0x40147D9: calloc (rtld-malloc.h:44)
==5666==    by 0x40147D9: allocate_dtv (dl-tls.c:375)
==5666==    by 0x40147D9: _dl_allocate_tls (dl-tls.c:634)
==5666==    by 0x4DA37B4: allocate_stack (allocatestack.c:430)
==5666==    by 0x4DA37B4: pthread_create@@GLIBC_2.34 (pthread_create.c:647)
==5666==    by 0x572D25F: ??? (in /usr/lib/x86_64-linux-gnu/libgomp.so.1.0.0)
==5666==    by 0x5723A10: GOMP_parallel (in /usr/lib/x86_64-linux-gnu/libgomp.so.1.0.0)
==5666==    by 0x17BC6D7E: LightGBM::Booster::Predict(int, int, int, int, int, std::function<std::vector<std::pair<int, double>, std::allocator<std::pair<int, double> > > (int)>, LightGBM::Config const&, double*, long*) const (c_api.cpp:441)
==5666==    by 0x17BB9C62: LGBM_BoosterPredictForMat (c_api.cpp:2482)
==5666==    by 0x17BF15CF: LGBM_BoosterPredictForMat_R (lightgbm_R.cpp:974)
==5666==    by 0x495AFB2: R_doDotCall (dotcode.c:909)
==5666==    by 0x4965E41: do_dotcall (dotcode.c:1551)
==5666==    by 0x49A7662: Rf_eval (eval.c:1253)
==5666==    by 0x49AC6F4: do_begin (eval.c:2977)
==5666== 
==5666== LEAK SUMMARY:
==5666==    definitely lost: 0 bytes in 0 blocks
==5666==    indirectly lost: 0 bytes in 0 blocks
==5666==      possibly lost: 1,056 bytes in 3 blocks
==5666==    still reachable: 356,516,302 bytes in 57,370 blocks
==5666==                       of which reachable via heuristic:
==5666==                         newarray           : 4,264 bytes in 1 blocks
==5666==         suppressed: 0 bytes in 0 blocks
==5666== Reachable blocks (those to which a pointer was found) are not shown.
==5666== To see them, rerun with: --leak-check=full --show-leak-kinds=all
==5666== 
==5666== For lists of detected and suppressed errors, rerun with: -s
==5666== ERROR SUMMARY: 3 errors from 3 contexts (suppressed: 0 from 0)
writing valgrind output to valgrind-logs.log
valgrind found 0 bytes definitely lost
valgrind found 0 bytes indirectly lost
valgrind found 1056 bytes possibly lost
Error: Process completed with exit code 255.

I strongly believe these are false positives, based on these:

And based on the fact that CRAN has previously accepted our submissions which show these "possibly lost" valgrind findings related to pthread_create().
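As an illustration of the distinction drawn above (zero bytes definitely or indirectly lost, a small number "possibly" lost), here is a minimal sketch of how a log like this one could be checked automatically. It is not the project's actual CI script: the function names and the `max_possibly_lost` threshold are hypothetical, and the only input format assumed is the LEAK SUMMARY layout shown in the output above.

```python
import re

# Matches LEAK SUMMARY lines such as
# "==5666==      possibly lost: 1,056 bytes in 3 blocks".
# Per-record lines ("352 bytes in 1 blocks are possibly lost ...") do not match.
LEAK_LINE = re.compile(
    r"^==\d+==\s+(definitely|indirectly|possibly) lost: ([\d,]+) bytes in ([\d,]+) blocks"
)


def parse_leak_summary(log_text):
    """Return {kind: bytes_lost} for each 'lost' line in a valgrind LEAK SUMMARY."""
    found = {}
    for line in log_text.splitlines():
        match = LEAK_LINE.match(line)
        if match:
            found[match.group(1)] = int(match.group(2).replace(",", ""))
    return found


def leaks_acceptable(found, max_possibly_lost):
    """Reject any definite or indirect loss; tolerate a bounded 'possibly lost' total.

    max_possibly_lost is a hypothetical threshold chosen by the caller.
    """
    return (
        found.get("definitely", 0) == 0
        and found.get("indirectly", 0) == 0
        and found.get("possibly", 0) <= max_possibly_lost
    )
```

With the summary above, `parse_leak_summary` would report 1056 bytes possibly lost, so the run passes only if the caller's threshold is at least that large.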

I'm going to build the R package from this branch and submit it to CRAN as v4.2.0. Will post here when I've done that.

jameslamb · 2023-12-08T05:01:13Z

I've built the R package from this branch and submitted it to CRAN as v4.2.0.

In addition to all the passed tests here, I tested the locally-built package by running R CMD check --as-cran on my Mac (Intel) and by submitting it to win-builder.

Logs from the successful win-builder r-devel build: https://win-builder.r-project.org/lWkS6twSPNK8/00check.log

@shiyu1994 I'm not sure, but you might receive an email from CRAN asking about the maintainer change. Please check at your earliest convenience, and click any confirmation links they send you.

I'll post updates here as the checks run on CRAN.

jameslamb · 2023-12-08T05:19:23Z

🎉 the package passed the 2 automatic checks!

I just got a message from CRAN saying that, and that they'll continue with other checks once you confirm the maintainer change @shiyu1994 .

package lightgbm_4.2.0.tar.gz has been auto-processed.
We are waiting for confirmation from the old maintainer address now.

Log dir: https://win-builder.r-project.org/incoming_pretest/lightgbm_4.2.0_20231208_055726/
The files will be removed after roughly 7 days.
Installation time in seconds: 420
Check time in seconds: 199
R Under development (unstable) (2023-12-07 r85661 ucrt)

Pretests results:
Windows: https://win-builder.r-project.org/incoming_pretest/lightgbm_4.2.0_20231208_055726/Windows/00check.log
Status: 1 NOTE
Debian: https://win-builder.r-project.org/incoming_pretest/lightgbm_4.2.0_20231208_055726/Debian/00check.log
Status: 1 NOTE

shiyu1994 · 2023-12-08T10:59:47Z

@jameslamb I've confirmed the changes.

jameslamb · 2023-12-08T14:01:39Z

thank you!!

jameslamb · 2023-12-09T06:18:46Z

R-package update: so far, so good 🎉

passed the CRAN pre-checks
binaries have been built for macOS (x86_64 and arm64)
binary for Windows R-devel has been built
checks have passed for 6 CRAN check flavors

https://cran.r-project.org/web/checks/check_results_lightgbm.html

mayer79 · 2023-12-09T10:57:25Z

Wonderful, thank you so much!

jameslamb · 2023-12-13T05:09:22Z

The CRAN checks for the R package are progressing well!

Only 2 of the main CRAN checks remain (unsure how many of the extra ones from https://cran.r-project.org/web/checks/check_issue_kinds.html will be run or if any have been run already).

https://cran.r-project.org/web/checks/check_results_lightgbm.html

Given that... let's continue with the release. Here's my proposed sequence:

merge [python-package] Allow to pass Arrow table for prediction #6168
merge [ci] [R-package] allow more possibly-lost warnings from valgrind #6233
re-run valgrind checks and confirm they work on this branch
add any versionadded:: annotations in docs
manually test on M1/M2 mac (I'll do that on my laptop)
get approvals from everyone
merge this branch and do all the other checklist stuff (add new git tag, create GitHub release, upload to PyPI, etc.)

jameslamb · 2023-12-20T04:35:24Z

manually test Python and R packages on M1/M2 Mac

Seems that LightGBM can be compiled successfully on arm64 Macs, but experiences deadlocks if OpenMP is enabled (which is the default). Looks like #4229 might still be an issue with newer versions of libomp.

I'll put some time into that for the next release... but I don't think it should stop this one, as it's been a problem for a while.

R-package details (click me)

The R package is passing all of CRAN's arm64 Mac checks.

But it isn't finding OpenMP. e.g., see https://www.r-project.org/nosvn/R.check/r-release-macos-arm64/lightgbm-00install.html

* installing *source* package ‘lightgbm’ ...
** package ‘lightgbm’ successfully unpacked and MD5 sums checked
** using staged installation
checking location of R... /Library/Frameworks/R.framework/Resources
checking whether MM_PREFETCH works... no
checking whether MM_MALLOC works... yes
checking whether OpenMP will work in a package... no
***********************************************************************************************
 OpenMP is unavailable on this macOS system. LightGBM code will run single-threaded as a result.
 To use all CPU cores for training jobs, you should install OpenMP by running

     brew install libomp
***********************************************************************************************
configure: creating ./config.status

Python package details (click me)

On my M2 Mac with the following:

OS: macOS 14.1.2 (Sonoma)
compiler: AppleClang 15.0.0
Python: 3.11.7

I ran the following on this branch

sh build-python.sh sdist
pip install ./dist/lightgbm-4.2.0.tar.gz

lightgbm compiled successfully and could be imported (python -c "import lightgbm"), but I found that even the following simple program deadlocks (hangs indefinitely) during Dataset construction.

import lightgbm as lgb
from sklearn.datasets import make_regression

X, y = make_regression(n_samples=10_000)
dtrain = lgb.Dataset(X, label=y)
dtrain.construct()

I saw a similar deadlock trying to run the tests.

pytest tests/python_package_test/test_basic.py

I uninstalled lightgbm and tried reinstalling with OpenMP support turned off.

pip uninstall lightgbm
pip install \
    --config-settings=cmake.define.USE_OPENMP=OFF \
    ./dist/lightgbm-4.2.0.tar.gz

When I did that, that simple example and the tests ran successfully and very fast.
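Because the failure described here is a hang rather than an error, a manual check like this one can block forever. The sketch below shows one way to turn a hang into a clear yes/no result by running the reproduction in a child interpreter with a time limit. The helper name `finishes_within` and the `REPRO` constant are hypothetical; the reproduction itself is the program quoted above and needs lightgbm and scikit-learn installed to run.

```python
import subprocess
import sys

# The reproduction from the comment above, kept as a string so it can be run
# in a separate interpreter. It is not executed when this module is imported.
REPRO = """
import lightgbm as lgb
from sklearn.datasets import make_regression

X, y = make_regression(n_samples=10_000)
dtrain = lgb.Dataset(X, label=y)
dtrain.construct()
"""


def finishes_within(code, timeout_seconds):
    """Run `code` in a fresh Python process.

    Returns True if it exits successfully before the timeout and False if it
    is still running when the timeout expires (the child is then killed).
    A non-zero exit status raises subprocess.CalledProcessError.
    """
    try:
        subprocess.run(
            [sys.executable, "-c", code],
            timeout=timeout_seconds,
            check=True,
        )
    except subprocess.TimeoutExpired:
        return False
    return True
```

For example, `finishes_within(REPRO, 60)` would be expected to return False on a machine affected by the deadlock and True after reinstalling with OpenMP turned off; the 60-second limit is an arbitrary choice.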

jameslamb · 2023-12-20T04:36:25Z

I think this release is ready to go!

@guolinke @shiyu1994 @jmoralez could you please review?

@borchero let us know here if you have any questions about the release process or anything you see in the diff of this PR.

shiyu1994 · 2023-12-20T15:18:54Z

.appveyor.yml

@@ -1,4 +1,4 @@
-version: 4.1.0.99.{build}
+version: 4.2.0.{build}


Just want to make sure why we are using 4.2.0 now instead of 4.2.0.99 to differentiate between released version and the one built from source?

Like we've done for previous releases, after this that'll get changed to 4.2.0.99 in a follow-up PR.

For example: #6090

This version doesn't affect any artifacts that are delivered to users or anything. Just the way builds are organized in the AppVeyor UI.

https://ci.appveyor.com/project/guolinke/lightgbm/history

Doing this on the release PR ensures these builds are identifiable in the future as belonging to the 4.2.0 release.

differentiate between released version and the one built from source

The commit produced when we merge this PR will be the released version.
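The convention discussed in this thread (a release is `4.2.0`, and development builds after it are `4.2.0.99`) can be expressed as a small check. This is only a sketch of the naming convention as described here; the function name is hypothetical, and it deliberately ignores the `.{build}` suffix that AppVeyor appends.

```python
def is_development_version(version):
    """Return True for versions like '4.2.0.99', False for releases like '4.2.0'.

    Assumes the convention from this thread: a development version is a
    release version with a fourth component equal to '99'.
    """
    parts = version.split(".")
    return len(parts) == 4 and parts[3] == "99"
```

Under that assumption, `is_development_version("4.1.0.99")` is True and `is_development_version("4.2.0")` is False.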

jameslamb · 2023-12-20T15:40:53Z

Thanks everyone! I'll publish the release some time today.

borchero

For completeness in the reviewer list 😁🚀

shiyu1994 · 2023-12-20T16:02:39Z

@jameslamb Thanks for your explanation!

jameslamb · 2023-12-21T05:10:41Z

Ran the following to create the v4.2.0 tag and update the stable tag.

git fetch upstream --tags
git tag -d stable
git push upstream :refs/tags/stable
git tag stable
git tag v4.2.0
git push upstream stable v4.2.0

(NOTE: I alias this repo to upstream and my fork to origin in my git settings)

https://github.com/microsoft/LightGBM/tags

That triggered an Azure DevOps build which should create the release automatically: https://dev.azure.com/lightgbm-ci/lightgbm-ci/_build/results?buildId=15616&view=results. This takes around 90 minutes (because of the QEMU CI job).

I'll do the remaining tasks tomorrow.

jameslamb · 2023-12-21T19:00:56Z

v4.2.0 release has been created: https://github.com/microsoft/LightGBM/releases/tag/v4.2.0

I'll handle PyPI, NuGet, and homebrew later today.

jameslamb · 2023-12-22T04:17:30Z

Update version and commit hash in Homebrew formula

Homebrew/homebrew-core#157978

jameslamb · 2023-12-22T04:56:02Z

Upload release to test PyPI
Upload release to PyPI.

Uploaded v4.2.0 to test PyPI

gh release download \
    --repo microsoft/LightGBM \
    --dir ./artifacts \
    --pattern 'lightgbm*-py3-*.whl' \
    --pattern 'lightgbm-4.2.0.tar.gz' \
    v4.2.0

twine upload \
    -r testpypi \
    ./artifacts/*

(gh is the GitHub CLI, see https://cli.github.com/manual/gh_release_download)

Then confirmed that installing the latest wheel works.

pip install -i https://test.pypi.org/simple/ 'lightgbm==4.2.0'
python ./examples/python-guide/logistic_regression.py

Then pushed them to real PyPI.

twine upload \
    ./artifacts/*
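Since `twine upload ./artifacts/*` uploads whatever is in the directory, it can be worth confirming that the directory holds only the files the `gh release download` patterns were meant to select. The sketch below re-applies those two patterns to a list of filenames. The function name is hypothetical, and Python's `fnmatch` is assumed to be close enough to the CLI's glob matching for patterns this simple.

```python
from fnmatch import fnmatchcase

# The two --pattern values used in the `gh release download` command above.
PATTERNS = ["lightgbm*-py3-*.whl", "lightgbm-4.2.0.tar.gz"]


def unexpected_artifacts(filenames, patterns=PATTERNS):
    """Return the filenames that match none of the expected patterns."""
    return [
        name
        for name in filenames
        if not any(fnmatchcase(name, pattern) for pattern in patterns)
    ]
```

Passing the result of `os.listdir("./artifacts")` should give an empty list before uploading; anything returned is a file that was not selected by the download patterns.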

jameslamb · 2023-12-22T05:01:05Z

Add new tag to RTD versions and trigger a new build.

Remove the release branch from RTD versions

These are done. The v4.2.0 docs are now available on readthedocs.

successful build: https://readthedocs.org/projects/lightgbm/builds/22930492/
version-pinned docs: https://lightgbm.readthedocs.io/en/v4.2.0/

jameslamb · 2023-12-22T05:09:18Z

Published to NuGet: https://www.nuget.org/packages/LightGBM/4.2.0

And with that, this release is done! Thanks again to everyone who contributed 👋🏻

release v4.2.0

12fb454

jameslamb added awaiting review maintenance labels Nov 14, 2023

jameslamb requested review from guolinke, shiyu1994 and jmoralez as code owners November 14, 2023 03:03

jameslamb added 2 commits November 24, 2023 23:25

Merge branch 'master' into release/v4.2.0

e1a1e57

Merge branch 'master' into release/v4.2.0

2a33bac

jameslamb mentioned this pull request Dec 3, 2023

[R-package] change CRAN maintainer #6224

Merged

jameslamb mentioned this pull request Dec 7, 2023

[R-package] [c++] add tighter multithreading control, avoid global OpenMP side effects (fixes #4705, fixes #5102) #6226

Merged

Merge branch 'master' into release/v4.2.0

9209b0f

jameslamb mentioned this pull request Dec 13, 2023

[ci] [R-package] allow more possibly-lost warnings from valgrind #6233

Merged

Merge branch 'master' into release/v4.2.0

eb647ab

jameslamb added 2 commits December 13, 2023 21:38

Merge branch 'master' into release/v4.2.0

8e80a36

Merge branch 'master' into release/v4.2.0

ca40f6f

update cran-comments

97f726e

jameslamb added in progress and removed awaiting review labels Dec 19, 2023

This was referenced Dec 19, 2023

[R-package] Warnings of CRAN Package #6221

Closed

[R-package] v4.0.0 CRAN submission issues #5987

Closed

jameslamb added awaiting review and removed in progress labels Dec 20, 2023

jameslamb requested a review from borchero December 20, 2023 04:35

jameslamb mentioned this pull request Dec 20, 2023

LightGBM is incompatible with libomp 12 and 13 on macOS #4229

Closed

guolinke approved these changes Dec 20, 2023


shiyu1994 approved these changes Dec 20, 2023


jmoralez approved these changes Dec 20, 2023


borchero approved these changes Dec 20, 2023


jameslamb removed the awaiting review label Dec 20, 2023

jameslamb mentioned this pull request Dec 20, 2023

Add bundle support for LightGBM rstudio/bundle#55

Closed

jameslamb merged commit 0a9a6bb into master Dec 21, 2023
42 checks passed

jameslamb deleted the release/v4.2.0 branch December 21, 2023 04:28

jameslamb mentioned this pull request Dec 21, 2023

bump development version to 4.2.0.99 #6241

Merged

jameslamb mentioned this pull request Dec 22, 2023

[R-package] remove readRDS.lgb.Booster() and saveRDS.lgb.Booster() #6246

Merged

jameslamb mentioned this pull request Jan 17, 2024

release v4.3.0 #6277

Merged

22 tasks
