[python-package] Segmentation fault with CUDA version in Python interface (core dumped) #6300
Comments
Thanks for using LightGBM, and for the well-formatted report. We'd be happy to help, but there are some things you can do to narrow down the issue further and reduce the effort that'll be required to find the root cause.
Also note that I've reformatted your original post slightly to make the difference between code, your own words, and text printed by code clearer. If you're unsure how I did that, please review https://docs.github.com/en/get-started/writing-on-github/getting-started-with-writing-and-formatting-on-github/basic-writing-and-formatting-syntax.
Thank you @jameslamb for the kind reply. I took New Year's leave and am back at work today. There is one important thing that I forgot to post here: when I reduced the number of data points in the training set to a smaller number (e.g. 100,000), it worked. So maybe it is a problem with the data?
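One way to follow up on the observation that a smaller training set works is to bisect the row count until the smallest failing size is found. The following is only a minimal sketch: `trains_ok` is a hypothetical callable (not part of LightGBM) that trains on the first `n_rows` rows and returns `True` on success, and the search assumes that failures are monotonic in size, which may not hold if the crash depends on specific rows rather than on volume.

```python
from typing import Callable


def smallest_failing_size(
    trains_ok: Callable[[int], bool],
    known_good: int,
    known_bad: int,
) -> int:
    """Return the smallest row count in (known_good, known_bad] that fails.

    Assumption: if training fails at n rows, it also fails at every larger n.
    `trains_ok` is a hypothetical user-supplied check, ideally one that runs
    training in a child process so a segfault does not kill the caller.
    """
    if not known_good < known_bad:
        raise ValueError("known_good must be smaller than known_bad")

    lo, hi = known_good, known_bad  # invariant: lo succeeds, hi fails
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if trains_ok(mid):
            lo = mid  # mid rows still train fine, so search above it
        else:
            hi = mid  # mid rows already fail, so search below it
    return hi
```

Here `known_good` would be a size that is known to work (100,000 in the comment above) and `known_bad` the full training set size. If the result is unstable between runs, the monotonicity assumption is probably wrong and the contents of the data deserve a closer look.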
Please provide the details I asked for at #6300 (comment) to help us eliminate possible causes.
Hi @jameslamb, I re-ran my code, changing nothing but the training data, which I replaced with iris data from
This seems related to the CUDA version. I will investigate this.
@leedchou Could you provide the implementation of
In addition, if you could provide a minimal example for reproducing the error, that would be very helpful.
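When putting together such an example for a crash like this, it can help to run the script in a child process so that the parent survives the segmentation fault and can report how the child died. This is a hedged sketch using only the Python standard library: `run_isolated` and `describe_exit` are illustrative names, the script path is whatever file holds the reproduction, and the negative return code convention for signals applies to POSIX systems.

```python
import signal
import subprocess
import sys


def describe_exit(returncode: int) -> str:
    """Turn a subprocess return code into a short human-readable summary."""
    if returncode == 0:
        return "exited normally"
    if returncode < 0:
        # On POSIX, subprocess reports death by signal N as return code -N.
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            name = f"signal {-returncode}"
        return f"killed by {name}"
    return f"exited with status {returncode}"


def run_isolated(script_path: str, timeout_s: float = 3600.0) -> str:
    """Run a reproduction script in a child interpreter and summarize its exit.

    `-X faulthandler` asks the child to dump the Python traceback to stderr
    when it receives a fatal signal such as SIGSEGV.
    """
    proc = subprocess.run(
        [sys.executable, "-X", "faulthandler", script_path],
        capture_output=True,
        text=True,
        timeout=timeout_s,
    )
    sys.stderr.write(proc.stderr)  # forward the child's log and any traceback
    return describe_exit(proc.returncode)
```

The traceback printed by `faulthandler` only shows the Python frame that was active when the crash happened, so it narrows down the call but does not replace a native stack trace.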
Thank you @shiyu1994, I'd love to show you the implementation of
@leedchou Thanks. You may send that to my personal email shiyu_k1994@qq.com. It would also be great if you could post the example here for clear and open discussion.
Please do this, @leedchou, so that everyone finding this discussion from search in the future can learn from it and so that others can contribute to helping.
OK, I'll post it here, @shiyu1994.
focal_loss_obj:
train example:
Description
I installed the CUDA version of lightgbm-4.3.0.0. After the data was loaded and transferred to the GPU, execution stopped with a segmentation fault (core dumped). Below is the log.
The GPU has about 12 GB of memory, while the data is 6 GB.
Reproducible example
Environment info
LightGBM version or commit hash:
4.3.0.0
Command(s) you used to install LightGBM
Additional Comments