Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Different quantization techniques #3707

Closed
glevv opened this issue Jan 2, 2021 · 1 comment
Closed

Different quantization techniques #3707

glevv opened this issue Jan 2, 2021 · 1 comment

Comments

@glevv
Copy link

glevv commented Jan 2, 2021

Is it possible to add different quantization techniques? As I understand most of GBDTs use quantiles to quantize data (like pd.qcut or KBinsDiscretizer(strategy='quantile')), but there are other options, like uniform quantization (like pd.cut or KBinsDiscretizer(strategy='uniform')). There are also other techniques in Catboost (like entropy minimization) and in 1d classification/clustering (like jenkins/natural/head-tails breaks), but they require fitting of additional parameters and could be time-wasteful unlike uniform and quantile strategies.

Third option could be geometric intervals which is increasing intervals that could be used when distribution of the feature is skewed. It is also easy to compute and gives different to uniform and quantile strategies results.

@StrikerRUS
Copy link
Collaborator

Closed in favor of being in #2302. We decided to keep all feature requests in one place.

Welcome to contribute this feature! Please re-open this issue (or post a comment if you are not a topic starter) if you are actively working on implementing this feature.

@StrikerRUS StrikerRUS changed the title Question about quantization Different quantization techniques Jan 12, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants