allow arbitrary cross validation fold indices #3353

owlas · 2018-06-01T12:37:06Z

use training indices passed to folds parameter in training.cv
update doc string

xgboost `cv` allows the user to pass scikit-learn split objects or a list of indices for each fold. The indices take the form:

[ (fold_1_train_idx, fold_1_test_idx), (fold_2_train_idx, fold_2_test_idx) ]

The current implementation uses only the test indices: it forms `dtest` from them and forms `dtrain` from all indices that are not in the test indices, so the train indices that were passed are ignored.

An improvement would be to use the passed train indices explicitly (i.e. form `dtrain` from the train indices). This allows completely arbitrary cross-validation strategies.
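The difference between the two behaviours can be sketched in plain Python. This is only an illustration of the index handling described above, not the xgboost code itself: the helper names `split_by_complement` and `split_by_explicit_indices` are hypothetical, and rows are represented as plain integer lists rather than DMatrix slices.

```python
def split_by_complement(n_rows, folds):
    """Behaviour described as current: ignore the passed train indices and
    train on every row that is not in the fold's test indices."""
    splits = []
    for _train_idx, test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(n_rows) if i not in held_out]
        splits.append((train_idx, list(test_idx)))
    return splits


def split_by_explicit_indices(folds):
    """Behaviour proposed here: use the train indices exactly as passed."""
    return [(list(train_idx), list(test_idx)) for train_idx, test_idx in folds]


# Hypothetical forward-chaining scheme on 6 rows:
# each fold trains only on rows that come before its test rows.
folds = [([0, 1], [2, 3]), ([0, 1, 2, 3], [4, 5])]

old = split_by_complement(6, folds)
new = split_by_explicit_indices(folds)

print(old[0])  # train side also contains rows 4 and 5, which the caller excluded
print(new[0])  # train side is exactly what the caller passed
```

For the first fold the complement-based split trains on rows 4 and 5 even though the caller left them out, which is the case the explicit version is meant to handle.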

- use training indices passed to `folds` parameter in `training.cv`
- update doc string

* allow arbitrary cross validation fold indices
  - use training indices passed to `folds` parameter in `training.cv`
  - update doc string
* add tests for arbitrary fold indices

owlas force-pushed the train-test-indices branch 3 times, most recently from 2a1792f to 8473b5b · June 1, 2018 16:05

allow arbitrary cross validation fold indices

97be08d

- use training indices passed to `folds` parameter in `training.cv`
- update doc string

owlas force-pushed the train-test-indices branch 2 times, most recently from 2b87e24 to 8a1a98c · June 4, 2018 13:02

add tests for arbitrary fold indices

ec01fde

owlas force-pushed the train-test-indices branch from 8a1a98c to ec01fde · June 4, 2018 13:41

hcho3 merged commit 18813a2 into dmlc:master Jun 30, 2018

lock bot locked as resolved and limited conversation to collaborators Jan 18, 2019
