This will sample the training and test sets so that case identifiers (e.g. subject ID's) are not shared across training and test sets.

resample_fold_train_test_split(X, y, train_size = 0.667, foldid = NULL,
  random_state = NULL)

Arguments

X

A data.frame, the data to be resampled.

y

A numeric vector with two classes, 0 and 1.

train_size

A numeric vector of length one; specifies what proportion of the data should be used for the training data set. Defaults to 0.667.

foldid

A vector with length equal to length(y) which identifies cases belonging to the same fold.

random_state

An integer vector of length one; specifies the seed to be used for the analysis. Defaults to NULL.

Value

A boolean vector of length n_obs where TRUE represents that observation should be in the train set.

See also

Other resample: resample_simple_train_test_split, resample_stratified_class_train_test_split, resample_stratified_simple_train_test_split