Lightweight R package for fast data splitting for cross-validation or train/valid/test splits
GPL-2.0 License
Published by mayer79 over 1 year ago
importFrom by ::.
Published by mayer79 over 1 year ago
multi_strata() provides a vector of stratification groups based on a data frame; this vector can then be passed to partition() or create_folds(). Each stratification group will contain "similar" data rows, where similarity is based either on a k-means cluster analysis or on forming all combinations of binned columns. Thanks to kapsner for the idea and the help with the implementation.
Published by mayer79 over 2 years ago
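The workflow described for multi_strata() can be sketched roughly as follows. This is a minimal illustration, not taken from the release notes: the argument names strategy, k, p and seed reflect the package documentation as best recalled and should be checked against the installed version, and the iris data set and the chosen columns are purely illustrative assumptions.

```r
library(splitTools)

# Illustrative data: two numeric columns of the built-in iris data set
df <- iris[, c("Sepal.Length", "Petal.Length")]

# Assumed signature: multi_strata(df, strategy, k).
# With strategy = "kmeans", k is taken to be the desired number of strata.
set.seed(1)
strata <- multi_strata(df, strategy = "kmeans", k = 3)

# Pass the strata vector to partition() for a train/valid/test split ...
idx <- partition(strata, p = c(train = 0.6, valid = 0.2, test = 0.2), seed = 1)

# ... or to create_folds() for stratified cross-validation folds
folds <- create_folds(strata, k = 5, seed = 1)

train <- iris[idx$train, ]
```

Here idx is expected to be a named list of row indices, so subsets such as iris[idx$valid, ] can be taken in the same way. Using strategy = "interaction" instead would correspond to the second approach mentioned above, forming combinations of binned columns.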
Maintenance release only.
Published by mayer79 over 3 years ago