check_cols {recipes} | R Documentation |
check_cols
creates a specification of a recipe
step that will check if all the columns of the training frame are
present in the new data.
check_cols(recipe, ..., role = NA, trained = FALSE, skip = FALSE, id = rand_id("cols")) ## S3 method for class 'check_cols' tidy(x, ...)
recipe |
A recipe object. The check will be added to the sequence of operations for this recipe. |
... |
One or more selector functions to choose which
variables are checked in the check See |
role |
Not used by this check since no new variables are created. |
trained |
A logical for whether the selectors in |
skip |
A logical. Should the check be skipped when the
recipe is baked by |
id |
A character string that is unique to this step to identify it. |
x |
A |
This check will break the bake
function if any of the checked
columns does contain NA
values. If the check passes, nothing is changed
to the data.
data(biomass) biomass_rec <- recipe(HHV ~ ., data = biomass) %>% step_rm(sample, dataset) %>% check_cols(contains("gen")) %>% step_center(all_predictors()) ## Not run: bake(biomass_rec, biomass[, c("carbon", "HHV")]) ## End(Not run)