Introduction to Random Forests in H2O
This tutorial introduces H2O's Random Forest framework in R.
Random Forests
Intuition: Average an ensemble of weakly predicting (larger) trees where each tree is de-correlated from all other trees.
Important components:
- Number of trees
- Maximum depth of tree
- Number of variables randomly sampled as candidates for splits
- Sampling rate for constructing data set to use on each tree
R Documentation
The h2o.randomForest
function fits H2O's Random Forest from within R.
library(h2o)
args(h2o.randomForest)
The R documentation (man page) for H2O's Random Forest can be opened from within R using the help
or ?
functions:
help(h2o.randomForest)
We can run the example from the man page using the example
function:
example(h2o.randomForest)
And run a longer demonstration from the h2o
package using the demo
function:
demo(h2o.randomForest)