Business Analytics

Explain briefly the random forest method of classification

explain briefly the random forest method of classification

Random forest is currently the most accurate of all classification techniques available. Random forest is an ensemble method that works on the principle that many weak learners can come together to make a strong prediction. In this case, the weak learner is a simple decision tree, and random forest is strong learner.

Random forest optimizes the output from many decision trees formed from samples of the same data set. In general, the higher the number of trees, the better the accuracy of the resulting random forest ensemble will be. Yet, at higher numbers, the gain in accuracy decreases. So, the analyst has to decide on the number of trees, based on the cost of implementation that he/she will face with higher numbers of trees.

The trees are combined according to a voting mechanism. The voting is based on the success criteria of each tree. The best results are gained using a weighted approach, wherein the votes are weighted, based on the accuracy of individual trees. (See Figure .) Thus, the most accurate of classification model is found.

explain briefly the random forest method of classification

Team Educate

About Author

Leave a comment

Your email address will not be published. Required fields are marked *

You may also like

What are some of the tools used for statistical analysis
Business Analytics

What are some of the tools used for statistical analysis

There are several popular tools used for statistical analysis, each with its unique features and strengths: R: A programming language
Differentiate between structured semi-structured and unstructured data
Business Analytics

Differentiate between structured semi-structured and unstructured data

Most traditional data is structured , i.e., it can be stored in well-defined rows and columns. Legacy transaction systems are