Fundamentals of Machine Learning for Predictive Data Analytics
Chapter 8: Evaluation (Sections 8.1, 8.2, 8.3)
The most important part of the design of an evaluation experiment for a predictive model is ensuring that the data used to evaluate the model is not the same as the data used to train the model.

The purpose of evaluation is threefold:
1 to determine which model is the most suitable for a task
2 to estimate how the model will perform
3 to convince users that the model will meet their needs

Figure: The process of building and evaluating a model using a hold-out test set.
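
The hold-out idea can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used to produce the figure: the function name `hold_out_split`, the 30% test fraction, the seed, and the 20 placeholder instance IDs are all assumptions made for the example.

```python
import random

def hold_out_split(instances, test_fraction=0.3, seed=0):
    """Shuffle the instances and split them into disjoint training and test sets."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(instances)              # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)   # seeded shuffle for repeatability
    n_test = max(1, round(len(shuffled) * test_fraction))
    # Return (training set, test set); the two never share an instance.
    return shuffled[n_test:], shuffled[:n_test]

# Hypothetical dataset: 20 instance IDs standing in for full feature rows.
dataset = list(range(1, 21))
train_set, test_set = hold_out_split(dataset, test_fraction=0.3, seed=42)

# The model would be trained on train_set only and evaluated on test_set only.
print(len(train_set), len(test_set))
```

Because the test partition is set aside before training, performance measured on it is not inflated by instances the model has already seen.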

Table: A sample test set with model predictions.
ID  Target  Pred.  Outcome      ID  Target  Pred.  Outcome
 1  spam    ham    FN           11  ham     ham    TN
 2  spam    ham    FN           12  spam    ham    FN
 3  ham     ham    TN           13  ham     ham    TN
 4  spam    spam   TP           14  ham     ham    TN
 5  ham     ham    TN           15  ham     ham    TN
 6  spam    spam   TP           16  ham     ham    TN
 7  ham     ham    TN           17  ham     spam   FP
 8  spam    spam   TP           18  spam    spam   TP
 9  spam    spam   TP           19  ham     ham    TN
10  spam    spam   TP           20  ham     spam   FP

$$\text{misclassification rate} = \frac{\text{number of incorrect predictions}}{\text{total predictions}} \tag{1}$$

$$\text{misclassification rate} = \frac{(2 + 3)}{(6 + 9 + 2 + 3)} = 0.25$$
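
As a check on the arithmetic, the rate can also be computed directly from the target and predicted levels of the 20 instances in the sample test set. The sketch below assumes the two lists are ordered by instance ID; the function name `misclassification_rate` is chosen for illustration.

```python
# Target and predicted levels for the 20 test instances, ordered by ID (1 to 20).
targets = ["spam", "spam", "ham", "spam", "ham", "spam", "ham", "spam", "spam", "spam",
           "ham", "spam", "ham", "ham", "ham", "ham", "ham", "spam", "ham", "ham"]
predictions = ["ham", "ham", "ham", "spam", "ham", "spam", "ham", "spam", "spam", "spam",
               "ham", "ham", "ham", "ham", "ham", "ham", "spam", "spam", "ham", "spam"]

def misclassification_rate(targets, predictions):
    """Fraction of predictions that differ from the target level."""
    if len(targets) != len(predictions) or not targets:
        raise ValueError("targets and predictions must be non-empty and equal in length")
    incorrect = sum(t != p for t, p in zip(targets, predictions))
    return incorrect / len(targets)

rate = misclassification_rate(targets, predictions)
print(rate)  # 5 incorrect out of 20 predictions -> 0.25
```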

For binary prediction problems there are 4 possible outcomes:
1 True Positive (TP)
2 True Negative (TN)
3 False Positive (FP)
4 False Negative (FN)
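
The four outcomes follow mechanically from a target level and a predicted level once one level is designated as positive. A small sketch, assuming 'spam' is the positive level (as in the sample test set) and using a hypothetical helper named `outcome`:

```python
def outcome(target, prediction, positive="spam"):
    """Label one prediction as TP, TN, FP or FN relative to the positive level."""
    if prediction == positive:
        # The model predicted positive: correct if the target is positive too.
        return "TP" if target == positive else "FP"
    # The model predicted negative: wrong if the target was actually positive.
    return "FN" if target == positive else "TN"

labels = [
    outcome("spam", "spam"),  # true positive
    outcome("ham", "ham"),    # true negative
    outcome("ham", "spam"),   # false positive
    outcome("spam", "ham"),   # false negative
]
print(labels)
```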

Table: The structure of a confusion matrix.
                          Prediction
                    positive    negative
Target  positive       TP          FN
        negative       FP          TN

Table: A confusion matrix for the set of predictions shown in Table 1.
                        Prediction
                    'spam'    'ham'
Target  'spam'         6         3
        'ham'          2         9
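
The cell counts can be reproduced by tallying (target, prediction) pairs over the 20 test instances. This is an illustrative sketch using the standard-library `collections.Counter`; the variable names are assumptions, and the data is listed in instance-ID order.

```python
from collections import Counter

# Target and predicted levels for test instances 1 to 20, in ID order.
targets = ("spam spam ham spam ham spam ham spam spam spam "
           "ham spam ham ham ham ham ham spam ham ham").split()
predictions = ("ham ham ham spam ham spam ham spam spam spam "
               "ham ham ham ham ham ham spam spam ham spam").split()

# Each key is a (target, prediction) pair; each value is how often it occurs.
matrix = Counter(zip(targets, predictions))

levels = ["spam", "ham"]  # 'spam' is treated as the positive level
print("target \\ prediction", *levels)
for target in levels:
    # One row per target level, one column per predicted level.
    print(target, *(matrix[(target, predicted)] for predicted in levels))
```

Reading the tallies row by row gives TP = 6 and FN = 3 for target 'spam', and FP = 2 and TN = 9 for target 'ham'.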

$$\text{misclassification rate} = \frac{(FP + FN)}{(TP + TN + FP + FN)} \tag{2}$$

$$\text{misclassification rate} = \frac{(2 + 3)}{(6 + 9 + 2 + 3)} = 0.25$$

$$\text{classification accuracy} = \frac{(TP + TN)}{(TP + TN + FP + FN)} \tag{3}$$

$$\text{classification accuracy} = \frac{(6 + 9)}{(6 + 9 + 2 + 3)} = 0.75$$
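
Both measures can be computed from the four confusion matrix counts, and since every prediction is either correct or incorrect they sum to 1. A short sketch under that reading, with function names chosen for illustration:

```python
def classification_accuracy(tp, tn, fp, fn):
    """Proportion of predictions that were correct."""
    return (tp + tn) / (tp + tn + fp + fn)

def misclassification_rate_from_counts(tp, tn, fp, fn):
    """Proportion of predictions that were incorrect."""
    return (fp + fn) / (tp + tn + fp + fn)

# Counts taken from the spam/ham confusion matrix: TP=6, TN=9, FP=2, FN=3.
accuracy = classification_accuracy(tp=6, tn=9, fp=2, fn=3)
error = misclassification_rate_from_counts(tp=6, tn=9, fp=2, fn=3)

print(accuracy, error)  # 15/20 and 5/20
```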

