Home Blog What Is a Confusion Matrix in Machine Learning?

What Is a Confusion Matrix in Machine Learning?

Published: August 18, 2022
Editor at Plat.AI
Editor: Ani Mosinyan
Reviewer at Plat.AI
Reviewer: Alek Kotolyan

A confusion matrix is a performance measurement technique for a machine learning classification algorithm. Data scientists use it to evaluate the performance of a classification model on a set of test data when the actual values are known. For example, classification accuracy can be misleading, especially when two or more classes are in the dataset.

Consequently, calculating the confusion matrix helps data scientists understand the effectiveness of the classification model.

Four Outcomes of Confusion Matrix

The confusion matrix visualizes the accuracy of a classifier by comparing the actual values and the predicted values. In addition, it presents a table layout of the different outcomes of the prediction, such as the table below:

Let’s decipher the confusion table:

  • The target variable has two values–positive or negative
  • The columns represent the actual values of the targeted variable–the known truth
  • The rows correspond to the predicted values of the machine learning algorithm

Now that we have deciphered the confusion table let’s understand each value.

True Positive

  • The predicted value by the model matches the actual value
  • The actual value was positive, and the machine learning model predicted a positive value

True Negative

  • The predicted value by the model matches the actual value
  • The actual value was negative, and the machine learning model predicted a negative value

False Positive

  • The machine learning model made a false prediction
  • The actual value was negative, but the machine learning model predicted a positive value
  • A false positive is also known as the Type 1 error

False Negative

  • The machine learning model made a false prediction 
  • The actual value was positive, but the machine learning model predicted a negative value
  • A false negative is also known as the Type 2 error

Example of Confusion Matrix

Let’s consider the following example to understand the confusion matrix and its values better. Suppose we want to create a model that can predict the number of patients suffering from migraines.

In this case:

  • True positives are the patients that suffer from migraines and that were correctly identified by the algorithm
  • True negatives are the patients that did not suffer from migraines and that were correctly identified by the algorithm
  • False negatives are the patients that suffer from migraines, but the algorithm stated they did not
  • False positives are the patients that do not have migraines, but the algorithm said they did

Let’s consider the following numbers: 

Let’s break the numbers down

  • There were 100 true positives or patients who suffered from migraines that were correctly classified.
  • There were 150 true negatives or patients who did not suffer from migraines correctly classified.
  • However, the algorithm misclassified 30 patients by stating they didn’t have migraines when they did, hence a false negative.
  • The algorithm also misclassified 20 patients by saying that they did have migraines when they didn’t, hence a false positive.

Consequently, the true positives and the true negatives tell us how many times the algorithm correctly classified the samples. On the other hand, the false negatives and false positives indicate how many times the algorithm incorrectly made predictions.

Once we have filled out the confusion matrix, we can perform various calculations for the model to understand its accuracy, error rate, and more.

Calculations Using Confusion Matrix

Once the confusion matrix has determined the number of True Positives (TP), True Negatives (TN), False Negatives (FN), and False Positives (FP), scientists can determine the model’s classification accuracy, error rate, precision, and recall.

Classification Accuracy

Classification accuracy is one of the most critical parameters to determine because it defines how often the model predicts the correct output. The higher the accuracy, the better the model. To calculate the model’s accuracy, consider the following formula:

Accuracy =  TP+TN / TP+FP+FN+TN

Let’s consider the example of predicting patients with migraines. In that instance, the accuracy of the machine learning algorithm is 100+150/100+20+30+150 = 0.83

This means that the machine learning algorithm is 83% accurate in its predictions.

Misclassification rate

Also referred to as the error rate, the misclassification rate defines how often the model makes incorrect predictions. The error rate is calculated through the following formula:

Error rate = FP+FN/TP+FP+FN+TN 

Based on the example above, the miscalculation rate is 20+30/100+20+30+150 = 0.17

Hence, the machine learning algorithm is 17% inaccurate in its predictions. 

Precision

Precision compares the number of correct outputs provided by the model (true positives) to the total number of classified positive samples (true positives and false positives). It is one indicator of the model’s performance and helps scientists measure its ability to classify positive samples.

The model’s precision can be calculated using the following formula:

Precision = TP/TP+FP

In our case, precision equals 100/100+20 = 0.83 

This means that out of all the positive predictions, 83% were true.

Recall

Recall lets data scientists measure the model’s ability to detect positive samples. The higher it is, the more positive samples the model has detected. 

It is calculated as the ratio between the number of positive samples correctly classified (TP) to the total number of positive values.

Recall = TP/TP+FN = 100/100+30 = 0.76

This means that out of all the actual positive cases, only 76% were predicted correctly. 

Difference Between Precision and Recall in Machine Learning

PrecisionRecall
Helps data scientists measure the model’s ability to classify positive samplesHelps data scientists measure the number of positive samples correctly classified by the model
Scientists consider positive and negative samples to calculate the model’s precisionScientists only consider all positive samples to calculate the model’s recall
Scientists consider all positive samples, including those that the model identified incorrectlyScientists only consider the positive samples correctly classified by the model, disregarding negative samples classified as positive (FP)

Sum-up

A confusion matrix allows data scientists to understand how effective their machine learning algorithm is in making correct predictions. It also allows them to understand incorrect predictions and calculate the error rate to improve the machine learning lifecycle.


Lory Seraydarian

Lory Seraydarian

Staff WriterLory Seraydarian is a writer with a background in Journalism. Lory has covered various topics such as politics, healthcare, religion, and arts to fulfill her curious nature. Lory is always up for new adventures that will challenge her and lead her to new discoveries.


Recent Blogs