What is supervised learning?

Authors

Ivan Belcic

Staff writer

Cole Stryker

Staff Editor, AI Models

IBM Think


Supervised learning is a machine learning technique that uses labeled data sets to train artificial intelligence (AI) models to identify the underlying patterns and relationships. The goal of the learning process is to create a model that can predict correct outputs on new real-world data.

Labeled data sets consist of sample data points along with the correct outputs or answers. As input data is fed into the machine learning algorithm, it adjusts its parameters until the model has been fitted appropriately. Labeled training data provides a “ground truth,” explicitly teaching the model to identify the relationships between features and data labels.

Supervised machine learning helps organizations solve various real-world problems at scale, such as classifying spam or predicting stock prices. It can be used to build highly accurate machine learning models.
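The workflow can be sketched in a few lines of Python. The example below is a minimal illustration using scikit-learn; the built-in iris dataset and the choice of logistic regression are assumptions made for demonstration, not part of any particular production setup.

```python
# Minimal supervised learning sketch: fit a model to labeled data,
# then predict on unseen inputs (scikit-learn assumed installed).
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)            # features and ground-truth labels
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = LogisticRegression(max_iter=1000)    # parameters fit to labeled data
model.fit(X_train, y_train)

print(model.predict(X_test[:5]))             # predictions for unseen inputs
print(model.score(X_test, y_test))           # accuracy against held-out labels
```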

What is ground truth data?

Ground truth data is verified against real-world outcomes, often through human annotation or measurement, and is used to train, validate and test models. As the name implies, ground truth data has been confirmed to be true—it is reflective of real-world values and outcomes. Ground truth reflects the ideal outputs for any given input data.

Supervised learning relies on ground truth data to teach a model the relationships between inputs and outputs. The labeled datasets used in supervised learning are ground truth data. Trained models apply their understanding of that data to make predictions based on new, unseen data.


How supervised learning works

Supervised learning techniques use a labeled training dataset to understand the relationships between inputs and output data. Data scientists manually create ground truth training datasets containing input data along with the corresponding labels. Training teaches the model to produce the correct outputs for unseen data in real-world use cases.

During training, the model’s algorithm processes large datasets to explore potential correlations between inputs and outputs. Then, model performance is evaluated with test data to find out whether it was trained successfully. Cross-validation repeats this evaluation across different portions of the dataset, giving a more reliable estimate of performance.
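The sketch below shows this evaluation step using scikit-learn's cross-validation helper; the dataset and the random forest model are illustrative assumptions.

```python
# Sketch of k-fold cross-validation: each fold trains on most of the data
# and tests on the remainder, giving several independent accuracy estimates.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)
model = RandomForestClassifier(random_state=0)

scores = cross_val_score(model, X, y, cv=5)   # 5 train/test splits
print(scores.mean(), scores.std())            # average accuracy and spread
```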

The gradient descent family of algorithms, including stochastic gradient descent (SGD), is the most commonly used set of optimization algorithms, or learning algorithms, for training neural networks and other machine learning models. The model’s optimization algorithm assesses accuracy through the loss function: an equation that measures the discrepancy between the model’s predictions and actual values.

The loss function measures how far off predictions are from actual values. Its gradient indicates the direction in which the model’s parameters should be adjusted to reduce error. Throughout training, the optimization algorithm updates the model’s parameters—its operating rules or “settings”—to optimize the model.
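To make this concrete, here is a toy gradient descent loop written from scratch in NumPy for a one-variable linear model with a mean squared error loss; the synthetic data and learning rate are invented for illustration.

```python
import numpy as np

# Toy gradient descent: minimize the MSE loss L(w, b) = mean((w*x + b - y)^2)
# on synthetic data generated with known parameters w=3.0, b=0.5.
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 100)
y = 3.0 * x + 0.5 + rng.normal(0, 0.1, 100)

w, b, lr = 0.0, 0.0, 0.1
for _ in range(500):
    error = (w * x + b) - y           # discrepancy between prediction and truth
    grad_w = 2 * np.mean(error * x)   # gradient of the loss w.r.t. w
    grad_b = 2 * np.mean(error)       # gradient of the loss w.r.t. b
    w -= lr * grad_w                  # step against the gradient...
    b -= lr * grad_b                  # ...to reduce the loss

print(w, b)   # should approach the true values 3.0 and 0.5
```

Stochastic gradient descent follows the same update rule but estimates the gradients from a random subset of the data at each step.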

Because large datasets typically contain many features, data scientists can simplify this complexity through dimensionality reduction. This data science technique reduces the number of features to those most crucial for predicting data labels, which preserves accuracy while increasing efficiency.
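One common supervised variant of this idea is univariate feature selection, sketched below with scikit-learn; the dataset and the choice to keep 10 features are arbitrary assumptions.

```python
# Sketch of dimensionality reduction by supervised feature selection:
# keep only the features that score highest against the labels.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_breast_cancer(return_X_y=True)    # 30 features per sample
selector = SelectKBest(score_func=f_classif, k=10)
X_reduced = selector.fit_transform(X, y)      # scores each feature against y

print(X.shape, "->", X_reduced.shape)         # (569, 30) -> (569, 10)
```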

An example of supervised learning in action

As an example of supervised learning, consider an image classification model created to recognize images of vehicles and determine which type of vehicle they are. Such a model can power the CAPTCHA tests many websites use to detect spam bots. 

To train this model, data scientists prepare a labeled training dataset containing numerous vehicle examples along with the corresponding vehicle type: car, motorcycle, truck, bicycle and more. The model’s algorithm attempts to identify the patterns in the training data that cause an input—vehicle images—to receive a designated output—vehicle type. 

The model’s guesses are measured against actual data values in a test set to determine whether it has made accurate predictions. If not, the training cycle continues until the model’s performance has reached a satisfactory level of accuracy. The principle of generalization refers to a trained model’s ability to make appropriate predictions on new data from the same distribution as its training data.


Types of supervised learning

Supervised learning tasks can be broadly divided into classification and regression problems:

Classification

Classification in machine learning uses an algorithm to sort data into categories. It recognizes specific entities within the dataset and attempts to determine how those entities should be labeled or defined. Common classification algorithms are linear classifiers, support vector machines (SVM), decision trees, k-nearest neighbor (KNN), logistic regression and random forest.

Neural networks excel at handling complex classification problems. A neural network is a deep learning architecture that processes training data with layers of nodes that mimic the human brain. Each node is made up of inputs, weights, a bias (or threshold) and an output. If an output value exceeds a preset threshold, the node “fires” or activates, passing data to the next layer in the network.
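A single node's computation can be sketched in a few lines; the input values, weights and sigmoid activation below are arbitrary choices for illustration.

```python
import numpy as np

# One artificial neuron: weighted sum of inputs plus a bias, passed
# through an activation function.
inputs = np.array([0.5, -1.2, 3.0])
weights = np.array([0.8, 0.1, -0.4])
bias = 0.2

z = np.dot(inputs, weights) + bias         # weighted sum plus bias
activation = 1.0 / (1.0 + np.exp(-z))      # sigmoid squashes z into (0, 1)
fires = activation > 0.5                   # node "fires" above the threshold
print(z, activation, fires)
```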

Regression

Regression is used to understand the relationship between dependent and independent variables. In regression problems, the output is a continuous value, and models attempt to predict the target output. Regression tasks include projections for sales revenue or financial planning.

Common regression algorithms include linear regression, lasso regression, ridge regression and polynomial regression.
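The sketch below fits three of these on the same synthetic data with scikit-learn; only the regularization penalty differs between the models, and all settings are illustrative.

```python
# Ordinary least squares, ridge (L2 penalty) and lasso (L1 penalty)
# fit to the same synthetic regression problem.
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso, LinearRegression, Ridge

X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)

for model in (LinearRegression(), Ridge(alpha=1.0), Lasso(alpha=1.0)):
    model.fit(X, y)
    print(type(model).__name__, model.coef_.round(2))  # learned coefficients
```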

Ensemble learning

Ensemble learning is a meta-approach to supervised learning in which multiple models are trained on the same classification or regression task. The results of all the models in the pool are aggregated to discover the best overall approach to solving the challenge.

The individual algorithms within the larger ensemble model are known as weak learners or base models. Some weak learners have high bias, while others have high variance. In theory, aggregating them mitigates the bias-variance tradeoff by combining the strengths of each.
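As an illustrative sketch, the ensemble below aggregates three base models by majority vote using scikit-learn's VotingClassifier; the dataset and the choice of base models are arbitrary assumptions.

```python
# Voting ensemble: three base models trained on the same task,
# with final predictions decided by majority vote.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
ensemble = VotingClassifier([
    ("lr", LogisticRegression(max_iter=5000)),
    ("knn", KNeighborsClassifier()),
    ("tree", DecisionTreeClassifier(random_state=0)),
])
print(cross_val_score(ensemble, X, y, cv=5).mean())  # pooled accuracy
```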

Supervised learning algorithms

Optimization algorithms such as gradient descent are used to train a wide range of machine learning algorithms that excel in supervised learning tasks; a comparative sketch follows the list below.

• Naive Bayes: Naive Bayes is a classification algorithm that adopts the principle of class conditional independence from Bayes’ theorem. This means that the presence of one feature does not impact the presence of another in the probability of an outcome, and each predictor has an equal effect on that result.

  Naive Bayes classifiers include multinomial, Bernoulli and Gaussian Naive Bayes. This technique is often used in text classification, spam identification and recommendation systems.

• Linear regression: Linear regression is used to identify the relationship between a continuous dependent variable and one or more independent variables. It is typically used to make predictions about future outcomes.

  Linear regression expresses the relationship between variables as a straight line. When there is one independent variable and one dependent variable, it is known as simple linear regression. As the number of independent variables increases, the technique is referred to as multiple linear regression.

• Nonlinear regression: Sometimes, an output cannot be reproduced from linear inputs. In these cases, outputs must be modeled with a nonlinear function. Nonlinear regression expresses a relationship between variables through a nonlinear, or curved, line. Nonlinear models can handle complex relationships with many parameters.

• Logistic regression: Logistic regression handles categorical dependent variables—cases with binary outputs, such as true or false or positive or negative. While both linear and logistic regression seek to understand relationships between variables, logistic regression mainly solves binary classification problems, such as spam identification.

• Polynomial regression: Like other regression models, polynomial regression models a relationship between variables on a graph. It is a special case of regression in which input features are raised to powers (the degree of the polynomial), allowing otherwise linear models to fit curved, nonlinear patterns.

• Support vector machine (SVM): A support vector machine is used for both data classification and regression, though it usually handles classification problems. SVM separates the classes of data points with a decision boundary, or hyperplane. The goal of the SVM algorithm is to find the hyperplane that maximizes the margin between the groups of data points.

• K-nearest neighbor: K-nearest neighbor (KNN) is a nonparametric algorithm that classifies data points based on their proximity and association to other available data. This algorithm assumes that similar data points can be found near each other when plotted mathematically.

  Its ease of use and low training cost make it efficient for recommendation engines and image recognition. But because every prediction requires comparing against the stored training data, prediction time lengthens as the dataset grows, making KNN less appealing for large classification tasks.

• Random forest: Random forest is a flexible supervised machine learning algorithm used for both classification and regression purposes. The "forest" references a collection of uncorrelated decision trees, which are merged to reduce variance and increase accuracy.
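As promised above, here is a comparative sketch that runs several of these algorithms on one labeled dataset; the wine dataset, the scaling choices and the default hyperparameters are all illustrative assumptions, and scores will vary with the data.

```python
# Compare several supervised classifiers on the same labeled dataset
# using 5-fold cross-validation.
from sklearn.datasets import load_wine
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_wine(return_X_y=True)
models = {
    "Naive Bayes": GaussianNB(),
    "KNN": make_pipeline(StandardScaler(), KNeighborsClassifier()),
    "SVM": make_pipeline(StandardScaler(), SVC()),
    "Random forest": RandomForestClassifier(random_state=0),
}
for name, model in models.items():
    print(name, round(cross_val_score(model, X, y, cv=5).mean(), 3))
```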

Supervised learning versus other learning methods

Supervised learning is not the only learning method for training machine learning models. Other types of machine learning include:

• Unsupervised learning

• Semi-supervised learning

• Self-supervised learning

• Reinforcement learning

Supervised versus unsupervised learning

The difference between supervised learning and unsupervised learning is that unsupervised machine learning uses unlabeled data without any objective ground truth. The model is left to discover patterns and relationships in the data on its own. Many generative AI models are initially trained with unsupervised learning and later with supervised learning to increase domain expertise.

Unsupervised learning can help solve clustering or association problems in which common properties within a dataset are uncertain. Common clustering algorithms are hierarchical clustering, k-means and Gaussian mixture models.
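For contrast with the supervised examples above, the sketch below clusters synthetic data with k-means; no labels are ever shown to the algorithm, and the dataset and cluster count are illustrative assumptions.

```python
# Unsupervised contrast: k-means groups points purely by similarity,
# with no ground-truth labels involved.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)  # labels discarded
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)

print(kmeans.labels_[:10])        # cluster assignments discovered from X alone
print(kmeans.cluster_centers_)    # learned cluster centers
```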

Pros of unsupervised learning

• Exploratory analysis: Unsupervised learning is useful when “what to look for” isn’t known. It can find hidden structures or anomalies in data that humans might not expect.

• No data labeling: Most real-world data is unlabeled, and labeling data takes a lot of time and effort.

• Flexibility: Unsupervised learning models can quickly adapt to new data due to their ability to process data autonomously.

• Scalability: Without the need for ground truth labels, unsupervised learning techniques are easily scalable to massive datasets.

Cons of unsupervised learning

• Imprecise outcomes: Without the foundation of ground truth, it is less immediately clear whether an unsupervised learning model has been trained correctly.

• Sensitivity: Noisy datasets can adversely affect training outcomes. Feature engineering can help normalize datasets for smoother unsupervised learning.

• Reliance on good data: All training needs good data. But without any objective ground truth, bias or other errors in the data can result in models that reinforce those misunderstandings.

Supervised versus semi-supervised learning

Semi-supervised learning involves training a model on a small portion of labeled input data along with a larger portion of unlabeled data. Because it can be time-consuming and costly to rely on domain expertise to label data appropriately for supervised learning, semi-supervised learning can be an appealing alternative.
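The sketch below illustrates this setup with scikit-learn's self-training wrapper; hiding 90% of the digits labels is an arbitrary choice made for demonstration.

```python
# Semi-supervised sketch: unlabeled points are marked with -1, and the
# wrapper pseudo-labels them iteratively from its own confident predictions.
import numpy as np
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.semi_supervised import SelfTrainingClassifier

X, y = load_digits(return_X_y=True)
rng = np.random.default_rng(0)
y_partial = y.copy()
y_partial[rng.random(len(y)) < 0.9] = -1    # hide 90% of the labels

model = SelfTrainingClassifier(LogisticRegression(max_iter=5000))
model.fit(X, y_partial)
print(model.score(X, y))    # evaluated against the full ground truth
```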

Pros of semi-supervised learning

• Less reliant on labeling: Compared to supervised learning, semi-supervised learning requires less labeling, which lowers the barriers to entry for model training.

• Hidden pattern discovery: Like unsupervised learning, semi-supervised learning’s use of unlabeled data can lead to the discovery of patterns, relationships and anomalies that might otherwise go unnoticed.

• More flexible: Semi-supervised learning creates a foundation through ground truth data, then augments that with unlabeled datasets to make models more generalizable.

Cons of semi-supervised learning

• Noise sensitivity: Unlabeled datasets with high degrees of noise can throw off the training results, weakening model performance.

• Bias sensitivity: If unlabeled datasets aren’t screened for implicit bias, those biases can be transferred to the models being trained.

• More complex: Bringing labeled and unlabeled data together in a single training process can involve complex data processing techniques or require more computational resources.

Supervised versus self-supervised learning

Self-supervised learning (SSL) is often described as bridging supervised and unsupervised learning. Rather than use the manually created labels of supervised learning datasets, SSL tasks are configured so that the model can generate its own supervisory signals—implicit or pseudo-labels—and discern ground truth from unstructured data. Then, the model’s loss function uses those labels in place of actual labels to assess model performance.

SSL is often used with transfer learning, a process in which a pretrained model is applied to a downstream task. Self-supervised learning sees widespread use in computer vision and natural language processing (NLP) tasks requiring large datasets that are prohibitively expensive and time-consuming to label.
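A toy illustration of the idea (not a production SSL method): hide one feature of an unlabeled dataset and train a model to reconstruct it from the rest, so the supervisory signal comes from the data itself. The dataset and model below are arbitrary assumptions.

```python
# Toy self-supervised pretext task: the hidden feature serves as a
# pseudo-label generated from the data itself; no human labels are used.
from sklearn.datasets import load_diabetes
from sklearn.linear_model import Ridge

X, _ = load_diabetes(return_X_y=True)   # real labels are ignored entirely
pretext_target = X[:, 0]                # the hidden feature is the pseudo-label
pretext_input = X[:, 1:]                # remaining features are the input

model = Ridge().fit(pretext_input, pretext_target)
print(model.score(pretext_input, pretext_target))  # R^2 on the pretext task
```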

Pros of self-supervised learning

• Efficiency: Rather than have data scientists label data points, SSL automates the labeling process by transferring the task to the model.

• Scalability: SSL’s lower reliance on manual data labeling lends it well to scaling with larger pools of unlabeled data.

• Low reliance on labeling: In cases where labeled ground truth data is sparse, SSL makes up the shortfall through model-generated understanding.

• Versatility: Self-supervised models learn rich, transferable features that can be fine-tuned for many domain-specific and multimodal tasks.

Cons of self-supervised learning

• Compute-intensive: Processing unlabeled datasets and generating labels takes a lot of computing power.

• Complex: Creating pretext tasks for the initial self-supervised learning phase requires a high degree of expertise.

• Potentially unreliable: Like any learning technique that removes human supervision, the results hinge on the data being free of excess noise, implicit bias and other factors that can negatively affect the model’s understanding.

Supervised versus reinforcement learning

Reinforcement learning trains autonomous agents, such as robots and self-driving cars, to make decisions through environmental interactions. Reinforcement learning does not use labeled data, and it also differs from unsupervised learning in that it teaches by trial and error and reward, not by identifying underlying patterns within datasets.
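The tabular Q-learning sketch below captures the trial-and-error loop on a tiny one-dimensional corridor; the environment, reward and hyperparameters are all invented for illustration.

```python
import numpy as np

# Toy Q-learning: states 0..4 in a corridor, reward only at state 4.
# Actions: 0 = move left, 1 = move right.
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, epsilon = 0.5, 0.9, 0.2
rng = np.random.default_rng(0)

for _ in range(500):                           # episodes of trial and error
    state = 0
    while state != 4:
        if rng.random() < epsilon:             # explore a random action
            action = int(rng.integers(n_actions))
        else:                                  # exploit current estimates
            action = int(np.argmax(Q[state]))
        next_state = max(state - 1, 0) if action == 0 else state + 1
        reward = 1.0 if next_state == 4 else 0.0
        # Nudge the estimate toward reward plus discounted future value.
        Q[state, action] += alpha * (
            reward + gamma * Q[next_state].max() - Q[state, action]
        )
        state = next_state

print(np.argmax(Q, axis=1))   # learned policy favors moving right (action 1)
```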

Pros of reinforcement learning

• Solves complex tasks: The trial-and-error training process can lead a model to figure out how to approach complex strategic challenges.

• Not reliant on labeling: Models learn experientially, not theoretically through matching inputs with outputs.

• Self-correcting: Models hone their own behavior as they get things wrong during training.

• Adaptable: Models can adapt to new information and changing circumstances in which outcomes are not predefined.

Cons of reinforcement learning

• Prone to inconsistent results: Trial-and-error learning can seem haphazard and unpredictable, especially when first beginning training.

• Environmental data needs: Reinforcement learning requires models to learn from the consequences of their actions, which in turn requires large amounts of environmental data. However, agents can also learn in simulated environments.

• Reward hacking: Models can exploit loopholes in the reward algorithm to generate rewards without adequately accomplishing their tasks.

• Task-specific: Reinforcement learning excels in training models for a specific function. Those models can struggle to transfer what they have learned to new tasks.

Real-world supervised learning use cases

Supervised learning models can build and advance business applications, including:

• Image and object recognition: Supervised learning algorithms can be used to locate, isolate and categorize objects in videos or images, making them useful for computer vision and image analysis tasks.

• Predictive analytics: Supervised learning models power predictive analytics systems that provide insights. This allows enterprises to anticipate results based on an output variable and make data-driven decisions, in turn helping business leaders justify their choices or pivot for the benefit of the organization.

  Regression also allows healthcare providers to predict outcomes based on patient criteria and historical data. A predictive model might assess a patient’s risk for a specific disease or condition based on their biological and lifestyle data.

• Customer sentiment analysis: Organizations can extract and classify important pieces of information from large volumes of data—including context, emotion and intent—with minimal human intervention. Sentiment analysis gives a better understanding of customer interactions and can be used to improve brand engagement efforts.

• Customer segmentation: Regression models can predict customer behavior based on various traits and historical trends. Businesses can use predictive models to segment their customer base and create buyer personas to improve marketing efforts and product development.

• Spam detection: Spam detection is another example of supervised learning in action. Using supervised classification algorithms, organizations can train models to recognize patterns or anomalies in new data and sort spam from legitimate correspondence effectively.

• Forecasting: Regression models excel at forecasting based on historical trends, making them suitable for use in the financial industry. Enterprises can also use regression to predict inventory needs, estimate employee salaries and avoid potential supply chain hiccups.

• Recommendation engines: With supervised learning models in play, content providers and online marketplaces can analyze customer choices, preferences and purchases to build recommendation engines that offer tailored recommendations more likely to convert.

Challenges of supervised learning

Although supervised learning can offer businesses advantages such as deep data insights and improved automation, it might not be the best choice for all situations.

• Personnel limitations: Supervised learning models can require certain levels of expertise to structure accurately.

• Human involvement: Supervised learning models are incapable of self-learning. Data scientists must validate the models’ performance output.

• Time requirements: Training datasets are large and must be manually labeled, which makes the supervised learning process time-intensive.

• Inflexibility: Supervised learning models struggle to label data outside the bounds of their training datasets. An unsupervised learning model might be more capable of dealing with new data.

• Bias: Labeled datasets carry a risk of human error and bias, which can lead algorithms to learn incorrect patterns. Bias can arise from imbalanced training datasets, poor annotation practices or historical inequities reflected in the data.

• Overfitting: Supervised learning can sometimes result in overfitting, where a model becomes too closely tailored to its training dataset. High accuracy in training can indicate overfitting as opposed to generally strong performance. Avoiding overfitting requires testing models with data that differs from the training data, as the sketch below illustrates.
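A sketch of the overfitting check described above; the dataset and the unpruned decision tree are arbitrary choices that make the effect easy to see.

```python
# Detecting overfitting: compare accuracy on the training data with
# accuracy on held-out data. A large gap suggests memorization.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = DecisionTreeClassifier(random_state=0)   # unpruned trees overfit easily
model.fit(X_train, y_train)

print("train:", model.score(X_train, y_train))   # typically 1.0
print("test: ", model.score(X_test, y_test))     # noticeably lower
```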
