In supervised learning, feedback is explicit and comes in the form of labeled data. The AI agent is trained using input/output pairs (for example, an image with a corresponding label). After the agent makes predictions, feedback is provided by comparing its output to the correct label (ground truth).
The difference between the predicted and true output (error) is calculated, often using a loss function. This feedback is then used to adjust the model parameters so that the model can improve its predictions over time.
AI agents can use supervised learning to predict which products or services a customer is likely to be interested in, based on their past behavior, purchase history or user preferences.
For example, if building AI agents for e-commerce apps, historical data such as past purchases and ratings as labeled examples could be used to to train a model that predicts the products a customer might want to purchase next, improving customer experiences.
Supervised learning is considered human-in-the-loop (HITL) learning because AI agents integrate human feedback to refine their models, improve decision-making and adapt to new situations.
This method combines automated learning with human expertise, allowing AI to handle complex tasks more effectively while minimizing errors and biases. HITL can also be integrated as a feedback mechanism in other types of learning, but it is only integral to the process of self-supervised learning.