Building NLP models: From out-of-the-box to customization

Background

With the help of machine learning-based natural language processing (NLP) models, your business can speed up processes, make more accurate predictions, and uncover new insights from your existing data. To get the maximum value for your efforts, it is important to choose the right machine learning-based NLP service for your specific use case. At IBM, we often get questions like, “How does a machine learning model know how to read my data?” or “How does a machine learning model know what to look for?” The answer depends on which model you use.

Let’s discuss two different approaches to building machine learning solutions with NLP: out-of-the-box models and custom models. When you need to quickly uncover general insights, you probably want an out-of-the-box model. When your use case contains specific industry or domain data, such as specialized data or jargon, a custom model is appropriate.

Modeling in NLP

Out-of-the-box models

We frequently hear that building solutions for text-based analysis is cumbersome and time-consuming. Spending hours sorting through documents, labeling keywords or sentiment in the text, isn’t the best use of your time. Out-of-the-box or pretrained NLP models automate those tasks for you. Out-of-the-box models provide tremendous advantage when you have limited time or don’t have data to train on. They’re effective for quick proofs of concept and have a high ROI, especially when you need to understand higher level items like categories or concepts within your documents. Prebuilt models are often trained on general purpose data sources and tend not to be tailored to any one industry or domain.

Custom models
When your use case contains specific industry lingo, pretrained models may only meet a portion of your needs. Across different industries, some entities (people, places, events, etc.) will have different meanings, so an out-of-the-box model may not be able to pick up on these nuances depending on the context. For example, the word “premium” can mean the top end of a product range, or it can mean the payment made in exchange for an insurance policy. In Australia, this insurance payment is referred to as “the cover”.

IBM Research is cultivating NLP technologies to give clients a competitive edge. Register here to learn how, live on March 18.

Your custom model can be trained to understand the underlying nuances, meaning, and relationships specific to your use case and industry domain. As above, you can teach the model what the word “premium” means in the context of your business. These models can be built in a few ways, and you don’t have to be a data scientist or a software developer to create them. With code-free tooling, all you need to bring is subject matter expertise. A subject matter expert (SME) will define what information matters in the model, upload documents for training, and then select examples, often called annotations, within those documents.

The notations you create on your documents provide the system with specific examples that serve as the “ground truth” for the model. Whether defining specific entities for extraction from a document or the relationships between them, through training, you’ll make the improvements which ensure the model’s effectiveness.

When should you invest in a custom model?

When combining out-of-the-box NLP models with custom NLP models, you get the best of both worlds. We recommend training your model to understand simple components first, such as locations or organizations. This way, your NLP model homes in on the pieces that matter to your business, to ultimately drive smarter decisions.

Now what?

Learn more about the differences between out-of-the-box NLP models and custom NLP models, or check out Watson Natural Language Understanding and Watson Knowledge Studio to build your very own machine learning-based NLP models.

Was this article helpful?

YesNo

Background

Modeling in NLP

When should you invest in a custom model?

Now what?

Tags

More from Artificial intelligence

AI transforms the IT support experience

Bigger isn’t always better: How hybrid AI pattern enables smaller language models

Chat with watsonx models

IBM Newsletters