Background

With the help of machine learning-based natural language processing (NLP) models, your business can speed up processes, make more accurate predictions, and uncover new insights from your existing data. To get the maximum value for your efforts, it is important to choose the right machine learning-based NLP service for your specific use case. At IBM, we often get questions like, “How does a machine learning model know how to read my data?” or “How does a machine learning model know what to look for?” The answer depends on which model you use.

Let’s discuss two different approaches to building machine learning solutions with NLP: out-of-the-box models and custom models. When you need to quickly uncover general insights, you probably want an out-of-the-box model. When your use case contains specific industry or domain data, such as specialized data or jargon, a custom model is appropriate.

Modeling in NLP

Out-of-the-box models

We frequently hear that building solutions for text-based analysis is cumbersome and time-consuming. Spending hours sorting through documents, labeling keywords or sentiment in the text, isn’t the best use of your time. Out-of-the-box or pretrained NLP models automate those tasks for you. Out-of-the-box models provide tremendous advantage when you have limited time or don’t have data to train on. They’re effective for quick proofs of concept and have a high ROI, especially when you need to understand higher level items like categories or concepts within your documents. Prebuilt models are often trained on general purpose data sources and tend not to be tailored to any one industry or domain.

Custom models
When your use case contains specific industry lingo, pretrained models may only meet a portion of your needs. Across different industries, some entities (people, places, events, etc.) will have different meanings, so an out-of-the-box model may not be able to pick up on these nuances depending on the context. For example, the word “premium” can mean the top end of a product range, or it can mean the payment made in exchange for an insurance policy. In Australia, this insurance payment is referred to as “the cover”.

Your custom model can be trained to understand the underlying nuances, meaning, and relationships specific to your use case and industry domain. As above, you can teach the model what the word “premium” means in the context of your business. These models can be built in a few ways, and you don’t have to be a data scientist or a software developer to create them. With code-free tooling, all you need to bring is subject matter expertise. A subject matter expert (SME) will define what information matters in the model, upload documents for training, and then select examples, often called annotations, within those documents.

The notations you create on your documents provide the system with specific examples that serve as the “ground truth” for the model. Whether defining specific entities for extraction from a document or the relationships between them, through training, you’ll make the improvements which ensure the model’s effectiveness.

When should you invest in a custom model?

When combining out-of-the-box NLP models with custom NLP models, you get the best of both worlds. We recommend training your model to understand simple components first, such as locations or organizations. This way, your NLP model homes in on the pieces that matter to your business, to ultimately drive smarter decisions.

Now what?

Learn more about the differences between out-of-the-box NLP models and custom NLP models, or check out Watson Natural Language Understanding and Watson Knowledge Studio to build your very own machine learning-based NLP models.

Was this article helpful?
YesNo

More from Artificial intelligence

AI transforms the IT support experience

5 min read - We know that understanding clients’ technical issues is paramount for delivering effective support service. Enterprises demand prompt and accurate solutions to their technical issues, requiring support teams to possess deep technical knowledge and communicate action plans clearly. Product-embedded or online support tools, such as virtual assistants, can drive more informed and efficient support interactions with client self-service. About 85% of execs say generative AI will be interacting directly with customers in the next two years. Those who implement self-service search…

Bigger isn’t always better: How hybrid AI pattern enables smaller language models

5 min read - As large language models (LLMs) have entered the common vernacular, people have discovered how to use apps that access them. Modern AI tools can generate, create, summarize, translate, classify and even converse. Tools in the generative AI domain allow us to generate responses to prompts after learning from existing artifacts. One area that has not seen much innovation is at the far edge and on constrained devices. We see some versions of AI apps running locally on mobile devices with…

Chat with watsonx models

3 min read - IBM is excited to offer a 30-day demo, in which you can chat with a solo model to experience working with generative AI in the IBM® watsonx.ai™ studio.   In the watsonx.ai demo, you can access some of our most popular AI models, ask them questions and see how they respond. This gives users a taste of some of the capabilities of large language models (LLMs). AI developers may also use this interface as an introduction to building more advanced…

IBM Newsletters

Get our newsletters and topic updates that deliver the latest thought leadership and insights on emerging trends.
Subscribe now More newsletters