Build your own Custom Language Model to convert unique Speech to Text

Modern-day speech-to-text services are built using a corpus of general, everyday words and pronunciations. While this approach works well for common conversation, it can fall short when it comes to accurately transcribing unique accents, industry-specific words, or uncommon dialects. Most speech recognition services don’t offer tooling to train the system to transcribe these outliers appropriately, so users are left with an unsolvable problem. Watson is the solution.

Now, with the new STT Language Model Customization capability, you can train the Watson Speech-to-Text (STT) service to learn from your input. Create your own custom models that align more closely with your application’s unique requirements, so that unique accents, topics, and words are converted to text more accurately.

Greater accuracy, even with difficult vocabulary

The new STT Language Model Customization capability lets you generate more accurate machine transcriptions, even when the utterances include highly specialized, industry-specific terminology and jargon.

Here are a few examples of applications that can benefit from Language Model Customization:

  • Medical dictation application. Easily create a custom model by feeding it examples of words and phrases the application's users might say, including medical terms and drug names.
  • Call-center transcription application. Use custom models to accurately transcribe company names, product names, and industry-specific words and phrases in customer service interactions.
  • Retail conversation application. Create a custom model to transcribe any product name and turn your retail bot into a more effective sales associate.

How it works

The new STT customization API allows a user to add new words, along with their pronunciations in a “sounds-like” style, to the base vocabulary. The service also accepts corpus files containing sentences from the user’s domain (that is, sentences that a user of the service might say in a particular application). From these files, the customization API automatically extracts any new words and generates pronunciations for them, which the user can then adjust via the API.
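To make the idea concrete, here is a minimal local sketch of the two steps described above: finding words in a corpus that are missing from a base vocabulary, and pairing them with optional "sounds-like" pronunciations. It does not call the Watson service. The tiny `BASE_VOCABULARY`, the helper names, and the payload field names `word` and `sounds_like` are assumptions made for illustration, so check the service's API reference for the actual request format.

```python
import json
import re

# Hypothetical stand-in for the service's base vocabulary (the real one is far larger).
BASE_VOCABULARY = {
    "the", "patient", "was", "given", "of", "for", "a", "and",
    "milligrams", "twice", "daily", "prescribed",
}


def extract_new_words(corpus_text, base_vocabulary):
    """Return corpus words missing from the base vocabulary, in first-seen order."""
    new_words = []
    # Tokens start with a letter, so bare numbers such as "20" are skipped.
    for token in re.findall(r"[A-Za-z][A-Za-z'-]*", corpus_text):
        lowered = token.lower()
        if lowered not in base_vocabulary and lowered not in new_words:
            new_words.append(lowered)
    return new_words


def build_words_payload(new_words, sounds_like_overrides):
    """Build a JSON-style payload; the field names are assumed, not confirmed."""
    entries = []
    for word in new_words:
        entry = {"word": word}
        # Only attach a pronunciation when the user supplied one.
        if word in sounds_like_overrides:
            entry["sounds_like"] = sounds_like_overrides[word]
        entries.append(entry)
    return {"words": entries}


corpus = """The patient was given 20 milligrams of atorvastatin.
Metoprolol was prescribed twice daily for the patient."""

new_words = extract_new_words(corpus, BASE_VOCABULARY)
payload = build_words_payload(new_words, {"atorvastatin": ["a tor va stat in"]})
print(json.dumps(payload, indent=2))
```

In this sketch only `atorvastatin` carries a user-supplied pronunciation; `metoprolol` is left for the service to pronounce automatically, mirroring the adjust-if-needed workflow described above.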

Once you input data, the custom model is trained immediately. After training concludes, the custom model is instantly available for use in the STT service.

The STT customization API is a Beta service and is initially available for US English only. Visit the Watson Speech-to-Text page on the Watson Developer Cloud to find out how you can get started using this service.

 


Senior Offering Manager, Watson Speech Services
