Developers + APIs

3 ways to get the most out of the Watson Speech to Text API

Share this post:

Key Points:
–  Watson Speech to Text API converts audio voice into written text so you can add speech transcription capabilities to your applications.
– You can use it to create voice-controlled applications and customize the model to improve accuracy for the languages and content you care about.
– IBM offers a breadth of resources so you can quickly find what’s relevant to your app and business.

Try Watson Speech to Text for free

Watson Speech to Text converts audio voice into written text. Use Speech to Text to transcribe calls in a contact center for example to identify what is being discussed, when to escalate calls, and to understand content from multiple speakers.

To put it simply, you can use the Watson Speech to Text API to add speech transcription capabilities to your applications. The service has three interfaces you can use: a WebSocket interface, an HTTP REST interface, and an asynchronous HTTP interface. They have options to stream the audio or to send it as a single request. The service has a few extra features such as profanity filtering, formatting and word confidence as well.

You can also use Speech to Text to create voice-controlled applications – even customize the model to improve accuracy for the language and content you care about most such as product names, sensitive subjects, or names of individuals.

Like any feature-rich API, you don’t know what you don’t know about how to use it until you do a little digging. When you’re getting started with a new app, the trick is sifting through the information to determine what’s relevant to your app and what’s not. Fortunately, IBM has a lot of resources that can help.

1. See what Watson’s Speech to Text service can do

The Speech to Text service transcribes audio voice files into written text, so you can use it for a number of different types of applications. For example, you can use Speech to Text to transcribe calls or to create voice-controlled applications. You can also customize it to improve accuracy for a specific language or to understand specific content such as names of products or people. These links give you an overview of what the service can do:

2. Learn how Speech to Text works

The Speech to Text service uses machine intelligence to combine information about grammar and language structure with knowledge about the composition of the audio signal to transcribe the human voice accurately. It continuously returns and retroactively updates the transcription as more speech is heard. To get started, you can:

3. Try it out for yourself

Once you start digging into code, you don’t have to go it alone. If you run into questions or have comments, you can share them in the Watson developerWorks forums. For more answers, you can check out these developer communities as well:

  • The Watson forums on Stack Overflow.
  • The Watson forum on dW Answers.

Get started with Speech to Text on Bluemix today.

See how our Watson Speech to Text API works with our free, 30-day Bluemix trial

 

Add Comment
No Comments

Leave a Reply

Your email address will not be published.Required fields are marked *

More Cognitive Enterprise Stories
February 10, 2017

How Staples is making customer service “easy” with Watson Conversation

Ask any office manager about ordering supplies and you’re likely to get a few groans. Overseeing office procedures requires a level of complexity as customer expectations continue to grow. Shoppers today are looking to order exactly what they want, when they want it, no matter the time or place. Office products and services superstore and […]

Continue reading

June 10, 2016

Welcome to the world of A.I.

Artificial Intelligence is all too often associated only with futuristic technologies seen in movies or in the news. Yet what many people don’t realize is that technology disruptions have already been influencing our daily lives for more than a decade! News on Artificial Intelligence and Cognitive Technologies now surrounds us on a daily basis — […]

Continue reading

January 31, 2017

17 Top AI and Machine Learning Conferences for Developers in 2017

Whether you’re interested in cognitive computing, artificial intelligence or machine learning, you probably know that the fourth industrial revolution is well underway and accelerating rapidly. The speed of change presents a challenge to developers who want to stay abreast of the latest ideas and approaches. Conferences, workshops and other meetings provide opportunities to learn where […]

Continue reading