IBM Watson Speech to Text

Overview

IBM Watson® Speech to Text technology enables fast and accurate speech transcription in multiple languages for a variety of use cases, including but not limited to customer self-service, agent assistance and speech analytics. Get started fast with our advanced machine learning models out-of-the-box or customize them for your use case.

More accurate AI

Our best-in-class AI, embedded within Watson Speech to Text, truly understands your customers.

Customizable for your business

Train Watson Speech to Text on your unique domain language and specific audio characteristics.

Protects your data

Enjoy the security of IBM’s world-class data governance practices.

Truly runs anywhere

Built to support global languages and deployable on any cloud — public, private, hybrid, multicloud, or on-premises.

Fine-tuning features

Improve speech recognition accuracy for extracting phrases, words, letters, numbers or lists.

Optimized for customer care

Activate your voice application with speech models tuned for the customer care domain.

Speaker diarization

Recognize who said what in a multi-participant voice exchange.

Interim transcription before final results

Improve application response times by using speech transcription as it is generated and throughout the finalization process.

Use cases

Agent assist
Agent assist
Customer self-service
Customer self-service
Call analytics
Call analytics

Abstract illustration of Woman on mobile phone and man at desk interconnected to sound bars, documents, chat bubble and search icon

Agent assist

Boost agent productivity and success with real time assistance during calls using AI-powered document and intranet search. As the agent is speaking with a customer, Watson listens in on the conversation, transcribes the audio, searches for relevant content within documentation and feeds the answer back to the agent within seconds.

Abstract illustration of two people interconnected to sound, documents, and search icons

Call analytics

Improve call center performance by mining conversation logs to quickly and accurately identify emerging call patterns, customer complaints, sentiment, non-compliant behavior and more.

Agent assist

Call analytics

Improve call center performance by mining conversation logs to quickly and accurately identify emerging call patterns, customer complaints, sentiment, non-compliant behavior and more.

Ways to buy

Lite

500 minutes of free speech recognition a month and 38 pre-trained speech models.

Start for free

Plus

Tune your speech models to improve accuracy in recognition as well as transcription. Plus version includes unlimited minutes per month and 100 concurrent transcriptions.

View details

Premium

Provides large and security-sensitive firms with more capacity and data protection. Premium includes unlimited minutes per month and unlimited concurrent transcriptions.

Partner with IBM

Accelerate your business growth as an Independent Software Vendor (ISV) by innovating with IBM. Partner with us to deliver enhanced commercial solutions embedded with AI to better address clients’ needs.

Build AI-based solutions faster

Accelerate your growth with IBM

Resources

API reference

Technical API specifications for all of your development needs.

Download SDKs

The Watson SDK repository in GitHub.

Data privacy and security

See documentation about our enhanced security features that ensure your data is isolated and encrypted end-to-end, while in transit and at rest.

Build custom speech recognition models within minutes

Learn how to create custom speech models using IBM Watson quickly — without knowing how to code.

How to train your own speech “dragon”

Read about Watson Speech to Text requirements, the methodology and some best practices inspired by actual clients.

Replacing my old IVR system with IBM Watson

Guidelines on how to add a new or existing virtual assistant to your brand-new Watson IVR.

Related products

Watson Text to Speech

Improve customer engagement by interacting with users in their own language using any written text.

watsonx Assistant

Solve customer issues the first time using an AI virtual assistant across any application, device, or channel.

Watson Speech Libraries for Embed

Integrate advanced natural language AI into commercial applications through a containerized library offering enhanced flexibility for IBM partners.

Take the next step

See Watson Speech to Text capabilities in action.

Start your free trial

Explore the demo

More ways to explore

Documentation

Community

Artificial intelligence services