Overview

What is IBM Business Automation Content Analyzer?

IBM Business Automation Content Analyzer is a cloud-based REST API web service, which is designed to work with IBM's Digital Business Automation platform or any non-IBM content or process systems. Content Analyzer helps you rapidly accelerate extraction and classification of data in your documents – no matter what you are using today.

For more information, see Digital Business Automation on Cloud portal.

Language support

Documentation in these following languages can be submitted for processing through APIs and the Ontology interface:

Table 1. Supported document languages
Processing features supported Languages

• Input file types like DOC/DOCX, PDF, TIFF, JPG, and PNG.

• Output file types like PDF and UTF8.

• Full text OCR

Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Spanish, and Swedish

• Output file types like PDF, JSON, and UTF8.

• Document classification

• Section header identification

• Key-Value pair data extraction

• Table header identification

• Segmentation of a document into headers and text between headers

English, French, and Spanish
Table 2. Supported User Interface languages (Training UI)
Supported Languages
User Interface languages (Training UI)

Czech, German, Greek, French, Croatian, Hungary, Italian, Spanish, Dutch, Brazilian Portuguese, Polish, Romanian, Russian, Slovak, Swedish, Turkish, Japanese, Simplified Chinese, Traditional Chinese.