Overview
What is IBM Business Automation Content Analyzer?
IBM Business Automation Content Analyzer is a cloud-based REST API web service, which is designed to work with IBM's Digital Business Automation platform or any non-IBM content or process systems. Content Analyzer helps you rapidly accelerate extraction and classification of data in your documents – no matter what you are using today.
For more information, see Digital Business Automation on Cloud portal.
Language support
Documentation in these following languages can be submitted for processing through APIs and the Ontology interface:
| Processing features supported | Languages |
|---|---|
|
• Input file types like DOC/DOCX, PDF, TIFF, JPG, and PNG. • Output file types like PDF and UTF8. • Full text OCR |
Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Spanish, and Swedish |
|
• Output file types like PDF, JSON, and UTF8. • Document classification • Section header identification • Key-Value pair data extraction • Table header identification • Segmentation of a document into headers and text between headers |
English, French, and Spanish |
| Supported | Languages |
|---|---|
| User Interface languages (Training UI) |
Czech, German, Greek, French, Croatian, Hungary, Italian, Spanish, Dutch, Brazilian Portuguese, Polish, Romanian, Russian, Slovak, Swedish, Turkish, Japanese, Simplified Chinese, Traditional Chinese. |