IBM Document Processing
IBM Document Processing classifies and extracts information from your business documents more quickly, easily and accurately. This document describes how to use IBM Document Processing skills on IBM watsonx Orchestrate.
Using IBM Document Processing skills
To use the IBM Document processing skills, you must first add them to your skill set.
- On the watsonx Orchestrate chat page, click Add skills from the catalog.
- Search and click the IBM Document Processing app tile.
- You can add the skill in two ways:
- Click Add skill + in the skill tile or select the skill tile and click the Add skill button.
- Next, open the watsonx Orchestrate chat.
- On the chat, start the IBM Document Processing skill.
- Click Connect app.
- You are prompted to enter the following connection details:
- Bearer Token: You can specify any dummy data as a placeholder. watsonx Orchestrate internally manages the connection to IBM Document Processing.
- Click Connect app.
To know the available skills that you can use, see the list of prebuilt apps, skills and skill flows.
Technical parameters
The following table contains the technical parameters required by some skills of this app and instructions to retrieve them.
Skill name: Classify a document
Classifies a document into bill_of_lading
, contract
, invoice
, purchase_order
, utility_bill
, or tax_form
.
Input
Field | Type | Definition |
---|---|---|
File content |
File | The document to be classified. The supported formats are:
|
Language |
String | The language in which the document was written. The supported languages are:
|
Output
Field | Type | Definition |
---|---|---|
Class |
String | The document classification, it can be one of:
|
Confidence |
Float | The confidence percentage of the document classification. |
Classifications |
Array | List of all possible document classifications. The document classifications class_name and confidence are displayed for each instance in that list. |
Skill name: Extract invoice fields (KVPs)
Extracts information from invoice documents.
Input
Field | Type | Definition |
---|---|---|
File content |
File | The invoice document from which information is to be extracted. The supported formats are:
|
Language |
String | The language in which the document was written. The supported languages are:
|
Output
Field | Type | Format | Definition |
---|---|---|---|
Bank account number |
String | The bank account number. | |
Bank name |
String | Name of the bank. | |
Bill-to address |
String | The address used to send the invoice. | |
Bill-to name |
String | Name of the person who receives the invoice. | |
Invoice date |
String | Date | The invoice date. |
Invoice number |
Number | The invoice number. | |
Invoice total |
Number | The total invoice amount. | |
Payment due date |
String | Date | The invoice payment due date. |
Payment terms |
String | The invoice payment terms. | |
Purchase order number |
String | The purchase order number. | |
Ship-to address |
String | The address to send the items listed on the invoice. | |
Ship-to name |
String | The address to send the items listed on the invoice. | |
Shipping amount |
Number | The total cost to send the items. | |
Subtotal |
Number | The sum of the items price listed in the invoice. | |
Tax amount |
Number | The tax amount. | |
Tax rate |
Number | The tax rate. | |
Tax type |
String | The tax type. | |
Vendor address |
String | The vendor address. | |
Vendor name |
String | The vendor name. |
Skill name: Extract utility bill fields (KVPs)
Extracts information from utility bills.
Input
Field | Type | Definition |
---|---|---|
File content |
File | The content can be a .pdf , MS Office document (.doc, .docx, .ppt, .pptx) , or an image file (.jpg, .png, .tiff) . |
Language |
String | The language in which the document was written. The supported languages are:
|
Output
Field | Type | Format | Definition |
---|---|---|---|
account_number |
String | Account number details. | |
amount_due |
Number | Amount due details. | |
client_number |
String | Client number details. | |
company_name |
String | Company name details. | |
company_address |
String | Company address details. | |
customer_name |
String | Customer name details. | |
customer_address |
String | Customer address details. | |
due_date |
String | Date | Due date details. |
payment_received |
Number | Payment received details. | |
previous_balance |
Number | Previous balance details. | |
service_address |
String | Service address details. | |
statement_date |
String | Date | Statement date details. |
Skill name: Extract personal identifiable information (PII)
Detects potential personal identifiable information (PII) mentioned in a raw text fragment.
Input
Field | Type | Definition |
---|---|---|
Text |
String | The raw text to extract potential PII entities. |
Language |
String | The language in which the document was written. The supported languages are:
|
Output
Field | Type | Definition |
---|---|---|
Entities |
Array | A list of potentials PII mentions. Each item of the list contains text , type , location , begin , and end . |
Text |
String | The text fragment that is detected as potential PII. |
Type |
String | The type of PII detected. For more information, see PII Entities documentation. |
Location |
Object | The offsets of the PII entity text. It shows the begin and end offset of the text fragment. |
begin |
Integer | The start offset of the text fragment that is detected as potential PII. The first character of the extracted text is at offset 0. |
end |
Integer | The end offset of the text fragment that is detected as potential PII. The last character of the mention is at offset - 1. |
Skill name: Extract text
Extracts the text from a document.
Input
Field | Type | Definition |
---|---|---|
File content |
File | The document to be extracted in text. The supported formats are:
|
Language |
String | The language in which the document was written. The supported languages are:
|
Output
Field | Type | Definition |
---|---|---|
Extracted text |
String | The text extracted from the document. |
Parent topic: