Detailed System Requirements
Abstract
Deep learning object detection is an advanced capability that generalizes the annotations from your training documents and applies them dynamically where possible. If your documents have a fixed format and the fields are always located in the same places, you typically don't need this capability. When deep learning object detection is disabled, IBM Automation Document Processing extracts the fields from the same positions where they were annotated on the page. This approach works well for fixed-format documents such as tax forms. If your documents have a dynamic format or sections of variable length, such as invoices, enabling deep learning object detection might yield better accuracy.
Disabling deep learning object detection improves performance for both document processing and data extraction training.
Content
To disable deep learning object detection, set the following parameter:
ca_configuration:
  ocrextraction:
    deep_learning_object_detection:
      enabled: false
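The nesting of this setting matters, so it can help to see it applied to a configuration held as a nested mapping. The following is a minimal sketch in Python, not an IBM tool: the `set_nested` helper and the `base` dictionary are hypothetical, and only the key names are taken from this document.

```python
from copy import deepcopy


def set_nested(config: dict, path: list, value) -> dict:
    """Return a copy of config with value set at the nested key path."""
    result = deepcopy(config)
    node = result
    for key in path[:-1]:
        # Create intermediate mappings when they do not exist yet.
        node = node.setdefault(key, {})
    node[path[-1]] = value
    return result


# Hypothetical starting fragment that already carries a profile size.
base = {"ca_configuration": {"global": {"deployment_profile_size": "small"}}}

# Key path as shown in this document.
patched = set_nested(
    base,
    ["ca_configuration", "ocrextraction", "deep_learning_object_detection", "enabled"],
    False,
)
print(patched)
```

The helper works on a copy, so the original mapping is left unchanged and existing keys such as `deployment_profile_size` are preserved.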
Small profile recommendations for Document Processing Engine components:
ca_configuration:
  global:
    deployment_profile_size: "small"
Component | CPU Request (m) | CPU Limit (m) | Memory Request (Mi) | Memory Limit (Mi) | Number of Replicas | Pods are licensed for production and nonproduction | Ephemeral Storage Limit (Mi) |
OCR Extraction | 200 | 1000 | 1024 | 2560 | 5 | Yes | 3072 |
Classify Process | 200 | 500 | 400 | 2048 | 1 | Yes | 3072 |
Processing Extraction | 500 | 1000 | 1024 | 3584 | 3 | Yes | 3072 |
Natural Language Extractor | 200 | 500 | 600 | 1440 | 2 | Yes | 3072 |
Postprocessing | 200 | 1000 | 400 | 1229 | 1 | No | 3072 |
Setup | 200 | 1000 | 600 | 2048 | 2 | No | 3072 |
Backend | 200 | 1000 | 400 | 2048 | 2 | No | 4608 |
Webhook | 200 | 300 | 400 | 500 | 1 | No | 1024 |
RabbitMQ | 100 | 1000 | 100 | 1024 | 2 | No | 3072 |
WDU Extraction (technology preview) | 300 | 1000 | 500 | 1024 | 1 | No | 3072 |
WDU Runtime (technology preview) | 200 | 4000 | 1024 | 8192 | 1 | No | 4096 |
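For capacity planning it can be useful to add up what the small profile asks of the cluster. The sketch below multiplies each component's per-pod CPU and memory values by its replica count and sums the results. The figures are copied from the small profile table; the `totals` function and the dictionary layout are illustrative assumptions, and the sums include the two technology preview components, which might not be deployed in your environment.

```python
# Per-pod values from the small profile table:
# (cpu_request_m, cpu_limit_m, mem_request_mi, mem_limit_mi, replicas)
SMALL_PROFILE = {
    "OCR Extraction": (200, 1000, 1024, 2560, 5),
    "Classify Process": (200, 500, 400, 2048, 1),
    "Processing Extraction": (500, 1000, 1024, 3584, 3),
    "Natural Language Extractor": (200, 500, 600, 1440, 2),
    "Postprocessing": (200, 1000, 400, 1229, 1),
    "Setup": (200, 1000, 600, 2048, 2),
    "Backend": (200, 1000, 400, 2048, 2),
    "Webhook": (200, 300, 400, 500, 1),
    "RabbitMQ": (100, 1000, 100, 1024, 2),
    "WDU Extraction (technology preview)": (300, 1000, 500, 1024, 1),
    "WDU Runtime (technology preview)": (200, 4000, 1024, 8192, 1),
}


def totals(profile: dict) -> dict:
    """Sum per-pod values multiplied by replicas across all components."""
    sums = {
        "cpu_request_m": 0,
        "cpu_limit_m": 0,
        "mem_request_mi": 0,
        "mem_limit_mi": 0,
    }
    for cpu_req, cpu_lim, mem_req, mem_lim, replicas in profile.values():
        sums["cpu_request_m"] += cpu_req * replicas
        sums["cpu_limit_m"] += cpu_lim * replicas
        sums["mem_request_mi"] += mem_req * replicas
        sums["mem_limit_mi"] += mem_lim * replicas
    return sums


small_totals = totals(SMALL_PROFILE)
print(small_totals)
```

With the table values as listed, the requests add up to 5000 m of CPU and 14316 Mi of memory across all replicas. The limits add up to 21800 m and 49665 Mi.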
Medium profile recommendations for Document Processing Engine components:
ca_configuration:
  global:
    deployment_profile_size: "medium"
Component | CPU Request (m) | CPU Limit (m) | Memory Request (Mi) | Memory Limit (Mi) | Number of Replicas | Pods are licensed for production and nonproduction |
OCR Extraction | 200 | 1000 | 1024 | 2560 | 8 | Yes |
Classify Process | 200 | 500 | 400 | 2048 | 2 | Yes |
Processing Extraction | 500 | 1000 | 1024 | 3584 | 3 | Yes |
Natural Language Extractor | 200 | 500 | 600 | 1440 | 2 | Yes |
Postprocessing | 200 | 1000 | 400 | 1229 | 2 | No |
Setup | 200 | 1000 | 600 | 2048 | 4 | No |
Backend | 200 | 1000 | 400 | 2048 | 4 | No |
Webhook | 200 | 300 | 400 | 500 | 2 | No |
RabbitMQ | 100 | 1000 | 100 | 1024 | 3 | No |
WDU Extraction (technology preview) | 300 | 1000 | 500 | 1024 | 1 | No |
WDU Runtime (technology preview) | 200 | 4000 | 1024 | 8192 | 1 | No |
Large profile recommendations for Document Processing Engine components:
ca_configuration:
  global:
    deployment_profile_size: "large"
Component | CPU Request (m) | CPU Limit (m) | Memory Request (Mi) | Memory Limit (Mi) | Number of Replicas | Pods are licensed for production and nonproduction |
OCR Extraction | 200 | 1000 | 1024 | 2560 | 13 | Yes |
Classify Process | 200 | 500 | 400 | 2048 | 3 | Yes |
Processing Extraction | 500 | 1000 | 1024 | 3584 | 6 | Yes |
Natural Language Extractor | 200 | 500 | 600 | 1440 | 3 | Yes |
Postprocessing | 200 | 1000 | 400 | 1229 | 2 | No |
Setup | 200 | 1000 | 600 | 2048 | 6 | No |
Backend | 200 | 1000 | 400 | 2048 | 6 | No |
Webhook | 200 | 300 | 400 | 500 | 3 | No |
RabbitMQ | 100 | 1000 | 100 | 1024 | 3 | No |
WDU Extraction (technology preview) | 300 | 1000 | 500 | 1024 | 1 | No |
WDU Runtime (technology preview) | 200 | 4000 | 1024 | 8192 | 1 | No |
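Across the three tables the per-pod CPU and memory values are identical. The profiles differ only in the number of replicas. The sketch below lists the replica counts from the small, medium, and large tables and reports the total pod count per profile and which components gain replicas between two profiles. The `replica_increase` helper and the data layout are illustrative assumptions, not part of the product.

```python
COMPONENTS = [
    "OCR Extraction",
    "Classify Process",
    "Processing Extraction",
    "Natural Language Extractor",
    "Postprocessing",
    "Setup",
    "Backend",
    "Webhook",
    "RabbitMQ",
    "WDU Extraction (technology preview)",
    "WDU Runtime (technology preview)",
]

# Replica counts per profile, in the same order as COMPONENTS.
REPLICAS = {
    "small": [5, 1, 3, 2, 1, 2, 2, 1, 2, 1, 1],
    "medium": [8, 2, 3, 2, 2, 4, 4, 2, 3, 1, 1],
    "large": [13, 3, 6, 3, 2, 6, 6, 3, 3, 1, 1],
}


def replica_increase(lower: str, higher: str) -> dict:
    """Map each component whose replica count changes to the difference."""
    return {
        name: high - low
        for name, low, high in zip(COMPONENTS, REPLICAS[lower], REPLICAS[higher])
        if high != low
    }


# Total number of pods per profile.
pod_counts = {size: sum(counts) for size, counts in REPLICAS.items()}
print(pod_counts)
print(replica_increase("small", "medium"))
print(replica_increase("medium", "large"))
```

With the table values as listed, the profiles come to 21, 32, and 47 pods respectively, with OCR Extraction accounting for the largest share of the growth at each step.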
Document Information
More support for: IBM Cloud Pak for Automation
Component: Operate->ADP Install\Upgrade\Setup
Software version: All Versions
Document number: 7151706
Modified date: 28 June 2024
UID: ibm17151706