PDF document processing
Businesses often process thousands of data files and need to evaluate them for information validation and decision-making. In IBM RPA, you can use the PDF document processing features to, for example, automate invoice processing, scan legal documents for keywords, or extract data from account-based documents and send it to a third-party application for reporting.
In the following sections, learn how to automate document processing by manipulating PDF files:
- Opening, closing, and saving PDF files
  Learn how to open, close, and save PDF files.
- Reading and getting values from PDF files
  Learn how to read and get all text, text from specific regions, and values from fields in scanned PDF files, PDF forms, or PDF files with text-based content.
- Setting values to fields in a PDF Form
  Learn how to set values in editable fields such as input fields, radio buttons, and check boxes with the aid of the Fields Mapper tool within IBM RPA Studio.
- Counting PDF pages
  Learn how to count the pages of a PDF file.
- Splitting a PDF file into multiple files
  Learn how to split a specific page or a range of pages from a PDF file into a new PDF file.
- Converting Office files to PDF
  Learn how to convert Excel spreadsheets, Word files, and PowerPoint (PPT) decks to PDF files.
- Comparing PDF files
  Learn how to check the differences between two PDF files.
- Merging PDF files
  Learn how to combine multiple files into a single PDF file.
- Extracting images from PDF files
  Learn how to extract an entire page as an image, or how to extract a specific region of a page as an image with the Region Selector tool within IBM RPA Studio.
- Encrypting and decrypting PDF files
  Learn how to encrypt, protect, and decrypt PDF files.