Input data requirements
The PDF indexer processes PDF input data.
Multiplatform input data requirements
PostScript data generated by applications must be processed by Acrobat Distiller before you run the PDF indexer.The online documentation provided with Acrobat Distiller describes methods you can use to generate PDF data.
You can use several methods to provide the PDF indexer with access to input data, including FTP and NFS. If you use a file transfer method to copy PDF data to the Content Manager OnDemand server, you must transfer the files in binary format.
MVS.JOBNAME.DATASET.FORM.YYDDD.HHMMSST.PDF
.PDF
file name extension is required to
initiate a load process.Unless you specify otherwise, the ARSLOAD
program uses the FORM
part of the file name to identify the
application group to load. However, you can use the -G
parameter to specify a different part of the file name (MVS™, JOBNAME, or
DATASET) to identify the application group to load.
If the application group contains more than one application, you must identify the application to load; otherwise the load will fail. You can run the ARSLOAD program with the -A parameter to specify the part of the input file name (MVS, JOBNAME, DATASET, or FORM) that identifies the application.
The
case of the identifier PDF
is ignored. Application group and
application names are case sensitive and can include special characters such as the
blank character.
IBM® i input data requirements
The PDF Indexer processes PDF input data. The Content Manager OnDemand directory monitor (started with the Start Monitor (STRMONOND) command with *DIR or *DIR3 specified for the Type parameter) and the Add Report (ADDRPTOND) command are the two most common ways to invoke the PDF Indexer to index and load PDF data into Content Manager OnDemand on IBM i. You can also use the ARSLOAD API.
The PDF Indexer generates the index data and then adds the index information to the database and loads the input data on to the storage media defined for the particular Content Manager OnDemand application group to which the data belongs.
If you plan to automate the data indexing and loading process on the Content Manager OnDemand server, either the input file name, specific parameters on the command used to load the data, or a monitor user exit program must identify the application group and application to load. The PDF file name extension is required to initiate a load process. The case (uppercase or lowercase) of the extension (.pdf) is ignored. Application group and application names are case sensitive. Application group and application names can include special characters such as the blank character when using ADDRPTOND or ARSLOAD with a specific application group and application name provided. However, STRMONOND and ARSLOAD when using the MVS naming convention (-A and -G parameters) do not support archiving PDF files that have spaces in the file name. See the IBM Content Manager OnDemand for i: Common Server Administration Guide for more information about using the STRMONOND and ADDRPTOND commands and the ARSLOAD API to load data into Content Manager OnDemand.