PDF by Paragraph

Converts input data of a user-specified content type to a representation where each paragraph is a page, adding content elements that identify the page and paragraph number of each paragraph.

The input content type is user-specified so that you can identify specific documents for processing by this converter. Internally, however, the converter assumes that this content type is PDF and calls the PDF to HTML converter to do the initial conversion.
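
The paragraph-per-page idea can be illustrated with a minimal Python sketch. This is not the converter's implementation: the names `ParagraphPage` and `split_by_paragraph` are hypothetical, the input is assumed to be plain text per page with paragraphs separated by blank lines (the real converter works from the output of the PDF to HTML converter), and restarting the paragraph count on each page is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class ParagraphPage:
    """One output "page" holding a single paragraph and where it came from."""
    page: int        # 1-based page number in the source document
    paragraph: int   # 1-based paragraph number within that page (assumed)
    text: str


def split_by_paragraph(pages):
    """Return one ParagraphPage per paragraph found in a list of page texts.

    Assumption: paragraphs are separated by blank lines.
    """
    result = []
    for page_no, page_text in enumerate(pages, start=1):
        # Normalize line endings, then split on blank lines.
        blocks = page_text.replace("\r\n", "\n").split("\n\n")
        para_no = 0
        for block in blocks:
            text = " ".join(block.split())  # collapse internal whitespace
            if not text:
                continue  # skip blocks that contain only whitespace
            para_no += 1
            result.append(ParagraphPage(page_no, para_no, text))
    return result
```

Calling `split_by_paragraph` on a two-page input where the first page holds two paragraphs and the second holds one would yield three records, each carrying its own page and paragraph number alongside the paragraph text.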