DocBridge Mill
Document Batch Processing - In any Respect or Direction
DocBridge Mill converts documents, document batches, and spools (print output) of almost any kind under the control of a profile. The batches or documents to be processed can be separated, distributed, classified, indexed, and converted into the formats common in document processing. As a result, the documents can be presented, printed, archived, or further analysed and processed almost anywhere.
Overview
- Automatic separation, splitting, merging, or filtering of input batches and documents (e.g. for COLD application)
- Automatic classifying and indexing
- Additional options for processing the page layout, e.g. automatic rotation of misaligned pages
- Support of many formats as well as converting into one of the supported formats
- Support of several batch types, including many formats of the most widely used document management systems
- Extensive support of AFP resources
- Many control options via a highly configurable profile
- Processing document batches
Restructuring the Source Material
DocBridge Mill can process the structure of the input documents and batches in the following ways:
- Separating spools into single documents
- Splitting/Collecting documents according to predefined criteria (e.g. area code regions)
- Merging documents or several batches into one batch
- Filtering out documents or pages according to predefined criteria (e.g. payment forms)
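The separation and filtering steps listed above can be sketched in a few lines of Python. This is only an illustration of the idea, not DocBridge Mill's profile language or API: the page records, the `separate` and `filter_out` helpers, and the text-based criteria are all hypothetical.

```python
from typing import Callable, Dict, Iterable, List

Page = Dict[str, str]  # hypothetical page record, e.g. {"text": "..."}


def separate(pages: Iterable[Page],
             starts_document: Callable[[Page], bool]) -> List[List[Page]]:
    """Group a flat page sequence (a spool) into documents.

    A new document begins whenever starts_document(page) is true.
    """
    documents: List[List[Page]] = []
    for page in pages:
        if starts_document(page) or not documents:
            documents.append([])
        documents[-1].append(page)
    return documents


def filter_out(documents: List[List[Page]],
               is_unwanted: Callable[[Page], bool]) -> List[List[Page]]:
    """Drop every document that contains at least one unwanted page."""
    return [doc for doc in documents
            if not any(is_unwanted(page) for page in doc)]


# Hypothetical spool: three invoices and one payment form.
spool = [
    {"text": "Invoice 1001 page 1"},
    {"text": "continued"},
    {"text": "Invoice 1002 page 1"},
    {"text": "Payment form"},
    {"text": "Invoice 1003 page 1"},
]

docs = separate(
    spool, lambda p: p["text"].startswith(("Invoice", "Payment form")))
kept = filter_out(docs, lambda p: p["text"].startswith("Payment form"))
```

In the product itself, such criteria would be expressed in the controlling profile rather than in code.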
Classifying and Indexing
The separated documents identified by DocBridge Mill can be classified and indexed. The criteria for separation and for assigning document attributes can be defined very flexibly using a powerful formula language in the controlling profile. The formulas that can be defined in this profile for each class can analyze the following variables in particular:
- Attributes available in the data stream like NOPs or TLEs (Tag Logical Elements) in case of AFP (Advanced Function Printing)
- General attributes like page number (e.g. page groups in AFP) or overlay names
- Text elements that can be picked up in a predefined area
- Terms that match definable search criteria (Levenshtein algorithm)
- Values or texts in raster image documents that are extractable by barcode or character recognition
The data gained in this way can be stored in separate files, sorted as specified in the control profile.
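To illustrate the kind of tolerant term matching that the Levenshtein algorithm makes possible, here is a minimal Python sketch. It shows the standard edit-distance computation only; the `find_terms` helper, its `max_distance` parameter, and the sample words are assumptions for illustration and do not reflect how DocBridge Mill's profile expresses search criteria.

```python
from typing import Iterable, List


def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, and substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def find_terms(words: Iterable[str], term: str,
               max_distance: int = 1) -> List[str]:
    """Return the words within max_distance edits of term (case-insensitive)."""
    return [w for w in words
            if levenshtein(w.lower(), term.lower()) <= max_distance]


# Hypothetical words picked up from a page, including recognition errors.
matches = find_terms(["Invoice", "Invoce", "Involce", "Delivery"], "invoice")
```

A small distance threshold like this lets a search term still match text that contains minor recognition or typing errors.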
Changing Page Information
DocBridge Mill can be configured with a set of additional options that change page setup information, such as:
- Recognition of the main text alignment and automatic rotation of misaligned pages
- Changing the page size and shifting the page contents
- Removing text
- Adding text and OMR marks
Converting the Document Format
A core function of DocBridge Mill is the option to export the processed documents or batches in another format. On the input and output sides, the following formats and format specialties can be processed (for detailed specifications see the data sheet of DocBridge Mill):
Mixed Object Formats:
- IBM AFP
- ASCII-/EBCDIC-Linemode
- IBM MO:DCA (Mixed Object:Document Content Architecture)
- HP PCL (Printer Control Language)
- PDF (Portable Document Format) from Adobe
- GOF (Generic Output Format) from SAP (only as input format) with its subformats OTF (Output Text Format) and ALF (ABAP List Format)
Application-related formats such as Microsoft Office formats can be transformed as input via a raster printer driver or Adobe Acrobat.
Raster Formats:
BMP, GIF, IOCA, JPEG, PCX, PNG, TGA, and TIFF
For the export in these raster image formats the following options are available:
- Rasterizing the pages with a scale-to-gray function
- Reducing color images to monochrome images
Supported Batch Types and their Conversion
DocBridge Mill can process document batches that are organized by attributes, such as indexes or data identifying pages that belong together, in the following ways:
- Data stream oriented batch types that organize all pages and attributes in one file:
  - AFP and MO:DCA-P data streams with TLEs and Page Groups generated by IBM ACIF
  - IBM ImagePlus VALIN file (MO:DCA)
- Externally referenced batch types with pages or documents stored in single files, and attributes and references to these files in a separate file organized in one of the following formats:
  - Easy-Archive Import File
  - Fixed record files with configurable record layout
  - FileNET Import File
  - Freely definable attribute file
  - ISIS Web Archive Format
  - IXOS Import File
  - XML file
Support for these batch formats is implemented via specific drivers, which can also be provided in customer-specific form.
Because it manages all relevant batch format information, DocBridge Mill can also convert the import and export format of one manufacturer into the import format of another, and thus transfer the content of one document management system to another.
AFP Resource Management
For the AFP and MO:DCA formats, DocBridge Mill also supports the management of resources that are shared across many documents or batches, which some formats use to save storage:
- Inline resources in an AFP data stream can be extracted, or external resources stored in separate files can be combined into a resource library, e.g. to transfer them in this form to an archiving system.
- Resources can be compared for identity (regardless of possibly differing time stamps), so that a new resource library is generated only if it differs.
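The idea of comparing resources for identity while ignoring time stamps can be sketched as follows. This is a simplified illustration under stated assumptions, not the product's implementation: each resource is modelled as a hypothetical dictionary with `name`, `data`, and `timestamp` keys, and the time stamp is assumed to be stored apart from the content. In real AFP resources, time stamps may sit inside the resource data and would have to be masked before hashing.

```python
import hashlib
from typing import Dict, Iterable


def content_digest(resource: dict) -> str:
    """Hash only the name and content; the 'timestamp' key is ignored."""
    digest = hashlib.sha256()
    digest.update(resource["name"].encode("utf-8"))
    digest.update(b"\0")  # separator between name and content
    digest.update(resource["data"])
    return digest.hexdigest()


def library_fingerprint(resources: Iterable[dict]) -> Dict[str, str]:
    """Map each resource name to its time-stamp-independent digest."""
    return {r["name"]: content_digest(r) for r in resources}


def needs_new_library(old: Iterable[dict], new: Iterable[dict]) -> bool:
    """True only if the two resource sets differ in name or content."""
    return library_fingerprint(old) != library_fingerprint(new)


# Hypothetical resources: same content, different time stamps.
old = [{"name": "F1LOGO", "data": b"\x01\x02", "timestamp": "2020-01-01"}]
same = [{"name": "F1LOGO", "data": b"\x01\x02", "timestamp": "2024-06-30"}]
changed = [{"name": "F1LOGO", "data": b"\x01\x03", "timestamp": "2024-06-30"}]

rebuild_for_same = needs_new_library(old, same)
rebuild_for_changed = needs_new_library(old, changed)
```

Comparing digests rather than file dates avoids regenerating a resource library when only the time stamps have changed.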