Enterprise Document Capture

Content Workspace Module List

Content Workspace Processor uses modules to sequentially process image documents.

 

If specific functionality is not included on the module list, Professional Services or a .Net developer on your team can write a custom module that will seamlessly drop into CWS Processor.  See the SDK Training for more information.

The following modules can be used in various arrangements to form a complete document processing solution...

CWS ModuleDescription

Scan
The Scan Module interfaces with all TWAIN compatible scanners to capture images from paper.  The image can be saved in JPEG, TIFF or PDF format.

Email Importer
The Email Importer module grabs email attachments from a POP3 server.  This module may be used in conjunction with a multi-functional device that can send email to a service email account.

Email Exporter
The Email Exporter module sends email and optional attachments to an SMTP email account.  This module is excellent as a way to inform individuals that a process is complete or that a document needs attention.
 
Directory Watch
The Directory Watch Module motiors a shared file system for new image documents.  This module may be used in conjunction with a multi-functional device that can save files into shared file system.

See demos in the following solutions:  Paper Forms Processing, Data Entry from Image

Image Cleanup
The Image Cleanup Module can deskew and remove noise from an image.

Bar Code Recognition
The Bar Code Recognition Module looks for bar codes that are on the page.  It can be configured to look in a specific location or anywhere on the page.  See this module in the following solution demo:  Paper Forms Processing.

See a demo in the following solution:  Paper Forms Processing

OCR
The Optical Character Recognition Module can be configured in either of the following ways:
 - Full Page OCR recognizes every word on the page and stores the results in a text file
 - Form / Field Based OCR recognizes areas of the page and stores the results in fields

Indexing
The Indexing Module is an extremely versitile indexing application that allows configurable document indexing definitions, multiple types of editable fields, and validation.  This module can be extended in several different ways.

See a demo in the following solution:  Basic Scanning To File

Export to File
Export to File will export the images that have been created with this system to a particular shared file system.  The file path and file name can include fields that have been indexed or derived from bar codes.  There is also an example of how to use exported image documents on a web page.

See a demo in the following solution:  Basic Scanning To File

ODBC Export
ODBC Export will store index information and file information in a database.  This can be used in conjunction with a web service to allow end users to search for images.

Quality Assurance
The Quality Assurance module allows users to quickly view and reject images based on the quality of the image.  The images can be rescanned or an email can be sent back to the original sender using the Email Exporter module.

Print
The print module will automatically print all documents to a specific printer.  This module can be used with the watch module or the Documentum Import module to create a print server.

Export to Documentum
The Export to Documentum module will transfer the documents to documentum.  The following options can be set:
  • Attach a specific lifecycle
  • Start a specific workflow and attach the document to the workflow
  • Place the document into a specific folder.  The folder path can be set from any of the attributes that have been captured through the indexing, ocr, or barcode modules.

See demos in the following solutions:  Paper Forms Processing, Data Entry from Image


Documentum Importer
The Content Workspace Processor uses the Documentum Importer module to receive documents from Documentum.  Once received by the processor, changes to the documents can optionally be saved back into Documentum.
There are two ways that CWS Processor retrieves documents from Documentum:
  • Folder - CWS Processor can take all the documents in a folder, then after they are processed, they are placed in a new folder.
  • DQL Query - CWS Processor can query the docbase to find documents.  This option works well when the above two do not meet requirements, and especially well when moving documents from one docbase to another.

Documentum Workflow
Similar to the Documentum Importer module, the Documentum Workflow module queries Documentum's inbox to make CWS Processor to act as an activity in the workflow.  Automatically or manually, Processor will accept workflow tasks and process the documents accordingly. 

See a demo in the following solution:  Paper Forms Processing, Data Entry from Image

Metrics Capture
CWS Processor captures an audit trail for every module.  The events include:
  • Document Name
  • Module Instance Name
  • Machine Name
  • User Name
  • Start Time
  • End Time
  • Module Specific Comment

The Metrics Capture Module will store this information in a database for future reporting.


Timer
The Timer Module is configured to work with CWS Document Folders on shared drives.  The Timer Module will run at certain times of the day.  It will make documents available for processing in CWS according to specific attributes.  This modules is excellent when paired with the CWS Document Delete Module.

Automatic Annotation
The Automatic Annotation module will affix an annotation to the image.  The annotation can be a fixed value such as "For Internal Use Only", or can come from a variable such as "SOP00140042".  The annotation value can be configured to be at any location on the page and any font size.

Dual Data Entry Compare
Some solutions require two data entry operators to enter information from an image.  The Dual Data Entry Compare Module will compare the results of the two modules and determine if the document requires a third data entry process to resolve the discrepency.

Auto-Number
The Auto-Number Module is used create unique values for attributes.  The value can then be used as an object name in Documentum or a file name as the document is exported to a file system.  This module works within the Operations module.

Operations
The Operations Module is an automated module that can be used to add scripting between other modules.  Functionality includes:
  • Set attributes to specific values
  • Incorporate if-then logic that will determine if other modules are executed 
  • Lookup attributes in a database
  • Calculate mathematical values
  • Call out to custom functions

PDF Converter
The PDF Converter Module will save images to PDF format.  This PDF document can then be exported to a file system or to a content repository.

Image Enhance
Change the current image or create a new rendition of the image with any of the following filters applied: 
  • Change the dpi
  • Change the compression type
  • Change the color depth
  • Scale the image to a specific size
  • Create thumbnails of the image

These enhancements can improve the performance of the system or create a standard.


CWS Document Export
The CWS Document Export module will store all renditions and attributes about the document into folder.  The attributes will be stored in an XML file.  This is efficient when handing documents over to another CWS Processor using the CWS Document Import module.

CWS Document Import
The CWS Document Import module will look for documents and attributes on a file system that have been exported by the CWS Document Export module.  This is a good mechanism for picking up documents that have been saved by another user.

Delete
The CWS Document Delete module will delete the document folder and its contents.  This module is good when paired with the Timer Module to delete documents that have not been updated in X number of days.