Designing Document Templates |
Top Previous Next |
The Document Template Designer window allows you to conduct two types of extractions - statistical and content based. FRPs are used to recognize file types by identifying meta-tags in the document that are typically related to certain file types. For example, a document that contains the meta-tags: Equipment ID, Test Results, Sample Information, and so on, can be recognized as an instrument output file. Thus, during the recognition process, the system will attempt automatically to move the document to the matching file-type. After the FRP is created, a DRP can be created to contain the data extraction method, that is, the DRP is used to convert unstructured information in the file to structured data in the form of XML. After the templates are created, information extracted from the document can be bound to a Unified XML structure which can be read by external applications, such as a LIMS.
The following diagram displays how documents are processed:
|