File recognition is performed by SDMS as follows:
1. | After a document is uploaded, the system searches for specific keywords (meta tags) in the document using lists of meta tags from existing FRPs. |
2. | Meta tags are identified and each is assigned an Importance value. This value indicates how important it is for the document to contain this word in order to be classified as being of a certain type. This set of meta tags compose a newly uploaded document's FRP. |
3. | The system attempts to fit the new set of meta tags to already existing FRPs. If an FRP that includes some or all of the meta tags is identified, a calculation is done to determine the certainty of this recognition. If the certainty level is sufficient, the uploaded document is identified and assigned a file type. Otherwise, the file is assigned the Unrecognized status. |
In the next section, a diagram illustrates how an uploaded document is compared to configured file recognition patterns in order to classify the document according to file type.
|