Data extraction
Extract data for registration in system applications
With a data extraction module *) for PixEdit® Desktop, data is extracted from existing files or in connection with document scanning. Both single-page and multi-page documents, with or without colour, can be processed.
Data model
The work starts with defining a data model. The data model contains the type of information that is to be extracted from the documents and in what order. The data model can be used to extract data from structured documents (forms) and unstructured documents. The data can be, for example, name, agreement number, social security number etc.
Export to XML or CSV
The scanned documents are saved as PDF and the associated data extracts are saved as data files in a standard exchange format (XML or CSV). File name and storage location can be defined in the process. The data extracts can then be used in other systems or imported into Microsoft Excel.
Structured documents (forms)
*) The functionality is available as an add-on/extension module to PixEdit Desktop.