Intelligent Document Data
This function automatically extracts key values contained in documents.
The Intelligent Document Data extraction employs advanced Artificial Intelligence (AI) technology to streamline the extraction of data from documents, especially structured documents (forms), making it a powerful tool for enhancing efficiency and productivity.
It is important, however, to acknowledge the limitations of the AI in certain scenarios to manage expectations effectively. The success of the extraction process relies on the AI's ability to establish clear relationships between keys and values in the form. In instances where this relationship is unclear or ambiguous, the AI may struggle to provide accurate results.
Users should be mindful that the AI is a supportive tool rather than an infallible solution, and that not all forms may be processed with equal precision. The extraction can differ depending on the selected capture method, the scan settings, or other factors.
To ensure the best results, the extracted data should be reviewed for accuracy using the RSI LogicFlow validation feature, especially in cases where form layouts are unconventional or key/value relationships are ambiguous, for example when the value is not in the immediate vicinity of the key.
It is recommended to always enforce the RSI LogicFlow validation when the extracted data accuracy is critical.
To configure the Intelligent Document Data, the expected keyword should be entered. This is the information the AI will search for to extract relevant data, so it is important to match the keyword as it is written on the document. For each extracted data, it is possible to look for various keywords in order to support different document formats. The AI will select the best value found on the document.
It is possible to narrow down the data extraction to specific content within the document with the content filter and the page filter, ensuring you get the right data even if the key appears multiple times.
The data type should be selected and a unique Data ID should be assigned to the extracted information, serving as an identifier for subsequent workflow use and as a label in the validation interface.
Details of settings:
Category | Setting | Description | Options |
---|---|---|---|
Intelligent Document Data | Search Label | Label to search in the document. | Any string. |
Content filter | Allows filtering key-value pairs. |
Can filter using:
If the data type is On/Off (Checkbox), the content filter will be disabled. |
|
Zone filter | To restrict the search to a specific zone. Once this setting is enabled, the screen for setting coordinates will be available. |
Off/On ![]() Choose the zone to be included in the search. It is possible to load an image, portrait or landscape orientation. |
|
Page filter | To restrict the search to a specific page | Must be a number. | |
Data Type | To restrict the search to a specific data type. |
The type can be:
If the data type is On/Off (Checkbox), the content filter will be disabled. * Form Group allows to create a collection of key-value pairs under one single Data ID |
|
Data ID | ID for using the extracted data in the input fields of other functions. You can leave it as the default or change it to an original ID that is easier to identify. | Any string except “%”. |
Auto detect
Allows an administrator to extract from a document a list of Search Labels.
Each search label found will have its own editable Data ID.
These Search Labels will be used by the Intelligent Document Data to extract values contained in documents.
It is possible to create one Data per search Labels, or to group multiple Search Labels into a single Data ID.
Setting | Description | Options |
---|---|---|
Check Box | If selected a new Data will be added to the Intelligent Document Data. | On/Off |
Search Label | Label to search in the document. | Any string. |
Data ID | ID for using the extracted data in the input fields of other functions | Any string except “%”. |
Form Group
A Form Group is a collection of key-value pairs under one single Data ID.
A Form Group only accepts On/Off (Checkbox) as Data type.
When used inside an input field, the Form Group Data ID will be replaced by all the search labels whose values are "on", separated by commas.
All Search Labels will be used by the Intelligent Document Data to extract values contained in documents, before being grouped into one single Data ID.