Skip to main contentThe Service is a cloud-based service for automating document workflows using machine-learning models to extract key information from scanned, photographed or digital documents. The Service is provided as a web application and as a REST API. As part of the processing, the Service uses machine learning models to extract information from unstructured documents.
All predictions made by the machine learning models shall be considered as suggestions, and Licensee is responsible for deciding whether to trust the suggested result of the scanning and use the information from the scanning as suggested.
1. DATA FLOW
The following steps describe the flow of document data when using the Service:
- Licensee posts a document to the Service. A retention policy is used by Cradl AI to determine how long the document can be stored and used for further training. The document is stored securely on Cradl AI’s servers which are hosted on Amazon Web Services in Ireland.
- If the document is processed and extracted information is returned as a JSON response. Feedback posted to the Service for the purpose of improving the machine learning models is stored securely with the document.
- Licensee may at any time delete one or all of the documents posted to the Service by using the API.
- If the document is submitted to one of Cradl AI’s workflow endpoints as part of the processing, the document will become available in the validation interface.
2. CONTINUOUS TRAINING
Documents posted to the Service may be used by Cradl AI for training Cradl AI’s machine learning models. Cradl AI supports two training modes:
Model fine-tuning
The following steps outline the current process for manual fine-tuning.
- Automatic anomaly checks are run on the collected data to rule out data points that can be detrimental to the training process.
- Random checks of the data may be conducted by Cradl AI’s employees to rule out data points that can be detrimental to the training process.
After the pre-processing, the following training routine is executed:
- Cradl AI’s machine learning models are prepared and configured for training by Cradl AI.
- The trained models are evaluated, tested and deployed to production if the new model is significantly better (in terms of statistical significance) than the previous model.
In-context learning
The following steps outline the current process for manual fine-tuning.
- Automatic anomaly checks are run on the collected data to rule out data points that can be detrimental to the training process.
- Random checks of the data may be conducted by Cradl AI’s employees to rule out data points that can be detrimental to the training process.
- Training data is automatically used to improve the Licensee’s predictions, but without altering the underlying machine learning model.
3. FEATURES
A complete description of features and capabilities of the Service is available in Cradl AI’s documentation.