1. DATA FLOW
The following steps describe the flow of document data when using the Service:- Licensee posts a document to the Service. A retention policy is used by Cradl AI to determine how long the document can be stored and used for further training. The document is stored securely on Cradl AI’s servers which are hosted on Amazon Web Services in Ireland.
- If the document is processed and extracted information is returned as a JSON response. Feedback posted to the Service for the purpose of improving the machine learning models is stored securely with the document.
- Licensee may at any time delete one or all of the documents posted to the Service by using the API.
- If the document is submitted to one of Cradl AI’s workflow endpoints as part of the processing, the document will become available in the validation interface.
2. CONTINUOUS TRAINING
Documents posted to the Service may be used by Cradl AI for training Cradl AI’s machine learning models. Cradl AI supports two training modes:Model fine-tuning
The following steps outline the current process for manual fine-tuning.- Automatic anomaly checks are run on the collected data to rule out data points that can be detrimental to the training process.
- Random checks of the data may be conducted by Cradl AI’s employees to rule out data points that can be detrimental to the training process.
- Cradl AI’s machine learning models are prepared and configured for training by Cradl AI.
- The trained models are evaluated, tested and deployed to production if the new model is significantly better (in terms of statistical significance) than the previous model.
In-context learning
The following steps outline the current process for manual fine-tuning.- Automatic anomaly checks are run on the collected data to rule out data points that can be detrimental to the training process.
- Random checks of the data may be conducted by Cradl AI’s employees to rule out data points that can be detrimental to the training process.
- Training data is automatically used to improve the Licensee’s predictions, but without altering the underlying machine learning model.