What are Datasets?
How to start extracting data?
Data is extracted from documents that your end users fill out and submit within a workflow. The data is saved to a report called a Dataset and can then be downloaded in Excel/CSV format, sent directly to third-party data visualization services such as Power BI or Tableau, exported via Zapier, or sent to your database via API.
To start extracting data:
1. Navigate to the Workflows tab under the Configuration section of the Fluix Admin Portal.
2. Open the workflow you’d like to extract data from and click Edit Workflow.
3. Find the final submit rule for the documents (e.g. ‘Upload to folder’) and click ‘+’ to add a parallel submit action, Extract Data.
4. Add a new dataset (or select an existing one) where the data will be extracted to, and click Done.
5. Save changes to the workflow.
From now on, all data entered into fillable PDF forms within this workflow will be saved into the dataset each time a document is submitted. Photos inserted into image fields in your form will also be saved into the dataset, as download links.
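If you work with a downloaded copy of the dataset, the photo links can be picked out of each row. The snippet below is a minimal sketch, assuming a row is available as a dictionary of field names to text values; the function name `photo_links`, the field names and the URL are hypothetical and only illustrate the idea.

```python
from urllib.parse import urlparse


def photo_links(row):
    """Return {field_name: url} for every cell that looks like an http(s) link."""
    links = {}
    for field, value in row.items():
        text = str(value).strip()
        parsed = urlparse(text)
        # Treat a cell as a link only if it has a web scheme and a host.
        if parsed.scheme in ("http", "https") and parsed.netloc:
            links[field] = text
    return links


# Hypothetical row: field names and URL are made up for illustration.
row = {
    "Inspector": "J. Doe",
    "Site Photo": "https://example.com/files/photo-1.jpg",
    "Comments": "No issues found",
}
print(photo_links(row))
```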
Each dataset includes the list of submitted documents, with a column for each field and the data that was filled out in it.
The list of datasets can be found in the Datasets tab of the Data section:
By selecting any dataset, you can Customize it, Rename, Export to Tableau, download as CSV or Excel, Delete it, or set up Notification Rules:
To create a dataset to be used in a workflow, click the New Dataset button in the top right corner.
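Once a dataset has been downloaded as CSV, it can be loaded into a database of your own. The snippet below is a minimal sketch using only the Python standard library, assuming the first CSV row holds the column names. The function name `load_csv_into_sqlite`, the table name and the sample columns are hypothetical, and the actual layout of an export may differ.

```python
import csv
import io
import sqlite3


def load_csv_into_sqlite(csv_file, conn, table="dataset"):
    """Copy rows from a CSV file object into a SQLite table; return the row count."""
    reader = csv.reader(csv_file)
    header = next(reader)  # assumption: the first row contains the field names
    columns = ", ".join('"{}" TEXT'.format(name.replace('"', '""')) for name in header)
    conn.execute('CREATE TABLE IF NOT EXISTS "{}" ({})'.format(table, columns))
    placeholders = ", ".join("?" for _ in header)
    # Skip malformed rows whose length does not match the header.
    rows = [row for row in reader if len(row) == len(header)]
    conn.executemany('INSERT INTO "{}" VALUES ({})'.format(table, placeholders), rows)
    conn.commit()
    return len(rows)


# Stand-in for a downloaded file; for a real export, pass
# open(path, newline="", encoding="utf-8") instead.
sample = io.StringIO(
    "Document,Inspector,Temperature\n"
    "report-1.pdf,J. Doe,71\n"
    "report-2.pdf,A. Lee,85\n"
)
conn = sqlite3.connect(":memory:")
inserted = load_csv_into_sqlite(sample, conn)
print(inserted)
```

All values are stored as text here; convert columns to numbers or dates only after you have checked what your own forms produce.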
Subsets with Filtered Views
By default, the dataset includes all the fields that are present in the submitted documents, and the fields are named exactly as in the PDFs. By creating filtered views (subsets), you can configure which fields to extract and rename them as you prefer.
Here are the steps:
1. Select the dataset and click Customize
2. Create New Subset
3. On the left, you will see the list of documents that have already been collected in the dataset. On the right is the list of fields from the collected documents. By default, all fields are deselected. You can select them all by clicking the ‘Select all fields in all documents’ button in the menu on the right.
4. To select a particular field, simply click on it in the document preview.
5. If the field names are generic (e.g. Text1, text2, etc.), hover over the field to see its name, or hover over its name in the list on the right and it will be ‘linked’ to the corresponding field in the document.
6. Here you can also rename the fields by clicking on the pencil icon.
7. Type in a report name in the top left corner and save changes.
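Conceptually, a filtered view keeps a chosen set of fields and gives them friendlier names. The snippet below is a minimal sketch of that idea applied to rows you have already exported. It is not how Fluix implements subsets, and the function name `apply_subset` and the field names are hypothetical.

```python
def apply_subset(rows, field_map):
    """Keep only the fields listed in field_map and rename them to the mapped names."""
    return [
        {new_name: row.get(old_name, "") for old_name, new_name in field_map.items()}
        for row in rows
    ]


# Hypothetical data: generic PDF field names mapped to readable column names.
rows = [{"Text1": "J. Doe", "Text2": "71", "Text3": "n/a"}]
field_map = {"Text1": "Inspector", "Text2": "Temperature"}
subset = apply_subset(rows, field_map)
print(subset)
```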
Filtered views can be exported, edited or deleted:
Besides extracting data, you can also track deviations from the expected values in your documents and receive email alerts in real time. To do so, create rules for email alerts:
- Select the Dataset and click on Notification Rules
- On the left sidebar, add recipients of the email alerts
- Click on Add Rule to create a rule for email alert
- Add a field name, select the condition (More, Less or Equal to) and specify the value
- Add a message to be emailed
- Save changes
When a submitted value meets a rule’s condition, the corresponding email is automatically sent to the recipients.
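The logic of such a rule can be sketched in a few lines. The example below is an illustration only, assuming values are compared as numbers; the `Rule` class, the `triggered_messages` function and the sample field are hypothetical and do not describe how Fluix evaluates rules internally.

```python
import operator
from dataclasses import dataclass

# The three conditions offered in the rule editor, mapped to comparisons.
CONDITIONS = {"More": operator.gt, "Less": operator.lt, "Equal to": operator.eq}


@dataclass
class Rule:
    field: str
    condition: str
    value: float
    message: str


def triggered_messages(row, rules):
    """Return the messages of all rules whose condition holds for the given row."""
    messages = []
    for rule in rules:
        try:
            number = float(row.get(rule.field))
        except (TypeError, ValueError):
            continue  # skip missing or non-numeric values
        if CONDITIONS[rule.condition](number, rule.value):
            messages.append(rule.message)
    return messages


rules = [Rule("Temperature", "More", 80, "Temperature above limit")]
alerts = triggered_messages({"Temperature": "85"}, rules)
print(alerts)
```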
Access to Datasets
If you do not see some of the options mentioned above, you can request access from your account owner. Permissions are managed in the Roles tab under the Configuration section of the Admin Portal.
With datasets in place, finding bottlenecks and benchmarking your business processes becomes much easier.