Test with a dry run

Before selecting the Normalise button, which will convert your imported dataset into the Global Schema and delete the original data, we recommended first testing with a dry run.

A dry run simulates what will happen when you finalise the normalisation process and will give you a good idea of how well your original data can be converted. Because a dry run works with a sample of rows rather than your full data, it's also considerably faster than normalising the dataset for real.

You can test with a dry run at any time. It may help you to spot and resolve problems with your data, as it may trigger warnings against some rows. Typically, groups of warnings will have a common cause - a mismatch between your data and InfoSum's Global Schema. By resolving warnings, testing with another dry run, and repeating as necessary, you can improve the quality of your normalised data.

How to test with a dry run

To start a dry run, click the button at the bottom right of the window and select the categories to be included in your dry run. You can then watch the progress of your dry run, which may take a few minutes to complete.

Once the dry run is finished,  you'll see a summary of the results in the progress bar at the top of the window. The fuller the progress bar, the better your data maps onto InfoSum's Global Schema and the more comprehensive your reports will be.

In this example, you can clearly see that there is a problem with the Personal Income category - where no row has successfully mapped to the Global Schema. Fixing this problem would considerably increase the overall quality of your reports.

How to suppress warnings from particular categories

Sometimes, you might want to suppress warnings for particular categories. You might do this, for example, if there are issues with your original data which you cannot fix. Being warned about these issues is not useful, and may get in the way of spotting other warnings from elsewhere in your dataset.

To suppress warnings for one or more categories, click the Dry Run button and simply unselect the checkboxes next to the categories you want to suppress. Starting from the next time you do a dry run, you won't see warnings for those categories.


Next up

Once you're happy with how successfully your imported data can be converted to the Global Schema, it's time to normalise and publish your dataset to make it available for queries.