Normalisation

Normalisation (or standardisation) is the process of mapping your original imported data onto InfoSum's Global Schema.

Each dataset must be normalised (and then published) before it can be referenced in queries.

Normalisation addresses the obvious problem that two separate imported datasets are likely to use different schemas. By normalising both datasets to the common Global Schema, they can be used together in statistical analysis or other queries.
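The idea of bringing two differently shaped datasets onto one common schema can be sketched as follows. This is a minimal illustration only: the field names, source column names, and the normalise_row helper are all hypothetical, and they do not represent the actual Global Schema or how the platform performs normalisation.

```python
# Hypothetical common schema; these field names are illustrative only
# and are NOT the real Global Schema.
COMMON_FIELDS = ("email", "age", "gender")

# Hypothetical per-dataset mappings: source column name -> common field name.
RETAILER_MAPPING = {"EmailAddress": "email", "Age": "age", "Sex": "gender"}
PUBLISHER_MAPPING = {"contact_email": "email", "age_years": "age", "gender": "gender"}


def normalise_row(row, mapping):
    """Rename source columns to common fields; unmapped columns are dropped."""
    out = {field: None for field in COMMON_FIELDS}
    for source_col, value in row.items():
        target = mapping.get(source_col)
        if target is not None:
            out[target] = value
    return out


# Two rows from datasets that use different column names.
retailer_row = {"EmailAddress": "a@example.com", "Age": 34, "Sex": "F", "StoreId": 7}
publisher_row = {"contact_email": "b@example.com", "age_years": 51, "gender": "M"}

a = normalise_row(retailer_row, RETAILER_MAPPING)
b = normalise_row(publisher_row, PUBLISHER_MAPPING)

# Both rows now share exactly the same set of fields.
print(sorted(a) == sorted(b))
```

Once both rows expose the same fields, a query written against the common schema can treat the two datasets uniformly.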

Additionally, normalisation plays an important role in ensuring the security of your data. As part of normalisation, direct identifiers (which can be used to identify specific people) are irreversibly converted to anonymised keys. The original data is then permanently deleted. So even if the Bunker that holds your dataset were somehow compromised, the identities of individuals would not be revealed.
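To make the idea of a one-way conversion concrete, here is a minimal sketch that replaces a direct identifier with a keyed hash and discards the original value. It assumes HMAC-SHA256 with a secret key purely for illustration; the actual algorithm used during normalisation is not described here, and the function names, the "key" field, and the secret are hypothetical.

```python
import hashlib
import hmac


def to_anonymised_key(identifier: str, secret: bytes) -> str:
    """Derive a one-way key from an identifier (illustrative HMAC-SHA256)."""
    # Canonicalise first so trivially different spellings give the same key.
    canonical = identifier.strip().lower()
    return hmac.new(secret, canonical.encode("utf-8"), hashlib.sha256).hexdigest()


def normalise_record(record: dict, identifier_field: str, secret: bytes) -> dict:
    """Return a copy of the record with the raw identifier replaced by its key."""
    out = dict(record)
    raw = out.pop(identifier_field)  # the raw identifier is not kept
    out["key"] = to_anonymised_key(raw, secret)
    return out


secret = b"example-secret-not-for-production"  # hypothetical secret
first = normalise_record({"email": " A@Example.com ", "age": 34}, "email", secret)
second = normalise_record({"email": "a@example.com", "age": 51}, "email", secret)

# The same person yields the same key, and no raw email remains in the output.
print(first["key"] == second["key"], "email" in first)
```

Because the same input always produces the same key, records can still be matched on the key without either side holding the raw identifier.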

You will need to perform a series of steps to prepare your data for normalisation. These include assigning categories to your original data columns, and possibly configuring mappings and transformations. You can complete all these tasks using your Bunker's web-based UI.
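These preparation steps are carried out in the UI, but their effect can be pictured in code. The sketch below is only a conceptual model under assumed names: COLUMN_CONFIG, the category labels, the source columns, and both transformation functions are hypothetical and do not reflect the Bunker's real configuration format.

```python
# Hypothetical value mapping: many source spellings -> one standard value.
VALUE_MAPPING = {"m": "male", "male": "male", "f": "female", "female": "female"}


def map_gender(value):
    """Map a raw gender value to a standard one, defaulting to 'unknown'."""
    return VALUE_MAPPING.get(str(value).strip().lower(), "unknown")


def birth_year_to_age(birth_year, reference_year):
    """Approximate age as the difference between two calendar years."""
    return reference_year - int(birth_year)


# Hypothetical configuration: each source column is assigned a category
# and, optionally, a transformation applied to its values.
COLUMN_CONFIG = {
    "Sex": {"category": "gender", "transform": map_gender},
    "YOB": {"category": "age", "transform": lambda v: birth_year_to_age(v, 2024)},
}


def apply_config(row, config):
    """Apply category assignment and transformations; skip unconfigured columns."""
    out = {}
    for column, value in row.items():
        rule = config.get(column)
        if rule is None:
            continue
        out[rule["category"]] = rule["transform"](value)
    return out


result = apply_config({"Sex": " M ", "YOB": "1990", "Notes": "x"}, COLUMN_CONFIG)
print(result)
```

Columns without an assigned category, such as "Notes" in this example, are simply left out of the result.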