Assign columns to categories

The first step in normalising your data is to assign each column to a Global Schema category. This step tells the Platform what each column in your dataset means. 

Before starting, please make sure you have read the definitions of columns, categories, representations and properties.

Some or all of your columns may have been assigned to a category during the import process, particularly if they have commonly used, descriptive names such as Age or Postcode.

Scroll right to find any columns that haven't been categorised. They'll be shaded light blue. If all your columns have been assigned and you're happy with the categories and mappings, you can move on to testing with a dry run.

Any unassigned columns will look like the image below.

  

Assigning a column to a category

To assign a column to a category, click on the Settings button next to the column name, then Assign Category.

 

From here, you can search for a relevant category or scroll through the list, then click Assign. You have now told the Platform the meaning of that column, which will no longer be shaded blue. 

Assigning more than one column to a category

Several columns in your original data can map onto a single category. For example, you might have individual columns for Street, Town and Postcode (or zip code). All of these together would map onto a single category, Address.

To assign more than one column to a category, open up the Categories dialog as before, then select multiple columns. 
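
As a rough illustration of the idea (not the Platform's actual data model), the many-to-one relationship can be pictured as a mapping from each category to the columns assigned to it. The structure and function names below are hypothetical.

```python
# Hypothetical sketch: several source columns assigned to one category.
# This is an illustration only, not the Platform's internal format.
assignments = {
    "Address": ["Street", "Town", "Postcode"],
    "Age": ["Age"],
}


def category_of(column, assignments):
    """Return the category a column is assigned to, or None if unassigned."""
    for category, columns in assignments.items():
        if column in columns:
            return category
    return None


def unassigned(columns, assignments):
    """List the columns that have not been assigned to any category."""
    return [c for c in columns if category_of(c, assignments) is None]
```

In this picture, Street, Town and Postcode all resolve to Address, while a column that appears in no list is the equivalent of one still shaded light blue.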


Categories with properties

For a few categories, you need to configure properties to help your Bunker understand your original schema. For example, the Address category comes with a Postcode property, which tells your Bunker which of your original data columns contains the postal code.

When you select the Address category in the Categories dialog, an additional option appears. Select the appropriate column from the drop-down, then click Assign.
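
One way to think about a property is as a pointer from a named slot on the category to one of your original columns. The sketch below is illustrative only: the class, its fields, and the rule that a property must point at an assigned column are assumptions, not the Platform's API.

```python
from dataclasses import dataclass, field


@dataclass
class CategoryAssignment:
    """Hypothetical record of one category and the columns assigned to it."""
    category: str
    columns: list
    # Maps a property name (e.g. "Postcode") to the original column that holds it.
    properties: dict = field(default_factory=dict)

    def validate(self):
        """Assumed rule: each property must point at one of the assigned columns."""
        missing = [col for col in self.properties.values() if col not in self.columns]
        if missing:
            raise ValueError(f"Property columns not assigned to {self.category}: {missing}")
        return True


# Original data uses "Zip" for the postal code, so the Postcode property points at it.
address = CategoryAssignment(
    category="Address",
    columns=["Street", "Town", "Zip"],
    properties={"Postcode": "Zip"},
)
```

Here the Postcode property plays the role of the drop-down in the Categories dialog: it records which original column holds the postal code.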


Removing columns from a category

If you assign a category to the wrong column, you can edit or remove the assignment.

To do so, open the Categories dialog, then click on the tab labelled with the incorrect category name. In the image below, the age column has been incorrectly assigned to the gender category. Click delete to remove it.

 

Assigning a column to a custom category

If there isn't a relevant category available, you can create a custom category. This gives you the flexibility to use categories beyond what's included in the Global Schema, for example if you have an internal ID or flag that you want to use.

To create a custom category, open the Categories dialog as before, then select the create custom category button. An additional settings area will appear, where you give the custom category a name and specify the type of data. Two custom categories in different datasets can only be matched if they have the same name, so this stage may require some coordination with other users.

If the column used for the custom category is an identifier, such as a Customer ID, you will need to select 'is key' for it to be used later on to match keys across datasets.
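
The naming rule can be sketched as follows. This is a minimal illustration with hypothetical names. It assumes an exact, case-sensitive name comparison and that both sides must be flagged as keys for key matching, neither of which is confirmed here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CustomCategory:
    """Hypothetical description of a custom category in one dataset."""
    name: str
    data_type: str
    is_key: bool = False


def can_match(a, b):
    """Custom categories in different datasets match only if their names are the same.

    Exact, case-sensitive comparison is an assumption made for this sketch.
    """
    return a.name == b.name


def usable_as_match_key(a, b):
    """Assumed rule: both categories must share a name and be marked 'is key'."""
    return can_match(a, b) and a.is_key and b.is_key


ours = CustomCategory(name="Customer ID", data_type="string", is_key=True)
theirs = CustomCategory(name="Customer ID", data_type="string", is_key=True)
renamed = CustomCategory(name="CustomerId", data_type="string", is_key=True)
```

With these definitions, `ours` and `theirs` can be matched, while `ours` and `renamed` cannot, which is why agreeing on a name with other users in advance matters.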

Next up

Now that all your columns have been assigned to a Global Schema category, any column that is still red may need category mappings set up, or may need the transformation tools.