Data connector for AWS S3
The data connector for S3 enables you to import a dataset directly from an AWS S3 bucket. It can be used to import delimiter-separated value files into the InfoSum Platform, such as data exported from AWS Redshift.
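Before uploading a file to S3, it can help to confirm locally that every row has the same number of delimited fields. The following is a minimal sketch using only Python's standard library. The function name `column_counts` and the sample data are hypothetical and are not part of the InfoSum Platform.

```python
import csv
import io


def column_counts(text, delimiter=","):
    """Return the set of distinct field counts found across all non-empty rows."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    return {len(row) for row in reader if row}


# Hypothetical pipe-delimited sample, standing in for an exported file.
sample = "id|email|postcode\n1|a@example.com|AB1 2CD\n2|b@example.com|EF3 4GH\n"

counts = column_counts(sample, delimiter="|")
# A single value in the set means every row has the same number of columns.
print("consistent" if len(counts) == 1 else "inconsistent", counts)
```

If the set contains more than one value, some rows are probably split incorrectly, for example because of an unquoted delimiter inside a field.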
Before starting, make sure you have the following information to hand:
- Access key ID
- Access key secret
- File name
To configure a connection, log in to Dataset Manager if you haven't already, and either create a dataset or access the Bunker of an existing dataset. Once you're in the Bunker, select Import a dataset or use the Import tab, and locate the S3 connector.
Click Connect and enter your credentials (the access key ID and access key secret).
Next, copy the file name into the Key box and select Download, then Connect.
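In S3 terms, an object's key is its full name within the bucket, including any folder-like prefix. If you only have an `s3://` URI for the file, the sketch below shows one way to split it into a bucket name and a key using Python's standard library. The function name `split_s3_uri` and the example URI are hypothetical, and whether the Key box expects the full key rather than the bare file name is an assumption about the connector, based on how S3 names objects.

```python
from urllib.parse import urlparse


def split_s3_uri(uri):
    """Split an s3://bucket/key URI into (bucket, key)."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    # The path starts with "/", which is not part of the object key.
    return parsed.netloc, parsed.path.lstrip("/")


# Hypothetical URI used purely for illustration.
bucket, key = split_s3_uri("s3://example-bucket/exports/customers.csv")
print(bucket)
print(key)
```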
A subset of the data will then appear as a preview. You can perform some minor manipulations at this point, such as selecting which columns to import, renaming columns and excluding rows.
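To make these three manipulations concrete, here is a rough local equivalent in Python. It is only an illustration of selecting columns, renaming columns and excluding rows, and is not how the Platform implements the preview. The function `preview`, its parameters and the sample data are all hypothetical.

```python
import csv
import io


def preview(text, keep, rename=None, exclude_rows=()):
    """Keep selected columns, rename some of them, and skip rows by index."""
    rename = rename or {}
    reader = csv.DictReader(io.StringIO(text))
    rows = []
    for index, row in enumerate(reader):
        # Row indexes count data rows only; the header is not included.
        if index in exclude_rows:
            continue
        rows.append({rename.get(col, col): row[col] for col in keep})
    return rows


# Hypothetical sample with one malformed data row (index 1).
sample = "id,mail,age\n1,a@example.com,34\n2,bad-row,\n3,c@example.com,51\n"

result = preview(sample, keep=["id", "mail"], rename={"mail": "email"}, exclude_rows={1})
for row in result:
    print(row)
```

Here the `age` column is dropped, `mail` is renamed to `email`, and the second data row is excluded.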
When you're happy with the preview, accept the settings and you'll be taken to the Import Wizard. This will show how our Platform has understood your dataset and mapped columns into our Global Schema.
If this looks correct, accept the Wizard Settings. Otherwise, untick the boxes for any columns that were mapped incorrectly so they can be mapped correctly during the later normalisation phase.
Learn how to normalise the dataset into our Global Schema using data mappings and transformations.