Loading data from a CSV hosted on Amazon S3

On this page, we'll explain how to add a CSV hosted on Amazon S3 as a data source and create a loading job that uses it to synchronize your source with your SlicingDice databases.


Configuring Amazon S3 access

Before loading data from your CSV on Amazon S3, you'll need the following information to connect to your source:
- The Amazon S3 object URL
- Your AWS Secret Key
- Your AWS Access Key
- The region of your Amazon S3 bucket

The following tutorial shows how to retrieve this information, which you'll use when creating your data source.

Retrieving your Amazon S3 object URL and region

  • Log in to your AWS Account and then go to your Amazon S3 User Console.
  • Select an S3 bucket and note in which region this bucket is located. You’ll need that to insert data into SlicingDice later.
  • Retrieve your Amazon S3 object URL
    • Double click on your bucket name and you'll see an Overview page.
    • Select your CSV file to see more information about it. Note that each file stored on Amazon S3 has an associated Object URL.
    • Copy the URL and save it. You'll need this information later.
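
As a rough illustration of how the region and the Object URL relate, the sketch below splits a virtual-hosted-style Object URL (`https://<bucket>.s3.<region>.amazonaws.com/<key>`) into its parts. The function name `split_object_url` and the example bucket and file names are hypothetical, and the code assumes the URL you copied uses this format; older URLs without a region in the host name are rejected rather than guessed at.

```python
import re
from urllib.parse import urlparse

# Host name of a virtual-hosted-style URL: <bucket>.s3.<region>.amazonaws.com
_HOST_RE = re.compile(r"^(?P<bucket>.+)\.s3\.(?P<region>[a-z0-9-]+)\.amazonaws\.com$")


def split_object_url(object_url):
    """Return (bucket, region, key) for a virtual-hosted-style S3 Object URL."""
    parsed = urlparse(object_url)
    match = _HOST_RE.match(parsed.hostname or "")
    if match is None:
        raise ValueError(f"not a virtual-hosted-style S3 URL: {object_url}")
    # The key is returned exactly as it appears in the URL (still URL-encoded).
    key = parsed.path.lstrip("/")
    return match.group("bucket"), match.group("region"), key


# Hypothetical bucket and file, for illustration only.
bucket, region, key = split_object_url(
    "https://example-bucket.s3.us-west-2.amazonaws.com/exports/drinks.csv"
)
print(bucket, region, key)  # example-bucket us-west-2 exports/drinks.csv
```

If the region embedded in the URL differs from the one you noted for the bucket, it is worth re-checking both before continuing.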

Retrieving your AWS Secret Key and Access Key

The AWS Access Key and AWS Secret Key are credentials to access your AWS account. SlicingDice needs these keys in order to access and load your data.

If you don't have an Access Key, or you've lost the Secret Key, you'll need to generate a new one. Follow the steps below to generate a new Access Key.


  • Click on your account name, then click on My Security Credentials.
  • In the My Security Credentials section, click on Create Access Key to create a new access key.
  • You'll be presented with a screen showing your new Access Key and a hidden Secret Access Key. Click on Show secret access key to reveal your secret key. You'll need both keys to allow SlicingDice to load your data.
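
Once you have both keys, you may want to keep them out of notes and scripts while you work. The sketch below is one possible approach, not something SlicingDice requires: it reads the keys from the environment variable names that AWS tooling conventionally uses (`AWS_ACCESS_KEY_ID` and `AWS_SECRET_ACCESS_KEY`) and masks them for display. The helper names `read_aws_keys` and `mask` are hypothetical, and the demo values are the placeholder keys from AWS's own documentation.

```python
import os


def read_aws_keys(env=os.environ):
    """Return (access_key, secret_key) read from the conventional AWS variables."""
    access_key = env.get("AWS_ACCESS_KEY_ID", "").strip()
    secret_key = env.get("AWS_SECRET_ACCESS_KEY", "").strip()
    missing = [
        name
        for name, value in (
            ("AWS_ACCESS_KEY_ID", access_key),
            ("AWS_SECRET_ACCESS_KEY", secret_key),
        )
        if not value
    ]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return access_key, secret_key


def mask(secret):
    """Hide all but the last four characters so the value can be shown safely."""
    return "*" * max(len(secret) - 4, 0) + secret[-4:]


# Demo with a plain dict standing in for the real environment.
demo_env = {
    "AWS_ACCESS_KEY_ID": "AKIAIOSFODNN7EXAMPLE",
    "AWS_SECRET_ACCESS_KEY": "wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY",
}
access_key, secret_key = read_aws_keys(demo_env)
print(mask(access_key), mask(secret_key))
```

Calling `read_aws_keys()` without arguments reads from the real environment and raises a `RuntimeError` naming whichever variable is not set.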

Add a CSV hosted on Amazon S3 as a Data Source

Before adding your CSV on Amazon S3 as a data source on SlicingDice, you need to be logged in to our Control Panel. Then go to the Data Sources page to start the tutorial.


How to add a CSV hosted on Amazon S3 as a Data Source

Before creating your data source, you need the following information to connect to your Amazon S3 bucket:
- The Amazon S3 object URL
- Your AWS Secret Key
- Your AWS Access Key
- The region of your Amazon S3 bucket
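
Before filling in the form, it can help to sanity-check these four values locally. The sketch below is only an illustration: the class name `S3CsvSource`, its `problems` method, and the specific checks (an `https://` URL ending in `.csv`, and a region written as a code such as `us-west-2` rather than a display name) are assumptions made for this example, not rules published by SlicingDice.

```python
import re
from dataclasses import dataclass
from urllib.parse import urlparse

# Matches region codes such as us-east-1 or ap-southeast-2.
_REGION_RE = re.compile(r"^[a-z]{2}(-[a-z]+)+-\d+$")


@dataclass(frozen=True)
class S3CsvSource:
    object_url: str
    access_key: str
    secret_key: str
    region: str

    def problems(self):
        """Return human-readable problems; an empty list means the values look plausible."""
        found = []
        parsed = urlparse(self.object_url)
        if parsed.scheme != "https" or not parsed.hostname:
            found.append("object_url should be a full https:// URL")
        if not parsed.path.lower().endswith(".csv"):
            found.append("object_url should point to a .csv file")
        if not self.access_key or not self.secret_key:
            found.append("both the access key and the secret key are required")
        if not _REGION_RE.match(self.region):
            found.append(f"region {self.region!r} does not look like an AWS region code")
        return found


# Hypothetical values; the keys are the placeholders used in AWS documentation.
source = S3CsvSource(
    object_url="https://example-bucket.s3.us-west-2.amazonaws.com/exports/drinks.csv",
    access_key="AKIAIOSFODNN7EXAMPLE",
    secret_key="wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY",
    region="us-west-2",
)
print(source.problems())  # []
```

These checks only catch obvious slips, such as pasting the region's display name instead of its code. They do not confirm that the keys are valid or that the file exists.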

Now let's start creating your data source by clicking on the Create new data source button in the Data Source section of SlicingDice.


  • Data Source setup

    The first step is to configure how your data source is identified on SlicingDice. The following screen shows step 1.

    [Screenshot: step 1, Data Source setup]

    Three fields will appear. The function of each field is described in the table below.

    Field | Description | Required
    Data Source Name | The name of your data source. Can be edited at any time. | Mandatory
    Data Source Labels/Tags | Labels or tags you can associate with a source to organize your sources. Can be edited at any time. | Optional
    Data Source Description | The description of your data source. Can be edited at any time. | Optional

    When ready, click on the Save & Continue button to go to Step 2.

  • Data Source Details

    Below you can see an example of the information and credentials you should provide so that SlicingDice can connect to your Amazon S3 bucket.

    [Screenshot: step 2, Data Source Details]

    Field | Description
    Data Source Type | The type of data source. In this case we're using CSV (Amazon S3).
    URL | The Amazon S3 Object URL of your CSV file.
    Access Key | Your AWS Access Key.
    Secret Key | Your AWS Secret Key.
    Region | The region where the S3 bucket is located.

    You can test the connection by clicking on the Test Connection button. If everything is configured correctly, you'll see a success message.

    Now you can go to the next step by clicking on Save & Continue.

  • Confirmation

    Here you'll see a summary of the configurations defined for this data source before you finally create it.

    The following image shows an example of a confirmation screen, in which the name of the data source is drinks.

    [Screenshot: confirmation screen for a data source named drinks]

    If everything is correct, click on the Submit button. You'll then receive a message confirming that the data source was created.

    Your new data source will now appear in the data sources list, as you can see in the following image.

    [Screenshot: data sources list showing the new data source]

    That's it! The next step is to load your CSV data from your Amazon S3 bucket into SlicingDice by creating a loading job.

Add a new loading job using CSVs on Amazon S3 sources

The connection to your CSV on Amazon S3 is now fully configured, so the next step is to create and execute a loading job using the CSV (Amazon S3) Data Source you've configured on SlicingDice.

Here are the tutorials for creating each type of loading job. Choose the one that best fits your use case:

  • One-time loading job: The one-time loading job loads your data once and must be started manually. This loading job type is useful if you don't update your data frequently.
  • Manual incremental loading job: The manual incremental loading job loads your data on demand when new data needs to be inserted in SlicingDice. You need to manually start this loading job.
    Unlike a one-time loading job, only new rows are inserted in the database. Your dataset needs to have a timestamp column in order to use this loading job type.
  • Automatic loading job: The automatic loading job loads your data repeatedly at a predetermined time interval. You don't need to manually start this loading job, as it executes automatically.
    Your dataset needs to have a timestamp column in order to use this loading job type.
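
To make the role of the timestamp column more concrete, the sketch below shows the general idea behind an incremental load: keep only the rows whose timestamp is later than the last successful load. This is a conceptual illustration only, not SlicingDice's implementation. The function name `new_rows`, the column name `updated_at`, and the sample rows are all hypothetical.

```python
import csv
import io
from datetime import datetime


def new_rows(csv_text, timestamp_column, last_loaded):
    """Yield rows whose timestamp is strictly later than last_loaded."""
    for row in csv.DictReader(io.StringIO(csv_text)):
        if datetime.fromisoformat(row[timestamp_column]) > last_loaded:
            yield row


# Hypothetical CSV content with an ISO 8601 timestamp column.
sample = """name,price,updated_at
cola,1.50,2024-01-01T09:00:00
lemonade,2.00,2024-01-02T09:00:00
tonic,1.75,2024-01-03T09:00:00
"""

# Pretend the previous run loaded everything up to this moment.
checkpoint = datetime(2024, 1, 1, 12, 0, 0)
fresh = list(new_rows(sample, "updated_at", checkpoint))
print([row["name"] for row in fresh])  # ['lemonade', 'tonic']
```

Without a timestamp column there is nothing to compare against the checkpoint, which is why the incremental and automatic job types depend on one.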