Loading data from a CSV hosted on FTP servers

On this tutorial, you'll learn how to add a CSV hosted on an FTP server as a data source and create a loading job using it to synchronize your source with your SlicingDice's databases.

Have a configured FTP server

The first step is to have an FTP server configured and accessible through the Internet - remember that you need to configure your router to be opened for these connections. Notice that you also may prefer to whitelist SlicingDice's IP addresses to avoid letting your FTP server completely opened on the Internet:

Add a CSV hosted on an FTP server as a Data Source

Before adding your FTP Server data source on SlicingDice, you need to be logged on the Control Panel. Then, you need to access the Data Sources section so we can start our tutorial creating this data source.

How to add a CSV hosted on an FTP server as a Data Source

Before creating your data source you need to have the following information in order to connect to your CSV hosted on your FTP Server.

  • The IP address from your FTP Server (remember that this IP should be exposed on Internet)
  • The port where your FTP server will receive the connection request
  • The name of the CSV that will be loaded

Now let's start creating your data source just clicking on the Create new data source button on SlicingDice's Data Source section

  • Data Source setup

    The first step is the configuration of your data source identification on SlicingDice. The following screen shows step 1.

    Responsive image

    Three fields will appear. Each field function is described at the table below.

    Field Description
    Data Source Name The name of your data source. Can be edited at any time. Mandatory
    Data Source Labels/Tags Labels/tags you might want to associate to a source, in order to organize your sources. Can be edited at any time. Optional
    Data Source Description The description of your data source. Can be edited at any time. Optional

    When ready, click on the Save & Continue button to go to Step 2.

  • Data Source Details

    Below you can see an example of the information and credentials that you should provide so SlicingDice can be able to connect to your SQL Server database.

    Responsive image

    Field Description
    Data Source Type The type of data source. In this case we're using CSV (FTP).
    URI The uri or ip of your FTP server concat with the configured port for FTP (default is 21) and the CSV file path.
    Username Your FTP Server username.
    Password The password for a given username.

    You can test the connection by clicking on the Test Connection button. If everything goes ok you'll see a success message.

    Now you can go to the next step clicking on Save & Continue

  • Confirmation

    Here you'll see a summary of the configurations defined for this data source before you finally create it.

    The following image shows an example of a confirmation screen, which the name of the data source is drinks.

    Responsive image

    If everything is ok, click on the Submit button and then you'll receive a success creation message.

    Now you'll be able to find your new data source at the data sources list as you can see in the following image.

    Responsive image

    That's it! The next step is to load your SQL Server data into SlicingDice creating a loading job.

Add a new loading job

Now all the connection configuration with your CSV (FTP) source is completed, so the next step is just to create and execute a loading job using this CSV (FTP) Data Source you've configured on SlicingDice.

Here are the creation jobs tutorials for each type of loading job. Choose the most helpful for your use case:

  • One-time loading job: The one-time loading job loads your data once, needing manual intervention to execute it. This loading job type is useful if you don't update your data frequently.
  • Manual incremental loading job: The manual incremental loading job loads your data on-demand when new data needs to be inserted in SlicingDice. You need to manually start this loading job.
    Differently from an one-time loading job, only new rows will be inserted in the database. Your dataset needs to have a timestamp column in order to use this loading job type.
  • Automatic loading job: The automatic loading job loads your data frequently, specified by a predetermined time interval. You don't need to manually start this loading job, as it executes automatically.
    Your dataset needs to have a timestamp column in order to use this loading job type.