Loading data from a CSV hosted on FTP servers
On this tutorial, you'll learn how to add a CSV hosted on an FTP server as a data source and create a loading job using it to synchronize your source with your SlicingDice's databases.
Have a configured FTP server
The first step is to have an FTP server configured and accessible through the Internet - remember that you need to configure your router to be opened for these connections. Notice that you also may prefer to whitelist SlicingDice's IP addresses to avoid letting your FTP server completely opened on the Internet:
Add a CSV hosted on an FTP server as a Data Source
Before adding your FTP Server data source on SlicingDice, you need to be logged on the Control Panel. Then, you need to access the Data Sources section so we can start our tutorial creating this data source.
How to add a CSV hosted on an FTP server as a Data Source
Before creating your data source you need to have the following information in order to connect to your CSV hosted on your FTP Server.
- The IP address from your FTP Server (remember that this IP should be exposed on Internet)
- The port where your FTP server will receive the connection request
- The name of the CSV that will be loaded
Now let's start creating your data source just clicking on the Create new data source button on SlicingDice's Data Source section
Data Source setup
The first step is the configuration of your data source identification on SlicingDice. The following screen shows step 1.
Three fields will appear. Each field function is described at the table below.
Field Description Data Source Name The name of your data source. Can be edited at any time. Mandatory Data Source Labels/Tags Labels/tags you might want to associate to a source, in order to organize your sources. Can be edited at any time. Optional Data Source Description The description of your data source. Can be edited at any time. Optional
When ready, click on the Save & Continue button to go to Step 2.
Data Source Details
Below you can see an example of the information and credentials that you should provide so SlicingDice can be able to connect to your SQL Server database.
Field Description Data Source Type The type of data source. In this case we're using CSV (FTP). URI The uri or ip of your FTP server concat with the configured port for FTP (default is 21) and the CSV file path. Username Your FTP Server username. Password The password for a given username.
Notice that the URI from your CSV hosted on a FTP server should follow this example pattern:
You can test the connection by clicking on the Test Connection button. If everything goes ok you'll see a success message.
Now you can go to the next step clicking on Save & Continue
Here you'll see a summary of the configurations defined for this data source before you finally create it.
The following image shows an example of a confirmation screen, which the name of the data source is
If everything is ok, click on the Submit button and then you'll receive a success creation message.
Now you'll be able to find your new data source at the data sources list as you can see in the following image.
That's it! The next step is to load your SQL Server data into SlicingDice creating a loading job.
Add a new loading job
Now all the connection configuration with your CSV (FTP) source is completed, so the next step is just to create and execute a loading job using this CSV (FTP) Data Source you've configured on SlicingDice.
Here are the creation jobs tutorials for each type of loading job. Choose the most helpful for your use case:
Remember that while setting up your loading job, you should set the CSV (FTP) Data Source you've created as the Data Source to be used on this job.
- One-time loading job: The one-time loading job loads your data once, needing manual intervention to execute it. This loading job type is useful if you don't update your data frequently.
- Manual incremental loading job: The manual
incremental loading job loads your data on-demand when new data needs to be inserted in SlicingDice. You
need to manually start this loading job.
Differently from an one-time loading job, only new rows will be inserted in the database. Your dataset needs to have a timestamp column in order to use this loading job type.
- Automatic loading job: The automatic loading
your data frequently, specified by a predetermined time interval. You don't need to manually start this
loading job, as it executes automatically.
Your dataset needs to have a timestamp column in order to use this loading job type.