Connecting data sources
On SlicingDice you can connect external sources to your account, so you can use it to load data into your physical databases or just execute queries and analyses directly over these sources. Here you'll learn how to connect them to SlicingDice using the Data Warehouse Module.
Adding a data source
Setting up a data source will save all the connection information (such as location and credentials) needed for its access so you can use all SlicingDice tools to query, analyse and execute predictions directly on the data stored on this external source. Below we explain how you can add a new data source to SlicingDice through the Data Warehouse Module.
The first step is to configure your data source identification as you can see below.
Here you'll define the name, description, possible labels and the data availability for this source that defines if this data source will be available to be queried directly (Logical Data Sources), to be loaded into SlicingDice physical databases (Data Loading) or both. As the purpose of this guide is to use this source to execute queries and analyses directly over it (Logical Data Sources) you can choose the SlicingDice Databases or Both option. When ready, click on Save & Continue.
Now you'll define the appropriate data source credentials. Each data source type has its own credentials. Common credentials are IP, port, hostname, username, password, database name, URL address, access type, region and so on.
Below is an example of how to add a CSV (HTTP) data source.
You can test the connection by clicking on Test Connection. If everything goes ok, you'll see a Connection Success message and then you can click on Save & Continue.
If you need to connect using a SSH Tunnel, take a look at the Creating a SSH Key connection guide.
As you chose between Logical Data Warehouse or Both option on data availability, a new step called Logical Config will be added to this wizard. Here you should define if this source will be available to all databases you have or for just specific ones. You should also define the tables from your external source that will be available for queries.
External sources behave like common dimension
Notice that if you add a data source choosing the data availability option as Logical Data Warehouse or Both, this source will behave like a common Dimension in your database that can be queried and analyzed the same way as if it was a physical one.
Finally just check your data source configuration and click on Submit. Now you'll be able to find your created data source at the data sources list. That's it!, now you can use this data source to create a loading job that will connect to this source and load your data.
If you need to whitelist the IPs which SlicingDice will use to connect to your source, use the following ones shown below.