Creating connection to R Data Science Engine requires a prerequisite file system-based connection. This connection’s file location helps as a shared location to exchange data between Intellicus and R.
The first step will be to create a file system connection. The second step will be to create connection to Data Science Engine and providing the file connection location created in step 1.
Creating connection to File System
To create this, please follow the below steps.
- Login to Intellicus – Navigate – Administration – Configure – Databases Tab
- Click on Add
- The page will display the following options
Figure 1: Adding a file system-based connection
Connection Properties
Item | Values | Comments |
Connection Name | Provide a name as per your desire | This connection is used as a file-system to exchange data, and this name will be required while connecting to Data Science Engine |
Provider | Select from the drop down | Intellicus stores the temporary processing data as a file, hence you need to select Files/Stream |
Driver Type | Select from the drop down | You need to select network path to specify the location to save your file |
LOCATION | Provide the location where the file will be saved | This location will be used by the Data Science Engine and Intellicus to exchange data while processing the data in Data Science step. |
Make sure you uncheck read only in the settings of this connection as it will be accessed by both Intellicus and Data Science Engine to exchange data.
Note: To get more details to create a file system connection, please refer “WorkingwithDatabaseConnections.pdf”
Once you have given the required details, you can test your connection if it has been successfully created, Save it once you get the message ‘Connection Test Succeeded.’ Cancel if you want to start afresh.
You can Delete a connection once you have saved it or Refresh Schema to let Intellicus refresh the data from the connected database.
Creating connection to Data Science Engine
To create this, please follow the below steps –
- Login to Intellicus – Navigate – Administration – Configure – Databases tab
- Click on Add
- The page will display the following options
Figure 2: Creating a connection to Data Science Engine
Note: To create a connection to Data Science Engine, you must have a Data Science Engine running parallel on this network.
Data Science Engine Connection Properties
Item | Values | Comments | |
Connection Name | Rserver | This property will get affected once you choose Data Science Engine provider from the provider drop down. By default, the connection name will remain Rserver as Intellicus is providing connection to R server in this version. The name will be used as a reference at many places hence it is set by default
Note- you need to select ‘Provider’ to be able to enter this field |
|
Provider | Select DATA SCIENCE ENGINE from the drop down | Since we need to create a connection to Data Science engine, run to the bottom of the list and select DATA SCIENCE ENGINE | |
Platform | Select from the drop down | Specify the Data Science engine you need to connect. For now, RServe is available | |
Server | Type yourself | Provide the server IP address where your Data Science engine is running | |
Port | Type yourself | Provide the port details on which your Data Science engine is running | |
Dump Connection Name | Type yourself | Here you need to type the name of the file-based system connection you formed in the first step | |
Rows To Dump While Verification | Type yourself | Whatever number you enter here, only those many rows from your raw data will be dumped to verify the correctness of the script. By default, the filed remains blank. | |
Connection String | Autogenerated | Connection string to connect to the Data Science engine | |
Mask Connection String | Check/Uncheck | If checked connection string is masked | |
Charset Encoding | Select from list | Leave it blank | |
Pool Settings | Initial Connections | Type yourself | This feature helps you to define how many initial connections Intellicus will form to the engine so as once you start using it, there are no delays and your process carries out smoothly.
Default: 5 |
Incremental Size | Type yourself | Once all the available connections are used, increment size helps to increase the number of connections and forms new connections as per the specified number.
Default: 5 |
|
Resubmit Time | Type yourself | If the connections were increased and in use, resubmit time checks the current number of connections in use. If the connection value goes down to initial connection value, the incremented connections are released
Default: 30 seconds |
|
Max Connections | Type yourself | The value here specifies how many connections can be formed at max. Say your max connections are 30 and all are in use. If any more connection request is raised, it will go to a queue
Default: 30 |
Once you have given the required details, you can Test your connection if it has been successfully created, save it once you get the message ‘Connection Test Succeeded.’ Cancel if you want to start afresh.
You can Delete a connection once you have saved it or Refresh Schema to let Intellicus refresh the data from the connected source.