Installing database drivers¶
Before being able to create SQL-based datasets, you need to install the proper JDBC drivers for the database that you intend to connect to.
Additionally, PostgreSQL script recipe support requires the command-line psql client to be installed. See PostgreSQL support.
Download the driver¶
Data Science Studio comes with bundled drivers for :
- PostgreSQL 9
- EMC Greenplum
- Amazon Redshift
Drivers for other databases must be downloaded from your database vendor.
|HP Vertica||https://my.vertica.com/download-community-edition/||Requires a My Vertica account|
Stop Data Science Studio¶
In this page, DATA_DIR refers to the data directory where you installed Data Science Studio.
On Mac OS X, the DATA_DIR is always: $HOME/Library/DataScienceStudio/dss_home
Installation of JDBC drivers must be done while Data Science Studio is stopped.
Copy the driver¶
Copy the driver’s JAR file (and its dependencies, if any) to the DATA_DIR/lib/jdbc folder
Restart Data Science Studio¶
Data Science Studio supports datasets stored in PostgreSQL 9.
PostgreSQL version 8 is not supported.
PostgreSQL script recipe support additionally requires the command-line psql client to be available in the search PATH of the Studio Linux account.
You should install a command-line client compatible with your version of the server. Depending on your Linux distribution, the appropriate client may be available in a standard OS package named “postgresql-client” (Debian / Ubuntu) or “postgresql” (RedHat / CentOS 7). If that is not the case, you can install the correct client for your server and OS by configuring an extra package repository as described at http://www.postgresql.org/download/ .