Supported data sources in DataStage

Data source connectors provide data connectivity and metadata integration to external data sources, such as relational databases, cloud storage services, or messaging software.

DataStage® supports the following data sources. To establish the connection for the DataStage connector for these data sources, see Connecting to a data source in DataStage.

The "(optimized)" version of a connection gives you increased performance and more features such as before and after SQL statements and reject links. However, you cannot use the "(optimized)" connection outside of the DataStage service. You can use the connections that are available to other tools and services (for example, the connection for Salesforce.com) , if you already created the connection, and you want to reuse it in DataStage.

Connection Corresponding DataStage connector
Amazon RDS for MySQL Amazon RDS for MySQL
Amazon RDS for Oracle Amazon RDS for Oracle
Amazon RDS for PostgreSQL Amazon RDS for PostgreSQL
Amazon Redshift Amazon Redshift
Amazon S3 Amazon S3
Apache Cassandra Apache Cassandra
Apache Cassandra (optimized)* Apache Cassandra (optimized)
Apache Cassandra data source in ODBC* ODBC
Apache HBase* Apache HBase
Connection Corresponding DataStage connector
Apache HDFS Apache HDFS
Apache Hive

Supports source connections only.

Apache Hive

If you select Use DataStage properties, you can use the connector as a target.

Apache Hive data source in ODBC* ODBC
Apache Kafka Apache Kafka
Box Box
Cloudera Impala

Supports source connections only.

Cloudera Impala

If you select Use DataStage properties, you can use the connector as a target.

Dremio Dremio
Dropbox Dropbox
Elasticsearch Elasticsearch
Exasol Exasol
Connection Corresponding DataStage connector
FTP FTP
Generic JDBC

Use the Generic JDBC connection to connect to a data source that does not have a defined connection for Cloud Pak for Data.

Generic JDBC
Generic S3 Generic S3
Google BigQuery Google BigQuery
Google BigQuery data source in ODBC* ODBC
Google Cloud Pub/Sub* Google Cloud Pub/Sub
Google Cloud Storage Google Cloud Storage
Greenplum Greenplum
Greenplum data source in ODBC* ODBC
HTTP

Supports source connections only.

HTTP
Connection Corresponding DataStage connector
IBM Cloud® Databases for MongoDB

Supports source connections only.

IBM Cloud Databases for MongoDB
IBM Cloud Databases for MySQL IBM Cloud Databases for MySQL
IBM Cloud Databases for PostgreSQL IBM Cloud Databases for PostgreSQL
IBM Cloud Object Storage IBM Cloud Object Storage
IBM Cognos Analytics

Supports source connections only.

Cognos Analytics
IBM® Data Virtualization Manager for z/OS® IBM Data Virtualization Manager for z/OS
IBM Db2® IBM Db2
IBM Db2 (optimized)* IBM Db2 (optimized)
IBM Db2 Big SQL IBM Db2 Big SQL
Connection Corresponding DataStage connector
IBM Db2 data source in ODBC* ODBC
IBM Db2 for i IBM Db2 for i
IBM Db2 for z/OS IBM Db2 for z/OS
IBM Db2 on Cloud IBM Db2 on Cloud
Connection Corresponding DataStage connector
IBM Db2 on iSeries (AS400) data source in ODBC* ODBC
IBM Db2 on Linux on System z data source in ODBC* ODBC
IBM Db2 Warehouse IBM Db2 Warehouse
IBM Informix® IBM Informix
IBM Informix data source in ODBC* ODBC
IBM Match 360 Match 360
IBM MQ* IBM MQ
IBM Netezza® Performance Server IBM Netezza Performance Server
Connection Corresponding DataStage connector
IBM Netezza Performance Server (optimized)* IBM Netezza Performance Server (optimized)
IBM Netezza data source in ODBC* ODBC
IBM Watson™ Query Watson Query
Impala data source in ODBC* ODBC
MariaDB MariaDB
Microsoft Azure Blob Storage Microsoft Azure Blob Storage
Microsoft Azure Cosmos DB Microsoft Azure Cosmos DB
Microsoft Azure Data Lake Storage Microsoft Azure Data Lake Storage
Microsoft Azure File Storage Microsoft Azure File Storage
Microsoft Azure SQL Database Microsoft Azure SQL Database
Connection Corresponding DataStage connector
Microsoft SQL Server Microsoft SQL Server
Microsoft SQL Server data source in ODBC* ODBC
MongoDB

Supports source connections only.

MongoDB
MongoDB data source in ODBC* ODBC
MySQL MySQL
MySQL data source in ODBC* ODBC
ODBC*

Multiple data sources are available from the ODBC connection, which is optimized for DataStage.

Oracle Oracle
Oracle (optimized)* Oracle (optimized)
Oracle data source in ODBC* ODBC
PostgreSQL PostgreSQL
Connection Corresponding DataStage connector
PostgreSQL data source in ODBC* ODBC
Presto

Supports source connections only.

Presto
Salesforce.com

Supports source connections only.

Salesforce.com
Salesforce.com (optimized)* Salesforce.com (optimized)
SAP ASE

Supports source connections only.

SAP ASE
SAP ASE data source in ODBC*

Supports source and target connections

ODBC
SAP Bulk Extract*

Supports source connections only.

SAP Bulk Extract
SAP Delta Extract*

Supports source connections only.

SAP Delta Extract
SAP HANA SAP HANA
SAP IDoc* SAP IDoc
SAP IQ

Supports source connections only.

SAP IQ
Connection Corresponding DataStage connector
SAP IQ data source in ODBC* ODBC
SAP OData SAP OData
SingleStoreDB SingleStoreDB
Snowflake Snowflake
Storage volume Storage volume
4.7.1 and later Tableau Tableau
Teradata Teradata
Teradata (optimized)* Teradata (optimized)
Text data source in ODBC* ODBC

* Denotes a connection that is available for DataStage only.