Supported data sources for Data Refinery
Data Refinery supports the following data sources in connections. These connections can be used for Data Refinery jobs with Default Data Refinery XS and Spark & R runtime environments.
For Data Refinery running with a Hadoop environment see Hadoop cluster Data Refinery.
Data Refinery does not support connecting to data sources with Kerberos authentication.
IBM services
- HDFS via Execution Engine for Hadoop
- Hive via Execution Engine for Hadoop (Supports source connections only)
- IBM Cloud Data Engine (Supports source connections only)
- IBM Cloud Databases for MongoDB (Supports source connections only)
- IBM Cloud Object Storage
- IBM Cloudant
- IBM Cognos Analytics (Supports source connections only)
- IBM Data Virtualization (Supports source connections only)
- IBM Data Virtualization Manager for z/OS
- IBM Db2
- IBM Db2 Big SQL
- IBM Db2 for z/OS
- IBM Db2 on Cloud
- IBM Db2 Warehouse
- IBM Match 360
- IBM Planning Analytics (Supports source connections only)
- IBM Product Master
- IBM SPSS Analytic Server (Supports source connections only)
- IBM watsonx.data Presto (Supports source connections only)
- Impala via Execution Engine for Hadoop (Supports source connections only)
- Storage volume
Third-party services
- Amazon RDS for MySQL
- Amazon RDS for Oracle
- Amazon RDS for PostgreSQL
- Amazon Redshift
- Amazon S3
- Apache Cassandra
- Apache Derby
- Apache HDFS
- Apache Hive (Supports source connections only)
- Box
- Apache Impala (Supports source connections only)
- DataStax Enterprise
- Dremio (Supports source connections only)
- Dropbox
- Elasticsearch
- Exasol
- FTP
- Generic S3
- Google BigQuery
- Google Cloud Storage
- Greenplum
- HTTP (Supports source connections only)
- MariaDB
- Microsoft Azure Blob Storage
- Microsoft Azure Cosmos DB
- Microsoft SQL Server
- MongoDB (Supports source connections only)
- OData (Supports source connections only)
- Oracle
- PostgreSQL
- Presto (Supports source connections only)
- SAP HANA
- SAP OData (Supports source connections only)
- SingleStoreDB
- Snowflake
User-defined
- Custom JDBC connector. The name, details, and properties of the connection are defined by the administrator who creates the connector.
- Generic JDBC
Parent topic: Refining data