Supported data sources for SPSS Modeler

In SPSS Modeler, you can connect to your data no matter where it lives.

Connectors

The following table lists the data sources that you can connect to from SPSS Modeler.

For additional details about SQL pushback—including lists of nodes, CLEM expressions, and operators that support SQL pushback—see SQL optimization and its subsections.

For a list of databases that support using custom SQL queries to pull in data, see Data Asset node.

Connector Read Only Read & Write SQL Pushback Notes
Amazon RDS for MySQL   Replace the data set option isn't supported for this connection.
Amazon RDS for PostgreSQL Replace the data set option isn't supported for this connection.
Amazon Redshift  
Amazon S3    
Apache Cassandra    
Apache Derby    
Apache HDFS (formerly known as "Hortonworks HDFS")      
Apache Hive    
Box  
Connector Read Only Read & Write SQL Pushback Notes
Cloud Object Storage  
Cloud Object Storage (infrastructure)  
Cloudant
Cloudera Impala  
Cognos Analytics  
Data Virtualization Manager for z/OS    
Db2    
Db2 Big SQL    
Db2 for i    
Connector Read Only Read & Write SQL Pushback Notes
Db2 for z/OS    
Db2 on Cloud    
Db2 Warehouse    
Dropbox  
Exasol   SQL pushback is supported with a custom image. See Building custom images to install ODBC drivers.
FTP (remote file system transfer)  
Generic JDBC     Use the Generic JDBC connection to connect to a data source that doesn't have a defined connection for Cloud Pak for Data.
Google BigQuery    
Google Cloud Storage      
Connector Read Only Read & Write SQL Pushback Notes
Greenplum    
HDFS via Execution Engine for Hadoop     You can write to an existing data asset, but writing to a new asset isn't supported currently.
Hive via Execution Engine for Hadoop  
HTTP      
IBM Cloud Databases for MySQL    
IBM Cloud Data Engine      
IBM Watson Query  
IBM Cloud Databases for DataStax      
IBM Cloud Databases for MongoDB      
IBM Cloud Databases for PostgreSQL      
Connector Read Only Read & Write SQL Pushback Notes
Impala via Execution Engine for Hadoop      
Informix      
Looker    
MariaDB    
Microsoft Azure Blob Storage      
Microsoft Azure Cosmos DB    
Microsoft Azure Data Lake Storage  
Microsoft Azure File Storage      
Microsoft Azure SQL Database  
Microsoft SQL Server    
Connector Read Only Read & Write SQL Pushback Notes
MinIO      
MongoDB      
MySQL      
Netezza Performance Server    
OData      
Oracle    
Planning Analytics (formerly known as "IBM TM1")     Only the Replace the data set option is supported.
PostgreSQL    
Salesforce.com      
SAP ASE      
Connector Read Only Read & Write SQL Pushback Notes
SAP HANA      
SAP IQ      
SAP OData      
Snowflake    
SPSS Analytic Server      
Storage volume (formerly known as Mounted volume)   If your data contains a column or row delimiter such as a comma (,), your flow may fail when attempting to write to a storage volume. As a workaround, you can first use a Filler node to replace the delimiters.
Tableau      
Teradata   SQL pushback is supported with a custom image and ODBC driver installed. See Building custom images to install ODBC drivers.

Data files

In addition to using data from remote data sources or integrated databases, you can use data from files. You can work with data from the following types of files in SPSS Modeler.

Connector Read Only Read & Write
Avro
CSV/delimited
Excel (XLS, XLSX)
JSON
ORC
Parquet
SAS
SAV
SHP
XML

ODBC drivers

Cloud Pak for Data connections use JDBC drivers. You can also use ODBC drivers to take advantage of SQL optimization and pushback. For details about SQL pushback support, see the previous tables on this page.

The following ODBC drivers are preinstalled with SPSS Modeler:
  • SPSS Data Access Pack 8.1.1.0
  • Netezza native driver 7.2.1.10
  • Db2 native driver 11.5.4
The following ODBC drivers can be installed via a custom SPSS Modeler image. For more information, see Building custom images to install ODBC drivers.
  • SAP HANA driver (hanaclient-2.7.26-linux-x64.tar.gz)
  • Exasol driver (EXASOL_ODBC-7.1.4.tar.gz)
  • Teradata driver (TeradataToolsAndUtilitiesBase__linux_x8664.17.20.05.00-1.tar.gz)