Supported data sources for SPSS Modeler
In SPSS Modeler, you can connect to your data no matter where it lives.
Connectors
The following table lists the data sources that you can connect to from SPSS Modeler.
For more information about SQL pushback (such as lists of nodes, CLEM expressions, and operators that support SQL pushback), see SQL optimization.
For a list of databases that support custom SQL queries to pull in data, see Data Asset node.
- Db2
- Db2 Big SQL
- Db2 for z/OS
- Db2 Warehouse
- Microsoft SQL Server
Connector | Read Only | Read & Write | SQL Pushback | Notes |
---|---|---|---|---|
Amazon RDS for MySQL | ✓ | Replace the data set option isn't supported for this connection. | ||
Amazon RDS for Oracle | ✓ | |||
Amazon RDS for PostgreSQL | ✓ | ✓ | Replace the data set option isn't supported for this connection. | |
Amazon Redshift | ✓ | ✓ | ||
Amazon S3 | ✓ | |||
Apache Cassandra | ✓ | |||
Apache Derby | ✓ | |||
Apache HDFS (formerly known as "Hortonworks HDFS") | ✓ | |||
Apache Hive | ✓ | ✓ | ||
Apache Impala | ✓ | ✓ | SPSS Modeler supports only the Username and password authentication method. | |
Box | ✓ | |||
Cloud Object Storage | ✓ | |||
Cloud Object Storage (infrastructure) | ✓ | |||
Cloudant | ✓ | |||
Cognos Analytics | ✓ | |||
Connector | Read Only | Read & Write | SQL Pushback | Notes |
DataStax Enterprise | ✓ | |||
Db2 | ✓ | ✓ | ||
Db2 Big SQL | ✓ | ✓ | ||
Db2 for i | ✓ | ✓ | ||
Db2 for z/OS | ✓ | ✓ | ||
Db2 on Cloud | ✓ | ✓ | ||
Db2 Warehouse | ✓ | ✓ | ||
Dremio | ✓ | |||
Dropbox | ✓ | |||
Exasol | ✓ | ✓ | SQL pushback is supported with a custom image and ODBC driver installed. For more information, see ODBC drivers. |
|
FTP (remote file system transfer) | ✓ | |||
Generic JDBC | ✓ | Use the Generic JDBC connection to connect to a data source that doesn't have a defined connection for watsonx.ai. | ||
Google BigQuery | ✓ | ✓ | Google BigQuery has these limitations when SQL pushback is enabled:
For more information, see Known issues and limitations for SPSS Modeler. |
|
Google Cloud Storage | ✓ | |||
Connector | Read Only | Read & Write | SQL Pushback | Notes |
Greenplum | ✓ | ✓ | ||
HDFS via Execution Engine for Hadoop | ✓ | You can write to an existing data asset, but writing to a new asset isn't currently supported. | ||
Hive via Execution Engine for Hadoop | ✓ | |||
HTTP | ✓ | |||
IBM Cloud Databases for MySQL | ✓ | |||
IBM Cloud Data Engine | ✓ | |||
IBM Cloud Databases for MongoDB | ✓ | |||
IBM Cloud Databases for PostgreSQL | ✓ | ✓ | ||
Impala via Execution Engine for Hadoop | ✓ | |||
Informix | ✓ | |||
Looker | ✓ | |||
MariaDB | ✓ | |||
Microsoft Azure Blob Storage | ✓ | |||
Microsoft Azure Cosmos DB | ✓ | |||
Connector | Read Only | Read & Write | SQL Pushback | Notes |
Microsoft Azure Databricks | ✓ | |||
Microsoft Azure Data Lake Storage | ✓ | |||
Microsoft Azure File Storage | ✓ | |||
Microsoft Azure SQL Database | ✓ | ✓ | ||
Microsoft Azure Synapse Analytics | ✓ | ✓ | ||
Microsoft SQL Server | ✓ | ✓ | ||
MinIO | ✓ | |||
MongoDB | ✓ | |||
MySQL | ✓ | |||
Netezza Performance Server | ✓ | ✓ | ||
OData | ✓ | |||
Oracle | ✓ | ✓ | ||
Planning Analytics (formerly known as "IBM TM1") | ✓ | Only the Replace the data set option is supported. | ||
Presto | ✓ | |||
Connector | Read Only | Read & Write | SQL Pushback | Notes |
PostgreSQL | ✓ | ✓ | ||
Salesforce.com | ✓ | |||
SAP ASE | ✓ | |||
SAP HANA | ✓ | ✓ | SQL pushback is supported with a custom image and ODBC driver installed. For more information, see ODBC drivers. |
|
SAP IQ | ✓ | |||
SAP OData | ✓ | |||
SingleStoreDB | ✓ | |||
Snowflake | ✓ | ✓ | ||
SPSS Analytic Server | ✓ | |||
Storage volume (formerly known as Mounted volume) | ✓ | If your data contains a
column or row delimiter such as a comma (, ), your flow might fail
when it tries to write to a storage volume. As a workaround, you can first use a
Filler node to replace the delimiters. |
||
Tableau | ✓ | |||
Teradata | ✓ | ✓ | SQL pushback is supported with a custom image and ODBC driver installed. For more information, see ODBC drivers. |
|
Vertica | ✓ |
Data files
In addition to using data from remote data sources or integrated databases, you can use data from files. You can work with data from the following types of files in SPSS Modeler.
Connector | Read Only | Read & Write | Notes |
---|---|---|---|
Avro | ✓ | ||
CSV, Delimited | ✓ |
Attention: If your .csv file
contains any malicious payloads in an input field (in formulas for example), these payloads
might be executed.
|
|
JSON | ✓ | ||
ORC | |||
Parquet | ✓ | ||
SAS | ✓ | ||
SAV (SPSS Statistics) | ✓ | ||
SHP | |||
XLS, XLSX (Excel) | ✓ | ||
XML | ✓ |
ODBC drivers
Watsonx.ai connections use JDBC drivers. You can also use ODBC drivers to take advantage of SQL optimization and pushback.
- SPSS Data Access Pack 8.1.1.0
- Netezza native driver 7.2.1.10
- Db2 native driver 11.5.4
- SAP HANA driver (hanaclient-2.7.26-linux-x64.tar.gz)
- Exasol driver (EXASOL_ODBC-7.1.4.tar.gz)
- Teradata driver (TeradataToolsAndUtilitiesBase__linux_x8664.17.20.05.00-1.tar.gz)
For more information, see Building custom images to install ODBC drivers.