IBM Cloud Pak® for Data Version 4.6 will reach end of support (EOS) on 31 July, 2025. For more information, see the Discontinuance of service announcement for IBM Cloud Pak for Data Version 4.X.
Upgrade to IBM Software Hub Version 5.1 before IBM Cloud Pak for Data Version 4.6 reaches end of support. For more information, see Upgrading IBM Software Hub in the IBM Software Hub Version 5.1 documentation.
Set up your system
Before you can use IBM Federated Learning, ensure that you have the required hardware, software, and dependencies.
Core requirements by role
Each entity that participates in a Federated Learning experiment must meet the requirements for their role.
Admin software requirements
Designate an admin for the Federated Learning experiment. The admin must have:
-
Access to Cloud Pak for Data with Watson Studio and Watson Machine Learning enabled.
You must install Watson Machine Learning as a component in your Cloud Pak for Data image. -
A project for assembling the global model.
Party hardware and software requirements
Each party must have a system that meets these minimum requirements.
Note: Remote parties participating in the same Federated Learning experiment can use different hardware specs and architectures, as long as they each meet the minimum requirement.
Supported architectures
- x86 64-bit
- PPC
- Mac M-series (runtime
fl-rt22.2and later only) - 4 GB memory or greater
Supported environments
- Linux
- Mac OS/Unix
- Windows
Software dependencies
- A supported Python version and a machine learning framework.
- The Watson Machine Learning Python client.
- If you are using Linux, run
pip install 'ibm-watson-machine-learning[fl-rt22.2-py3.10]'. - If you are using Mac OS with M-series CPU and Conda, download the installation script and then run
./install_fl_rt22.2_macos.sh <name for new conda environment>.
- If you are using Linux, run
Network requirements
An outbound connection from the remote party to aggregator is required. Parties can use firewalls that restrict internal connections with each other.
Data sources requirements
Data must comply with these requirements.
- Data must be in a directory or storage repository that is accessible to the party that uses them.
- Each data source for a federate model must have the same features. IBM Federated Learning supports horizontal federated learning only.
- Data must be in a readable format, but the formats can vary by data source. Suggested formats include:
- Hive
- Excel
- CSV
- XML
- Database
Parent topic: Creating a Federated Learning experiment