Pipeline nodes
Pipeline nodes are the physical machines that host one or more pipeline processes.
On each pipeline node, you install and start the pipeline executable that runs the pipeline processes, and you configure and maintain the pipeline configuration file for all pipelines hosted on that machine. The system also writes pipeline messages to log files on the pipeline node.
Pipeline nodes connect pipeline processes to these components of the product architecture:
- Acquisition programs
  As part of the extract, transform, and load (ETL) process, acquisition programs use transports to send UMF data into pipelines for processing. The transport method that connects an acquisition program to the pipelines depends on the type of acquisition program. For example, if you use the UMF file utility as an acquisition program, you use the file transport.
- Entity database
  The entity database contains entity information. Pipelines access entity information while processing incoming records for entity and relationship resolution. The pipeline node must have the appropriate database client installed and configured so that the pipelines can access the entity database.
- Queues
  If your system uses queues as a transport method to send data to the pipelines for processing, you must install and configure the appropriate message queuing software on each pipeline node.
- Address hygiene servers
  If your system uses third-party address hygiene products for additional address cleansing, each pipeline node must be configured to connect to the address hygiene servers.
- Web services
  You must use an HTTP transport to connect the pipeline processes on the pipeline node to the Web services.
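
The dependencies listed above can be summarized as a per-node setup checklist. The following Python sketch is illustrative only and is not part of the product: the `PipelineNodePlan` class, its field names, and the `node_prerequisites` function are hypothetical names chosen for this example. It assumes that every node needs the pipeline executable, the configuration file, and a database client, and that the remaining items depend on which components the node connects to.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PipelineNodePlan:
    """Hypothetical description of what one pipeline node connects to."""
    uses_queue_transport: bool = False
    uses_address_hygiene: bool = False
    uses_web_services: bool = False


def node_prerequisites(plan: PipelineNodePlan) -> List[str]:
    """Return the setup items that the list above implies for this node."""
    # Assumed to apply to every pipeline node: the executable, its
    # configuration file, and a client for the entity database.
    items = [
        "pipeline executable",
        "pipeline configuration file",
        "entity database client",
    ]
    # Conditional items, one per optional component described above.
    if plan.uses_queue_transport:
        items.append("message queuing software")
    if plan.uses_address_hygiene:
        items.append("connection to address hygiene servers")
    if plan.uses_web_services:
        items.append("HTTP transport for Web services")
    return items


if __name__ == "__main__":
    plan = PipelineNodePlan(uses_queue_transport=True, uses_web_services=True)
    for item in node_prerequisites(plan):
        print(item)
```

A checklist like this might be useful when you plan several pipeline nodes with different roles, because it makes explicit which software must be present on each machine before you start the pipelines.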