my understanding of the ideal reference architecture for hadoop is 1 management node with 3 data nodes, with all servers connected via their own switch for a private network, with multiple ports bonded for throughput.
data and tools still need to get in and out so one of the ports on the management node should be attached to the public intranet.
data nodes will send results through the management node as well as receive data from it, so they need no public intranet access (aside from IMM ports).
is this a correct understanding?
Pinned topic do data nodes need public IPs?
Answered question This question has been answered.
Unanswered question This question has not been answered yet.
Updated on 2013-01-23T17:31:04Z at 2013-01-23T17:31:04Z by SystemAdmin
Re: do data nodes need public IPs?2013-01-22T23:36:39ZThis is the accepted answer. This is the accepted answer.Hi Mike,
Correct, data nodes should communicate over a private network.
Please see the diagram at this link: