I just looking at it and are extremely new on Hadoop. Just in a stage to investigate and downloading code to install my first test clusters with Hadoop based on HortonWorks, IBM BigInsight and also a clean Apache version. (3 different Clusters)
My main goal with this test is to see how to backup and restore 100s of PB of storage located in a Hadoop Cluster based on HDFS and also GPFS.
My question to you all, is their anyone in this forum that have some knowledge how to setup Hadoop together with GPFS and do you have any comments or tips and tricks for me before I start?
For all other who are interested of the result or just are very geeky please feel free to join me on Twitter (@IssenSvensson) / DW-Forum.
I hope to start very small before New Year and grow the information during the 1st half year 2014 and maybe have a short summary at IBM Edge in June.
Thanks for your valuable information