Dumping data into a Hadoop or a Hortonworks data platform alone won’t accelerate your analytics efforts. Without appropriate governance or quality, data lakes can quickly turn into unmanageable data swamps. Data users know that the data they need lives in these swamps, but without a clear data governance strategy they won’t be able to find it, trust it or use it.
A governed data lake contains clean, relevant data from structured and unstructured sources that can easily be found, accessed, managed and protected. The platform your data resides on is security-rich and reliable. Data that comes into your data lake is properly cleaned, classified and protected in timely, controlled data feeds that populate and document it with reliable information assets and metadata.