How an integrated data science platform can transform your ability to make productive use of big data Leverage an integrated data and analytics toolset that makes data science smooth and seamless Unlock self-service big data analytics for users of all skill levels – from data scientists to citizen analysts Get a new end-to-end data science workflow up and running within a day
We hope you have been having a great experience discovering, cataloguing and governing data with IBM Data Catalog as part of IBM Watson Data Platform. We’d like to inform you that the Data Catalog service is now generally available (GA), and all Beta plan instances will be retired on January 31, 2018.
IBM Watson Data Platform, a platform designed to make data discoverable and accessible to data professionals around the world. Central to this is our new IBM Data Catalog offering, which enables companies to build a 360 view of all information, making data accessible for self-service analytics and data science initiatives.
When we talk to data scientists, we hear the same sad story again and again. They tell us how their organization fell in love with the idea of building a data lake as a single platform for self-service data science. How they were wooed and won by a vendor with a solution that promised much, but delivered little. How their vision of a data lake as a clear source of business insight has turned into a stagnant swamp—a dumping ground where data goes to die.
Ten years ago, Chief Data Officers (CDOs) were a rarity. Large corporations such as Visa, Capital One and Yahoo! led the way in appointing CDOs, but the job title had yet to become mainstream. Then the global financial crisis of 2007-2008 hit. Organizations heard the alarm bells ring, and CDOs – suddenly in high demand – were asked to help align operations with a raft of new regulatory requirements around data governance and reporting.
Built on Watson Data Platform, IBM Data Catalog is IBM’s next-generation, cloud-based enterprise data catalog. It promises to provide a central solution where users can catalog, govern and discover information assets, and it is designed to slash the time spent searching for and hesitating over sharing data, so that you can focus on extracting business value from your data assets.
Data governance is rarely seen as a glamorous topic, and even the mere mention of the ‘G’ word often inspires groans and yawns from non-specialists. But are they missing a trick? It’s possible that the failure to appreciate data governance comes from a lack of understanding about the value it can deliver, and just how important it is to future success. Today, we’re going to attempt to address that gap in understanding. First, let’s define our terms: by data governance, we’re referring to the overall management of the availability, usability, integrity, and security of the data employed in an enterprise. A sound data governance program includes a defined set of procedures, a plan to execute those procedures, and people who are responsible for putting that plan into action. This might sound like a lot of work without much payoff—but the truth is that data governance plays a key role in ensuring that data is used to its full potential.
IBM’s aim with Watson Data Platform is to make data accessible for anyone who uses it. An integral part of Watson Data Platform will be a new intelligent asset catalog, IBM Data Catalog, a solution underpinned by a central repository of metadata describing all the information managed by the platform. Unlike many other catalog solutions on the market, the intelligent asset catalog will also offer full end-to-end capabilities around data lifecycle and governance.
The Entity Analytics beta capability of Bluemix Data Connect will be switched off on Friday 14th April. During the beta program we gained a lot of useful feedback as to how these Entity components provides an important set of features to power self service analytics.