Data science is a rapidly evolving discipline that leverages an ever-widening array of tools and capabilities to learn and exploit. Because of such inherent complexities surrounding adoption, integration and support, the work of the data scientist can be daunting.
That complexity is one of the reasons IBM several years ago set out to bring clarity and uniformity to the otherwise disparate data discovery and analytics process. The goal: create a solution that leveraged the best capabilities available, in an integrated, collaborative platform that was easy to access and use. With it, everyone from data scientists to business analysts would be able to not only tackle the discipline, but conquer it.
Along the way, we learned a lot about the role of data scientists; their challenges, their tools of choice, and how they valued certain processes and functionalities over others.
But, first a little background. Up until the time IBM rolled out the popular Data Science Experience, if someone wanted to engage in data science, he/she would have to search the web, review components like Jupyter notebooks, or development platforms like Scala and r, big data tools like Hadoop, and much more – and then learn how to use them.
Not unexpectedly, the wide variety of tools and programs led to relatively slow adoption, challenging integration, and cumbersome support.
In addition, our research showed that once up and running, most data scientists’ workflows were often fragmented, requiring them to toggle between a variety of workspaces and tools to complete a job. For example, they might use Data Shaper to clean data, Jupyter for modeling and MatPlotLib for visualization. These tools support a linear process, but data scientists’ workflows are more cyclical — like this:
When we launched the Data Science Experience in 2016, users for the first time had a solution that integrated the most sought-after development, notebook and analytics tools, in a simple-yet-scalable web-based platform. It also enabled users to connect live with IBM for any and all support. It was a breakthrough in this fast-evolving corner of the tech industry. In 2017 we rolled out a “Local” version for organizations to install behind their firewalls in their own data centers.
In addition to the aforementioned goals, the vision for Data Science Experience was that it would accommodate the agile workflow, simplify the experience of working with data, and bring all the tools into a unified data ecosystem.
As Caroline Law, Design Lead at IBM, said, “The Data Science Experience is a beautiful manifestation of the power of user research to understand our users’ needs, challenges and motivations.”
In particular, the design team in charge of the ideation of the solution identified some of data scientists’ biggest needs, including the ability to collaborate with fellow data scientists and learn from each other; the ability to share algorithms and exchange data analysis techniques; and the ability to publish the results of their work and collaborate with peers across neighboring disciplines – people like data engineers who can help them prepare data and business analysts who can translate their insights into data-informed decisions.
The design team also wanted the experience to be easy to use and accessible for companion sites built for data engineers, system administrators and other user personas. They started work on building the ultimate data ecosystem, an environment that would intuitively connect related data functions and allow easy collaboration.
In the process of designing Data Science Experience, IBM’s San Francisco design team also developed interface frameworks that are now used and applied across IBM.
One example: Cognitive Assistance for Data Scientists (CADS) suggests, tests, and deploys machine learning models for you, so you don’t have to be an expert data scientist to build cognitive applications.
IBM Data Science Experience, in function and form, helps simplify the data science universe. Today, Data Science Experience is one of the premier data science systems available on the market, with thousands of users worldwide.
The diligence and restless pursuit for innovation of the IBM Cloud design team is paying off and its work recognized. Last fall, the Data Science Experience won the prestigious 2017 Red Dot design award. And this week it was announced that the iF International Forum Design, GmbH, has given the IBM Data Science Experience its iF Design Award 2018 in the Communications — Software Application category.
And though such recognitions are well received, it’s what the designs enable people to do that keeps us driving into the future.
A recent PwC study indicated that AI has the potential to add close to $16 trillion to the global economy by 2030, yet the technology has had an adoption rate of only 4%, according to other research reports. There seems to be no debate on the strategic value of AI to a business, so how […]
We are at an inflection point. Data Science and AI technologies are not just a continuum of old technologies. They represent a complete paradigm shift that take us from a deterministic world to a probabilistic world, increasing the potential to profoundly change human society, and to become a new engine of economic development. Around the world, […]
In this era of swelling data, the mining of insights to predict future outcomes with greater accuracy, to automate tasks, and to recommend actions based on that data is growing increasingly critical for organizations and businesses of all sizes. Such is the role of the data scientist, the profession the Harvard Business Review dubbed the […]