Andrea Braida
BigInsights 4.2 release – Q&A with Program Director Priya Krishnan
With the release of BigInsights 4.2, IBM is making powerful, self-service advanced analytics, including Apache Spark, available on its Hadoop distribution. To find out more, we chatted with Priya Krishnan, Program Director and Product Manager for BigInsights at IBM, about the new release and how its comprehensive, open, and flexible architecture makes it easier to deliver big data analytics and business applications.
Fast-tracking Business Apps with Apache Spark Submit
Up to now, we had one entry point to IBM’s Spark cloud service, which was Jupyter Notebooks. By adding Spark-submit as a new way to access the service, IBM is enabling remote programmatic access to Spark clusters on the IBM cloud, so that external applications can seamlessly execute data processing on Spark. This ability to build direct connections from any user-facing application will bring new power and intelligence to day-to-day operations.
Accelerating data science with Jupyter Notebooks and Apache Spark
The newly integrated notebooks within the IBM Analytics for Apache Spark service aim to provide analysts and data scientists with an iterative, flexible environment that supports end-to-end analysis. Melissa Rodriguez Zynda, IBM Offering Manager for Analytics Platform Services, has previously worked as a design researcher for IBM Watson Analytics and Social Media Analytics. A designer by training with a background in anthropology, Melissa explains how notebooks transform the way users interact with technologies such as Apache Spark to rapidly analyze data.
Apache Spark: Upgrade and speed up your analytics
One of the best things about Apache Spark is that it makes real-time analytics of vast unstructured datasets, such as the content of social media sites, feasible and affordable for companies of all sizes. But what are the practicalities of performing this kind of analysis? And how would you get started? Chetna Warade, Developer Advocate at IBM, is a software engineer who works in research and product development. We spoke to Chetna about a recent project to demonstrate the potential of Spark for social media analytics, which focused on the popular “Ask Me Anything” (AMA) section of the social news and entertainment site, Reddit.
BigInsights on Cloud: Making Hadoop easier to adopt with a managed services strategy
Hadoop is a powerful technology, but it’s not the easiest to get up and running, particularly for companies that don’t have any experience of big data technologies. Jim Wankowski, Technical Sales Specialist for IBM Cloud Data Services, talks us through the challenges that many businesses face when adopting technologies such as Hadoop.
BigInsights on Cloud: Where Critical Data Goes to Achieve Enterprise-Grade Security
IBM BigInsights on Cloud is an enterprise-ready Apache Hadoop distribution designed to enable analytics on larger volumes of data more cost-effectively. Earlier in this interview series, Rohan Vaidyanathan, offering manager for BigInsights on Cloud, introduced the service and explored specific use cases for big data projects. In this installment, Rohan is joined by Katherine Franklin, program director for security and compliance, to share some important security developments around BigInsights and other Cloud Data Services offerings.
IBM Analytics for Apache Spark – simple, open, integrated and powerful
IBM Analytics for Apache Spark is a highly reliable and easy-to-use managed Spark service available on IBM Bluemix. The Spark service offers new levels of flexibility and access for data professionals and line of business stakeholders to rapidly analyze large data sets and uncover new insights to transform their industries.
Can Apache Spark MLlib help you find a cab in NYC?
Clustering algorithms are one way data scientists and developers group and classify data. K-means clustering is a widely used clustering algorithm and has loads of practical applications. For example, you might want to identify neighborhoods in a certain city that share similar characteristics. The K-means clustering algorithm is supported by Apache Spark’s machine learning library, MLlib. Follow along in this video while IBM’s Dan Kikuchi demos this technology using IBM Analytics for Apache Spark, a managed Spark-as-a-service offering available on Bluemix.
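To make the idea of K-means concrete, here is a minimal sketch of the algorithm in plain Python. It is not the MLlib API and not the code from the video; the function name `kmeans`, the sample coordinates, and the starting centroids are all made up for illustration. It alternates two steps: assign each point to its nearest centroid, then move each centroid to the mean of its assigned points.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def kmeans(points, centroids, iterations=10):
    """Plain-Python K-means (Lloyd's algorithm) on tuples of floats."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = []
        for old, members in zip(centroids, clusters):
            if members:
                mean = tuple(sum(axis) / len(members) for axis in zip(*members))
                new_centroids.append(mean)
            else:
                new_centroids.append(old)  # an empty cluster keeps its centroid
        if new_centroids == centroids:
            break  # assignments have stabilized
        centroids = new_centroids
    return centroids, clusters


# Hypothetical 2-D points standing in for pickup locations (made-up values).
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
          (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
initial = [(1.0, 1.0), (9.0, 9.5)]  # assumed starting centroids

centroids, clusters = kmeans(points, initial)
print(centroids)
print(clusters)
```

In a Spark setting the same two steps would run in a distributed fashion over a much larger dataset; this sketch only shows the logic on a handful of points.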
BigInsights on Cloud: How businesses are getting insight the right way
IBM BigInsights is an enterprise-ready Hadoop distribution designed to enable analytics on larger volumes of data more cost-effectively. In part 1 of this interview, we sat down with Rohan Vaidyanathan, the offering manager for BigInsights on Cloud, to discuss major benefits of the cloud service. In this installment, we'll talk about how companies are actively using BigInsights on Cloud today to tackle their big data projects.