Analytics for Apache Spark users can now access and analyze data stored in the IBM Cloud Object Storage Cross Region service through a new beta capability.
IBM Cloud Object Storage offers high-capacity, cost-effective storage for analytics and other applications that is scalable, flexible and simple to use.
Apache Spark accesses IBM Cloud Object Storage data through a storage connector based on Stocator technology, which is explicitly designed for object storage and is therefore faster than legacy object storage connectors. As a user, you do not need to change or recompile your Apache Spark code.
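As a rough illustration of what connector-based access can look like, here is a minimal Python sketch. It assumes the `cos://` scheme and the Hadoop property names documented by the open-source Stocator project; the beta service may use different names, so check its documentation before relying on them. The helper functions (`stocator_cos_conf`, `cos_uri`, `apply_conf`), the service name, and the endpoint are hypothetical and exist only for this example.

```python
def stocator_cos_conf(service, endpoint, access_key, secret_key):
    """Build Hadoop settings for a Stocator-style COS connector.

    Property names are assumed from the open-source Stocator README;
    they are not confirmed for the beta service described in this post.
    """
    prefix = "fs.cos." + service
    return {
        "fs.stocator.scheme.list": "cos",
        "fs.cos.impl": "com.ibm.stocator.fs.ObjectStoreFileSystem",
        "fs.stocator.cos.impl": "com.ibm.stocator.fs.cos.COSAPIClient",
        "fs.stocator.cos.scheme": "cos",
        prefix + ".endpoint": endpoint,
        prefix + ".access.key": access_key,
        prefix + ".secret.key": secret_key,
    }


def cos_uri(bucket, service, path):
    """Compose an object URI of the assumed form cos://<bucket>.<service>/<path>."""
    return "cos://{}.{}/{}".format(bucket, service, path.lstrip("/"))


def apply_conf(hadoop_conf, settings):
    """Copy each setting onto any object exposing set(key, value)."""
    for key, value in settings.items():
        hadoop_conf.set(key, value)


# In a notebook with an existing SparkContext `sc` (not executed here):
#   settings = stocator_cos_conf("myservice", "<endpoint>", "<access key>", "<secret key>")
#   apply_conf(sc._jsc.hadoopConfiguration(), settings)
#   lines = sc.textFile(cos_uri("mybucket", "myservice", "data.csv"))
```

Only the configuration and the URI are specific to the storage service. The Spark calls that read the data stay the same, which is why existing Spark code does not need to change.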
The blog post "Access and Analyze data in IBM Cross Region Cloud Object Storage" describes how to use IBM Cloud Object Storage data with Analytics for Apache Spark on IBM Cloud and the IBM Data Science Experience (DSx).
Please reach out to us at sparksrv@us.ibm.com if you have any questions or comments. Your input is greatly appreciated!