Parallel processing of unstructured data, Part 2, Use AWS S3 as an unstructured data repository

With DB2 for Linux, UNIX and Windows

From the developerWorks archives

Steve Raspudic, Alexander Abrashkevich, and Toni Kunic

Date archived: January 12, 2017 | First published: March 13, 2014

This series explores how to process unstructured data in parallel fashion — within a machine and across a series of machines — using the power of IBM DB2® for Linux®, UNIX® and Windows® (LUW) and GPFS™ shared-nothing cluster (SNC) to provide efficient, scalable access to unstructured data through a standard SQL interface. In this article, learn to provide access to unstructured data stored on the cloud. Also see how to analyze the data in a highly parallel fashion using an SQL interface provided by DB2 LUW and a table function included in this article.

This content is no longer being updated or maintained. The full article is provided "as is" in a PDF file. Given the rapid evolution of technology, some steps and illustrations may have changed.

Zone=Information Management
ArticleTitle=Parallel processing of unstructured data, Part 2: Use AWS S3 as an unstructured data repository