Parallel processing of unstructured data, Part 2: Use AWS S3 as an unstructured data repository
With DB2 for Linux, UNIX and Windows
From the developerWorks archives
Date archived: January 12, 2017 | First published: March 13, 2014
This series explores how to process unstructured data in parallel, both within a single machine and across a series of machines, using IBM DB2® for Linux®, UNIX® and Windows® (LUW) and a GPFS™ shared-nothing cluster (SNC) to provide efficient, scalable access to unstructured data through a standard SQL interface. In this article, learn how to provide access to unstructured data stored in the cloud, and see how to analyze that data in a highly parallel fashion using the SQL interface provided by DB2 LUW and a table function included in this article.
This content is no longer being updated or maintained. The full article is provided "as is" in a PDF file. Given the rapid evolution of technology, some steps and illustrations may have changed.