Pig versus Hive: Benchmarking high level query languages
Results of benchmarking studies run on small clusters of nodes
From the developerWorks archives
Date archived: January 13, 2017 | First published: May 27, 2014
This article presents benchmarking results of two benchmarking sets applied to Hive and Pig, running on Hadoop 0.14.1. In the first benchmarking study, the Apache Pig benchmark (Apache Foundation, 11/07/07) was replicated. In the second study, results were obtained by applying TPC-H benchmarks. (TPC-H is a decision support benchmark published by the Transaction Processing Performance Council, an organization founded to define global database benchmarks). The two studies showed conflicting results.
This content is no longer being updated or maintained. The full article is provided "as is" in a PDF file. Given the rapid evolution of technology, some steps and illustrations may have changed.