Pig versus Hive: Benchmarking high level query languages

Results of benchmarking studies run on small clusters of nodes

From the developerWorks archives

Benjamin Jakobus and Dr. Peter McBrien

Date archived: January 13, 2017 | First published: May 27, 2014

This article presents the results of two benchmark suites applied to Hive and Pig running on Hadoop 0.14.1. The first study replicated the Apache Pig benchmark (Apache Foundation, 11/07/07). The second study applied the TPC-H benchmarks. (TPC-H is a decision support benchmark published by the Transaction Processing Performance Council, an organization founded to define global database benchmarks.) The two studies showed conflicting results.

This content is no longer being updated or maintained. The full article is provided "as is" in a PDF file. Given the rapid evolution of technology, some steps and illustrations may have changed.
