What is Pig and PigLatin?

Pig™ was initially developed at Yahoo! to allow people using Apache Hadoop® to focus more on analyzing large data sets and spend less time having to write mapper and reducer programs. Like actual pigs, who eat almost anything, the Pig programming language is designed to handle any kind of data—hence the name!

Apache® Pig™ is made up of two components: the first is the language itself, which is called PigLatin (yes, people naming various Hadoop projects do tend to have a sense of humor associated with their naming conventions), and the second is a runtime environment where PigLatin programs are executed. Think of the relationship between a Java Virtual Machine (JVM) and a Java application. In this section, we’ll just refer to the whole entity as Pig.

Related products or solutions

IBM BigInsights screenshot

IBM BigInsights

Spark and Hadoop for the Open Enterprise which can be used to scale analytics quickly and easily. Available on-premises, on-cloud, and integrated with other systems in use today.

Learn more

 Read the data sheet



The Data Warehouse Evolved: A Foundation for Analytical Excellence

ReExplore a Best-in-Class approach to data management and how companies are prioritizing data technologies to drive growth and efficiency.


Understanding Big Data Beyond the Hype

Read this practical introduction to the next generation of data architectures that introduces the role of the cloud and NoSQL technologies and discusses the practicalities of security, privacy and governance.


Ambari Pig View in Kerberos Enabled Clusters

This blog post provides the necessary steps involved to successfully configure Pig View in IOP 4.2 release.