
Tag Archives: pig



Joins Two Tables

Retail_invoice = LOAD '/Retail_invoice_hdfs' USING PigStorage('\t') as (uniq_idi:chararray, InvoiceNo:chararray, StockCode:chararray, Description:chararray, Quantity:INT);
DESCRIBE Retail_invoice;
Retail_Customer = LOAD '/Retail_Customer_hdfs' USING PigStorage('\t') as (uniq_idc:chararray, InvoiceDate:chararray, UnitPrice:INT, CustomerID:chararray, Country:chararray);
DESCRIBE Retail_Customer;

Left Outer Join

Left_join = JOIN Retail_invoice BY uniq_idi LEFT OUTER, Retail_Customer BY uniq_idc;
DESCRIBE Left_join;
DUMP Left_join;

Right Outer Join …


301.4.6-Filter & Sorting

Filter and Sorting

Filter: The basic syntax of filtering is the FILTER operator, then the name of the relation to be filtered, followed by BY and the condition. Filtering rows: filter on a numerical variable. First we will see how to filter on a numerical variable. Retail_Customer_F1 = FILTER Retail_Customer_pig …
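As a minimal sketch of the filter syntax described above, the script below filters on a numerical column and then sorts the result. It is not the post's own example: the aliases High_price and Sorted and the threshold 10 are hypothetical, and the path and schema are assumed to match the Retail_Customer load from the joins post.

```pig
-- Assumption: same file and schema as the Retail_Customer load in the joins post.
Retail_Customer_pig = LOAD '/Retail_Customer_hdfs' USING PigStorage('\t')
    AS (uniq_idc:chararray, InvoiceDate:chararray, UnitPrice:int,
        CustomerID:chararray, Country:chararray);

-- FILTER <relation> BY <condition>; the threshold 10 is an arbitrary illustration.
High_price = FILTER Retail_Customer_pig BY UnitPrice > 10;

-- ORDER <relation> BY <field> [ASC|DESC] sorts the filtered rows.
Sorted = ORDER High_price BY UnitPrice DESC;

DUMP Sorted;
```

Conditions can be combined with AND, OR and NOT, for example `BY UnitPrice > 10 AND Country == 'France'` (the country value is again only an illustration).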


301.4.5-Group By

Group By

Group by in Pig: In this section we look at grouping in Pig. Grouping is very important because most Pig functions take a bag as their input parameter. Grouping before using functions: Most of the built-in functions in Pig take …
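To illustrate why grouping comes before aggregation, here is a small sketch, assuming the Retail_invoice schema from the joins post. GROUP produces one row per key with a bag of matching tuples, and functions such as SUM and COUNT then operate on that bag. The aliases By_stock and Qty_per_stock and the output field names are hypothetical.

```pig
-- Assumption: same file and schema as the Retail_invoice load in the joins post.
Retail_invoice = LOAD '/Retail_invoice_hdfs' USING PigStorage('\t')
    AS (uniq_idi:chararray, InvoiceNo:chararray, StockCode:chararray,
        Description:chararray, Quantity:int);

-- Each output row is (group, {bag of Retail_invoice tuples with that StockCode}).
By_stock = GROUP Retail_invoice BY StockCode;
DESCRIBE By_stock;

-- SUM and COUNT take the bag as input, which is why the GROUP step is needed first.
Qty_per_stock = FOREACH By_stock GENERATE
    group AS StockCode,
    SUM(Retail_invoice.Quantity) AS total_qty,
    COUNT(Retail_invoice) AS n_rows;

DUMP Qty_per_stock;
```

DESCRIBE on the grouped relation is a quick way to see the bag that the aggregate functions receive.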




Functions: In this section we talk about functions in Pig. Functions are a very important concept: while doing analysis we might use several kinds, such as sum, count, average, summary functions, numerical functions, string functions, and so on. Writing the MapReduce code for each one of them …


301.4.1-Pig Introduction

Pig Introduction

In this session we are going to learn the basics of Pig, such as what Pig is, Pig architecture, Pig Latin scripts, basic Pig operations, loading data into Pig, group by, filtering, sorting, functions in Pig, joins in Pig …
