Friday, April 18, 2014

Distributed Cache - Pig

I had been trying to use Distributed-Cache in Pig.
After a lot of trial and errors behold SUCCESS!
Lets get to the meat.

Lets go through the steps.
a)Create an Eval UDF
b)Initialize Distributed Cache using getCachedFiles()
c)Initialize the Data Structure using step b.
d)Finally apply your logic on the data.

1 comment:

  1. This program is giving me this error ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2998: Unhandled internal error.

    ReplyDelete