Apache Hadoop is an open source Java software framework for running data-intensive applications on large clusters of commodity hardware, which is heavily invested in and used by Yahoo. Pig is a platform on top of Hadoop that includes a high-level language for expressing data analysis programs in a simple manner. Hadoop and Pig are used in Yahoo for different tasks, from helping create the various indexes for web search to multi-language entity recognition, handling levels of petabytes of info and tens of thousands of jobs per week.
In this talk we will offer an introduction to the MapReduce model, Hadoop and Pig and how you can leverage them to process big data with a small cost.
Registration
Participation is free for registered delegates. You can register here: http://skillsmatter.com/event/os-mobile-server/introduction-to-data-processing-with-hadoop-and-pig/rl-333
Comments