Cornell University

Hints and Examples of MapReduce Programming

This is a miscellaneous collection of hints and example on MapReduce programming. It is intended for programmers with some Java experience, who are new to Hadoop.

  1. Word count. This is the example used in the MapReduce tutorial and the Quickstart guides on the Apache web site.
  2. Scale. This example shows how to manage data of varying data types.
  3. Indexer. Example used in CS/Info 4300. Assembles data needed for tf.idf weightings.

Last revised: October 19,2008